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ROLE DIFFERENTIATION, EMPATHY, AND NEUROSIS 
IN URBAN MIGRANTS AND LOWER-CLASS RESIDENTS 
OF SANTIAGO, CHILE * 


1 


WALTER G. STEPHAN ? anp COOKIE STEPHAN 


University of Minnesota 


Interviews were conducted with 60 lower-class male residents of Santiago, 

Chile. Half of the subjects were urban migrants; the other half were born in : 
Santiago. Projective measures of role differentiation and empathy were col- 

lected, as well as demographic information and a measure of neurosis. As 

predicted, role differentiation and empathy were positively correlated with each 

other and negatively correlated with neurosis. Urban-born subjects were higher 

in role differentiation and lower in neurosis than migrants; both groups were 

similar in empathy. Age and education seemed to be important variables in- 

fluencing these relationships. The results are discussed in terms of transition 

problems of urban migrants. 


of the self. This lack of integration would be 
manifested in symptom formation. Evidence 
for the relationship between empathy and 
adjustment is provided in the work of Gough 
(1948) and Cameron and Margaret (1951). 

Sarbin (1954) and Sarbin and Allen 
(1968) have also hypothesized that the num- 
ber of roles in an individual's behavior rep- 
ertoire is related to his social adjustment. 
This hypothesis has not been investigated 


Empathy has been defined as the capacity 
to take the role of another (Lerner, 1958). 
Sarbin (1954) and Sarbin and Allen (1968) 
have hypothesized that empathic ability is 
positively related to social adjustment. A 
similar hypothesis can be derived from the 
relationship between cognitive development 
and symptom formation discussed by Feffer 
| (1967). Feffer translated Piagets (1950) 

concept of decentering into interpersonal 


terms, defining it as an individual’s capacity 
to assume the roles of others in social situa- 
tions. This concept bears an obvious relation- 
ship to empathy. F effer feels that it is associ- 
ated with symptom formation, because lack 
of decentering in interpersonal relations may 
covary with a lack of integration of aspects 


1 This research was sponsored by the University of 
Minnesota—Universidad de Concepcion cooperative 
program funded by the Ford Foundation. 

The authors would like to thank the following 
people: Armando de Ramon of Promocion Popular, 
who granted us permission to use census data gath- 
ered by that organization ; Otto Doerr of the Con- 
cepcion Psychiatric Clinic for permitting us to inter- 
view some of his patients; Antonio Fernandez of 
the Universidad de Concepcion, who drew the pic- 
tures used in the projective tests; Cesar Torreabla, 
a graduate of the School of Sociology of the Uni- 
| versidad de Chile, who was the third rater for the 

projective measures; and Paul Rosenblatt, Penny 
Baron, David Gilfillan, and Kenneth Dion of the 
University of Minnesota, who commented on earlier 
versions of the manuscript. 

? Requests for reprints should be sent to Walter 
G. Stephan, who is now at the Department of Psy- 
chology, University of Texas, Austin, Texas 78712. 


empirically. A study by Sarbin and Rosenberg 
(1955) provides some support if Sarbin’s 
number-of-roles concept is expanded to include 
not only all of the roles an individual is capa- 
ble of enacting but also those that he is capa- 
ble of perceiving or discriminating when oth- 
ers enact them. This expansion of Sarbin’s 
number-of-roles concept seems justified by his 
own emphasis on discrimination between roles 
and by the supposition that “learning a role 
adequately requires the learning of the entire 
role set [Sarbin & Allen, 1968, p. 546]." In 
their study, Sarbin and Rosenberg found that 
“poor” role perceivers checked selí-descrip- 
tive adjectives similar to those checked by 
neurotics (for males) and hysterics (for fe- 
males). Role perception was measured by the 
number of modal responses given to verbal 
descriptions of different individuals. This 
measure may be thought of as the number of 
common roles an individual distinguished and, 
as such, provides support for the hypothesis. 
The number of different roles an individual 
perceives seems to be related to what an- 


thropologists and sociologists have called role 
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differentiation, that is, “the momentary state 
of role structure in any society, the members 
of which attach separate meanings to a de- 
terminate number of roles [Southall, 1959, p. 
20]." Southall has operationalized this con- 
cept by distinguishing five categories of roles: 
kinship and ethnic, economic, political, re- 
ligious and ritualistic, and recreational. 

In his 1959 paper, Southall also examined 
role differentiation in relation to the rural- 
urban continuum. He hypothesized that due to 
the highly differentiated nature of the urban 
environment, transition to such an environ- 
ment from a rural one would be very difficult. 
Relating this hypothesis to Sarbin's hypothe- 
sis concerning number of roles (hereafter 
dealt with in the expanded sense, i.e., role dif- 
lerentiation) and to social adjustment, it 
might be further hypothesized that urban mi- 
grants from relatively undifferentiated rural 
social systems would suffer role-learning and 
adjustment problems. Evidence supporting 
this hypothesis was found by Leacock (1961) 
in a survey of five studies of menta] disorders 
in various societies, She concluded that urban 
migrants have a higher incidence of psychosis 
and other mental diseases than rural or urban 
residents and that the incidence of all mental 
disease is negatively correlated with social 
class. If social adjustment is operationalized 
to mean absence of mental disorder, then an 
optimum test of Sarbin (1954) and Southall’s 
(1959) hypotheses would be to study lower- 
class urban migrants to a large industrialized 
(or industrializing) city from a relatively 
undifferentiated rural society. With such a 
sample and an appropriate control of lower- 
class urban residents, the following hypoth- 
eses could be formulated: (a) Role differenti- 
ation, empathy, and social adjustment (ab- 
sence of neurosis) are Positively correlated 
with one another in both groups. (b) The 
migrants are lower in role differentiation and 


higher in neurosis than the urban-born resi- 
dents. 


Sample 


The sample consisted of 60 lower-class male resis 
dents of Santiago, Chile. Half of the subjects were 
born in Santiago and had lived there all their lives; 
the other half had migrated to Santiago from rural 
areas within the past 10 years. Persons migrating 
from towns with a population larger than 1,000 were 
excluded, All subjects were presently employed or 
seeking employment. The subjects were randomly 
selected from a population of lower-class Santiago? 
residents, identified by a recent government census 
of economically marginal neighborhoods. | 
were sampled from four such neighborhoc 
tered widely over the northern section of 
The authors interviewed each subject in 


The subjects in the two samples were similar in 
average age (migrants = 33,8 years, residents = 333 
years) and income (migrants = $750. per year, resi- 
dents = $775 per year). There was a significant dií- 
lerence in average level of education (migrants — 
37 years, residents = 5,7 years; F= 975, p< 01), 
and the implications of this difference were reflected 1 
in the number of migrants and residents who were 
literate (migrants — 15, residents 227; x? — 9.6, $ 
< 01). 


Procedure 


The interview consisted of four parts, adminis- 
tered in the following order: a background question- ^ 
naire, a measurement of neurosis, and a measure: ~ 
ment of empathy. The questionnaire covered demo- 7 


graphic data such as age, education, place of birth, } 
and work and residential histories. The neurosis 
measure consisted of 11 of the questionnaire items | 
previously used by MacMillan (1957) to measure 

neurosis in rural Canada, Adaptations of this in- 3 


strument have been used in the United States (Jack- | 
son, 1962) and in two related studies in Peru (Alers, > 
1965; Kellert, Williams, Whyte, & Alberti, 1967). < 
The questions relate entirely to common neurotic 
symptoms of a psychophysiological nature, such as 
digestive problems, nightmares, cold sweats, and 
bodily aches and pains | 
The validity of the neurosis measure was tested | 
by interviewing 14 lower-class diagnosed neurotics 
of both sexes at two different psychiatric clinics, one 
in Concepcion and the other in Santiago, The mean 
Score of the diagnosed neurotics on a 33-point scale 
Was 29, The mean for the sample of 30 lower-class 
residents of Santiago was 17.1, a difference signifi- J| 
cant at the 0001 level. l 
A projective test was used to measure role differ- | 
entiation. It consisted of TAT-type drawings of 
social situations. The pictures contained from four 
lo seven people of all ages and sexes engaged in 
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various types of social activity. One drawing depicted 

ix people in a small store: an old man, two younger 
men, two children, and one middle-aged woman; 
iher drawings depicted a street scene, people seated 
it a table, and people standing and talking, Subjects 
were asked to tell interesting and complete stories 
about the pictures. All of the subjects listened to a 
tape-recorded story about a picture not in the test 
series and then made up a story about a second 
sample picture. The interviewer asked questions 
about the sample picture to aid the subjects in over- 
coming the difficulties of inventing stories. No help 
was given for pictures in the test series. The sub- 
jects’ stories were recorded on a portable tape re- 
corder. 

The protocols were scored for the number of roles 
used to describe the depicted individuals. The roles 
were classified into Southall’s (1959) five categories. 
In addition, age and sex roles were also tabulated. 
Role designates consisted almost entirely of nouns. 
The exceptions were sex roles, which are often indi- 
cated by articles and adjectives in Spanish, and some 
adjectives which are used to indicate economic roles. 
No verbs describing the behavior of the depicted 
individuals were accepted as role designates. The 
primary measure of role differentiation was the sum 
of the number of different roles that the subject 
employed in each of the four stories he told. One 
limitation in the use of this instrument as a measure 
of role differentiation is that it measures propensity 
to differentiate roles, not ability to differentiate 
roles, This limitation is somewhat attenuated by the 
common assumption underlying projective measures, 
that the subject will respond with the categories 
most salient to him. 

Empathy was also measured by use of a projec- 
tive test, The concept of Lerner's (1958) empathy 
test, which consists entirely of verbal requests, was 
combined with a technique developed by Rogers and 
Beal (1959) to get bias-free data from rural re- 
spondents, Rogers and Beal presented drawings to 
their subjects and asked them to describe the rela- 
tionships among the depicted individuals, It was 
felt that the use of drawings, in addition to verbal 
requests, aid the subject in projecting himself into 
the situation and standardize the situation into 
which the subject is asked to project himself. The 
drawings used depicted only two people. A brief 
explanation of the interaction taking place was given 
to the subject, and he was then asked to take the 
role of one of the depicted individuals and tell what 
he would do or think in that situation, and why. In 
the four pictures, the subject was asked to take the 
role of a small boy, an adolescent girl, a working-class 
male, and an upper-class male. 

The method employed to measure empathy in this 
study was similar to that developed by Feffer and 
his associates (Feffer, 1959; Feffer & Gourevitch, 
1960; Feffer & Jahelka, 1968) to study role-taking 
abilities. Feffer used a projective technique also, 
asking his subjects to describe a social scene and 
then to consider, in turn, the perspective of each 
person depicted in the scene The protocols were eval- 


uated in terms of the level of description of the 
actors and the subjects ability to adopt different 
perspectives. The latter may be considered to be a 
measure of empathy which diífers from that used in 
the present study with respect to the ease with which 
the behavior described can be compared to normative 
prescriptions, Using Feffer's technique, the perspec- 
tives adopted by the subject depend on his initial 
description of the actors. In the present study, the 
subject was asked to assume the role of a given 
actor in a defined situation. 

Three raters judged the responses to the empathy 
test. Five-point scales were used in rating three 
aspects of each protocol, The three aspects 
were the appropriateness of the thought or action 
proposed, the appropriateness of the motive pro- 
posed, and the amount of time the subject used the 
first person to refer to the individual whose role he 
was asked to assume. The appropriateness of the 
action and motive were judged against criteria estab- 
lished by the raters after discussions of various 
protocols selected at random. The extent to which 
the first person was employed was included in the 
empathy measure, because it was felt that the use 
of the first person indicated that the subject had 
entered fully into the role of the other person. 
Unfortunately, the judging of this aspect of some 
protocols presented difficulties, because the first and 
third persons have identical forms in some tenses 
in Spanish. In the limited number of cases in which 
this problem arose, it was settled by a discussion 
among the raters. After it had been determined 
whether or not the subject had intended to use the 
first person, the judgment as to the extent to which 
the first person had been employed was made by the 
raters individually. The total of these three aspects 
of all four pictures comprised the empathy index. 

The interrater reliabilities ranged from .95 to .97. 
These correlations may have been somewhat inflated 
by the prior agreement among the raters as to 
whether or not the first person had been employed 
by the subject. 


RESULTS AND DISCUSSION 


Hypothesis I 

The predicted positive relationship between 
role differentiation and empathy was sup- 
ported by the data for both migrants (7 = .42, 
p< .05) and residents (r= .43, p < .05), 
as was the correlation between role differenti- 
ation and neurosis (r= —.33, p < .05, for 
migrants; r= —.32, p < .05, for residents). 
The predicted negative correlation between 
neurosis and empathy was found for the 
sample of migrants (r — —.32, p < .05), but 
not for the residents (r — —.06, ms). It is 
possible that this last result was due to the 
dysfunctional aspects of empathy for some 
residents, in that it may have given them 


Mapa 
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TABLE 1 


CORRELATION Or AGE AND EDUCATION 
WITH DEPENDENT MEASURES 


Role 
Variable diferen- | Empathy | Neurosis 
tiation 
— 351% —.20 ,31** 
ge 


No. yr. of education* 24° | —.25° 


> Age or education has been partialed out, 
+p «.10, two-tailed 

s P <05, two-tailed. 

mep « 01, two-tailed 


greater insight into a social system where 
upward mobility is extremely difficult. 

A correlational analysis of the combined 
samples was performed, relating age and edu- 
cation of the measures of role differentiation, 
empathy, and neurosis (Table 1). The cor- 
relation. between age and education (r= 
—.24, p< .10) was partialed out of these 
relationships, The results indicated that the 
Presence of neurotic symptoms increased 
with age (r = 31, p < .05), while role dif- 
ferentiation decreased with age (r= =s, 
P < .05). There was also à nonsignificant 
tendency for empathy to decrease with age 
(r= —.20, ns). The results pertaining to 
age may be the Consequences of being less 
active socially and physically, as well as 
representing the cumulative effects of frustra- 
tion, hard work, and, in some cases, an 
inadequate diet, 

Amount of education Was positively cor- 
related with role differentiation (r = .38, 
? < .01). There were also marginally signifi- 
cant trends, indicating that education was 
Positively correlated with empathy (r = 24, 
P < .10) and negatively correlated with the 


and education, while only marginally signifi- 
cant, is similar to a finding reported by Rogers 
and Herzog (1966), who found that empathy 


to ascribed rank, led the subject to view his 
situation as one of personal failure. The 
consequence of this negative self-evalu on 
was the development of neurotic symptoms, 
In particular, he found that Status in-on- 
sistencies characterized by high levels of «du- 
cation and low levels of achieved Success led 
to symptom formation. This type of status 
inconsistency may apply to the lower-ciiss 
subjects in this study who had moderate 1. ls 
of education by Chilean standards, but held 
low-income jobs. 


Hypothesis II 


The predictions concerning differences in 
role differentiation and neurosis between sub- 
jects with rural and urban backgrounds were 
strongly confirmed. Using the sum of the 
number of different roles (in all categories) 
mentioned in each of the four stories as a 


P < 001). The residents were also less neu- 
rotic than the migrants (F — 3.95, p < 10). 

The Propensity of the residents to differen- 
tiate more roles may have been due in part 
to their greater verbal fluency. The residents 
told longer stories than the migrants (F = 
3.38, p < .10), but this was offset by the 
tendency of the migrants to use their words 


(age, sex, kinship, economic, political, reli- 
gious, and recreational) employed in cach 
Stories. The residents were 
higher than the migrants on this measure of 
role differentiation also (migrants — 11.5 total 
Categories, residents — 13.4 categories; F= 
This last result occurred 
because the residents used the categories of 
kinship and economic roles more than the 


_ Combining these findings with the correla- 
tional results, it Appears that being brought 
Up in an urban environment with the associ- 


-A 


vied opportunity for greater education facili- 

ated role differentiation, but had less effect 

n the development of empathy. This pattern 
;esults can be accounted for in terms of 

Ssrbim and Allen's (1968) statement that 

role learning is predicted on empathy. If this 
s true, then empathy could develop in the 

absence of a complex social environment, 
while the learning of many roles could not. 

The correlation between role differentiation 

and empathy could also be interpreted in 

terms of Sarbin and Allen's statement, since, 
in the absence of empathy, little role learning 
would be expected. 

, In a more general sense, both role differen- 

, tiation and empathy may be thought of as 

aspects of cognitive development. The role 

of education may be to stimulate cognitive 
development through the learning of abstract 
principles, such as writing and arithmetic. 

This type of cognitive development may gen- 

eralize to other phenomena, such as role dis- 

crimination and learning. Also, the educa- 
tional environment introduces the child to 
modern values and norms through its teachers 
and textbooks. This may account for the 
associations found between role differentia- 
tion, empathy, and education. An alternative 

explanation of these results would be that a 

selection process was operating, such that only 

the more socially perceptive children chose 
or were sent to school. 

_ Data obtained on the subjects’ work his- 
tories revealed that those earning more than 
the legal minimum wage were better educated 
(F = 5.71, p < 03) and had fewer neurotic 
symptoms (F= 4.15, p< .05) than those 
earning the minimum wage or less. The most 
likely explanation for those earning more than 
the minimum wage being better educated than 
those earning the minimum wage or less is that 
Chilean employers place a heavy emphasis on 
education, both in employing people and 
advancing them. That subjects earning the 
minimum wage or less should have been 
higher in neurosis than those earning more 
than the minimum wage may mean either 
that their neuroses made them poor em- 
ployees, or that their low income caused them 
to develop neurotic symptoms. An internal 
analysis of the data on income showed that 
the migrants earning more than the minimum 


earning more than the minimum wage to 
differentiate more roles (F = 2.15, p < 45). 
Taken together with the finding that em- 
pathy and neurosis were negatively corrélated 
for migrants, these results seem to indicate 
that empathy is important to job success and 
in avoiding neurosis for migrants. It may be 
that empathy facilitates adaptation to an 
urban social environment by aiding the indi- 
vidual in learning both his own new roles and 
the roles of others, It might be noted that 
the change to an urban environment involves 
a change in the quality of roles that must 
be learned. In a rural social system, many 
occupational and other roles are ascribed, 


while in an urban social environment similar — 


roles must be achieved. This difference in 
the manner of attaining status may be antici- 
pated to cause considerable strain on the 
urban migrant, and it may constitute a par- 
tial explanation of the finding that migrants 
were higher in neurosis than residents and 
that those attaining a modicum of achieved 
status, as measured by income, were less 
neurotic than those who were less successful. 

Additional data on the subjects’ work his- 
tories indicated that migrants had held fewer 
long-term jobs than residents. Twenty of the 
migrants had changed jobs since their arrival 
or within the last 5 years, while only 12 of 
the residents had changed jobs within the last 
5 years (x° = 4.04, p < .05). The most prob- 
able explanation of this result is that migrants, 
due to their unfamiliarity with the urban 
environment, had a harder time finding the 
more desirable long-term jobs. It is also pos- 
sible that employers discriminated against 
migrants because of their rural backgrounds 
or lack of education, Internal analyses of 
these data showed that migrants who had not 
changed jobs since their arrival or within the 
last 5 years were better educated (F = 5.48, 
p < 05) and higher in role differentiation 
(F —9.64, p « 01) than those who had 
changed jobs during that period. If long- 
term employment is assumed to be a desired 
goal for migrants, these results reinforce the 
suggestion that education is crucial to work 


pP oon wmm a ee EY "TCU P SM 
6 WALTER G. STEPHAN AND COOKIE STEPHAN 


success. The result regarding role differenti- 
ation strengthens the interpretation that role 
differentiation plays an important part in the 
migrants' success. 


CONCLUSION 


The prevalence of urban migration in many 
areas of the world makes urban migrants a 
particularly important study group. Fully 
one-third of the population of Santiago con- 
sists of migrants (Herrick, 1965), many of 
whom live in slums similar to the barriadas 
of Lima and the favelas of Rio de Janeiro. 
The results of the present study indicate 
that migrants íace a considerable task of 
role learning and differentiation upon entering 
the urban environment. It also demonstrates 
that migrants may be as well equipped to 
learn these roles as their urban-born counter- 
parts, at least in terms of empathic ability. 
The difference between lower-class migrants 
and residents with respect to long-term em- 
ployment and differences on the neurosis mea- 
sure attests to the difficulties that migrants 
suffer in adapting to the urban environment. 
It should be added, however, that the mi- 
grants in the present study almost invariably 
preferred the new environment, Only one of 
the migrants sampled expressed a desire to 
return to rural life, while the rest preferred 
the city. Finally, the results give some 
indication that job success for migrants 
depends on empathy, role differentiation, and 
education, 
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SYMBOLIC CODING PROCESSES IN OBSERVATIONAL 
LEARNING ' 


MARVIN S. GERST * 
z Stanford University 


' This experiment tested the hypothesis that symbolic coding of modeling stimuli 
enhances observational learning, and that different types of codes are differ- 
entially effective over time and with modeled responses varying in verbaliza- 
bility. Subjects observed a filmed model perform complex motor responses, 
each of which they either described in concrete terms, reactivated imaginally, 
or coded in the form of summary labels which encompassed the essential ele- 
ments, All three coding operations facilitated immediate reproduction of mod- 
eled responses, Subjects employing imaginal and summary label codes achieved 
higher matching scores than those in the concrete verbal description group who 
exceeded the level of observational learning shown by control subjects. The 
form of memory code did not interact with verbalizability of modeling stimuli, 

M but, as predicted, summary labeling proved superior for retention of modeled 
responses. In the delayed test of behavioral reproduction, the subjects in the 
labeling condition were able to perform more modeled behavior than the 
subjects in the other treatment conditions, which did not differ from each ; 
other. The influence of symbolic representation on modeling was further E 
revealed by evidence of a significant relationship between retention of summary j 
codes and delayed reproduction of modeled responses, 


It is increasingly evident from research on 
observational learning that modeling serves as 
one of the principal modes of acquiring new 
patterns of behavior. Although the phenom- 
enon has been well documented empirically, 
the mechanisms governing observational learn- 
ing have not been extensively investigated. In 
the theory recently proposed by Bandura 
(1969), observational learning is conceptual- 
ized as a multiprocess phenomenon encom- 
passing (a) attentional processes that regulate 
sensory registration of modeling stimuli, (5) 
retention processes that are influenced by 
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rehearsal operations and symbolic coding of 
modeled events into easily remembered 
schemes, (c) motoric reproduction processes 
which are concerned with the availability of 
component responses and the utilization of 
symbolic codes in guiding behavioral repro- 
duction, and (d) incentive or motivational 
processes which determine whether or not 
acquired responses will be activated into overt 
performances. The present experiment was 
specifically designed to elucidate the role of 
symbolic coding activities in learning through 
observation of modeled behavior. 

Under conditions where observers do not 
overtly perform matching responses during 
the acquisition period, modeled responses can 
be acquired only in representational forms. In 
the aforementioned theory, observational 
learning entails two representational systems, 
an imaginal and a verbal one. It is assumed. 
that during and after exposure to modeling 
stimuli, observers transform these events into 
symbolic forms and organize the essential 
elements into familiar and more easily re- 
membered schemes. After modeled events have 
been coded into images or their verbal equiv- 
alents for memory representation, they func- 
tion as mediators for later response retrieval 
and reproduction. 


The influence of symbolic representation in 
observational learning was shown in a study 
by Bandura, Grusec, and Menlove (1966), 
who found that children who generated 
verbal equivalents of the modeled responses 
during presentation later reproduced more 
matching responses than children who at- 
tentively observed the modeled displays with- 
out verbalization; the latter group, in turn, 
achieved a higher level of observational 
learning than those who engaged in symbolic 
activities designed to prevent implicit verbal 
coding of modeled behavior. 

The above study provides some evidence 
that symbolic coding can enhance observa- 
tional learning. However, it remains to be 
determined whether different coding systems 
vary in their relative efficacy. The utility of 
different coding operations may also depend 
partly upon the characteristics of the modeled 
responses. Verbal coding may be optimal for 
facilitating acquisition and retention of mod- 
eled stimuli which can be easily described, 
whereas responses difficult to describe verbally 
may require a nonverbal coding operation, 
such as visualization. Moreover, symbolic 
codes which are relatively easy to remember 
and to retrieve should be more serviceable 
guides for delayed reproduction of modeled 
behavior than codes that are difficult to retain. 

The role played by symbolic activities in 
both learning and memory has been the sub- 
ject of increasing experimentation. Miller 
(1956), who drew attention to coding proc- 
esses some years ago, proposed the unitization 
hypothesis, according to which individual 
items are categorized or organized into larger 
coherent units or “chunks.” He suggested that 
the process of chunking or recoding into 
fewer but more informationally rich categories 
is primarily verbal. Symbolic language allows 
one to bring to bear on any set of stimuli a 
coding system which is already well learned 
and organized into a very large number of 
association networks. This system allows most 
information to be coded into a mode which is 
easy to store and retrieve. Depending on the 
nature of the stimuli and the necessity of 
veridical communication, the coding operation 
may be a process shared by others (e.g., con- 
verting binary numbers into base 10 numbers) 
or idiosyncratic to the person, developed 
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irom his past experience, as when subjectivel 
meaningful but socially uninformative n 
are attached to a set of shapes. Miller als 
suggested that the coding process need not bt 
exclusively verbal. It is possible that image! 
may also serve as a representational device. 
Numerous investigations have shown that 
people do, in fact, reorganize stimuli inte 
larger inclusive categories and transform 
them into symbolic systems that permit 
storage and retrieval. Thus, for example; 
tention of stimulus events is facilitated by 
organizational factors (Cofer, 1965; Lantz. 
Stefflre, 1964; Mandler, 1968; Tulvi 
1966); verbal description of stimulus ob; 


Ranken, 1963a, 1963b); and imagery cal 
substantially augment associative learn 
(Bower, 1969; Paivio, 1969). 

In the present experiment, adult subjects 
observed a filmed model perform intri 
motor responses varying in verbalizabili 
after each of which they engaged in one 
the following symbolic activities: devi 
summary labels (summary labeling) wh 
encompassed the constituent clements 
meaningful events; reinstated the modeled 
responses into vivid imagery (imaginal cod- 
ing); described the specific movement 
position elements in concrete terms (ver 
description); or performed arithmetic 
lations designed to impede symbolic coding 
the modeling stimuli (controls). The accura 
with which the subjects reproduced the m 
eled responses was measured immedia! 
after symbolic coding and again following 
delay period. The experimental conditii 
were chosen because they provide a means 
investigating the role of imaginal and ver! 
mediators in observational learning and, a 
tionally, may represent ways in which pers 
would naturalistically attempt to reme 
modeled stimuli where immediate or dela; 
motoric rehearsal is impractical or soci 
inappropriate. : 

On the assumption that easily verbali 
responses can also be easily visualized, it 
hypothesized that the concrete descrip! 


Svusortrc Cobro Processes 
However, for modeled responses which are 
ücult to describe verbally, it was predicted 
ıt subjects in the visualization condition 
ould achieve better matching performances 
an those who coded the modeling stimuli 
nto concrete descriptions of constituent ele- 
nents. 

The higher form of verbal code, in which 
observers impose subjectively meaningful and 
abbreviated linguistic labels on complex mod- 
cling stimuli, enables observers to organize 
diverse response elements into an efficient 
code which can be easily stored, retained in- 
tact over long periods, and quickly retrieved, 
Since any modeled activity can be likened to 
some meaningful object or event, however 
idiosyncratic, the effectiveness of this type of 
coding operation should also be relatively 
independent of the nature of the modeling 
stimuli. In accord with this reasoning, it was 
predicted that subjects using summary label- 
ing would achieve a higher level of observa- 
tional learning of all types of modeled re- 
sponses and that they would retain signifi- 
cantly more matching behavior than subjects 
in the other treatment conditions. 


Mzruop * 

Subjects 

The subjects were male and female college students, 
drawn írom the introductory psychology course and 
paid volunteers. A total of 72 subjects was ran- 
domly assigned to one of three experimental groups 
and the control condition (» — 18 in each group). 
There were 35 males and 37 females, with essentially 
equalized sex distribution in the diíferent groups. 


Modeling Stimuli 

The modeling stimuli that the subjects observed 
and later reproduced were motoric responses drawn 
from the manual language of the deaf (Riekehof, 
1963). Each modeled response, which consisted of 
intricate movements of the arms, hands, and fingers, 
was constructed by combining two manual words and 


3A full description of the experimental arrange- 
ments, stimuli, raw data, and instructions to the 
subjects has been deposited with the National Aux- 
iliary Publications Service. Order Document No. 
01476 from the National Auxiliary Publications Ser- 
vice of the American Society for Information Science, 
c/o CCM Information Sciences, Inc, 909 3rd Ave- 
nue, New York, New Vork 10022. Remit in ad- 
vance $5.00 for photocopies or $2.00 for microfiche 
and make checks payable to: Research and Micro- 
film Publications, Inc. 
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Approximately 100 motoric responses, 
codability, were each performed twice by a 


recorded and later rated independen 
by two judges. Each response was analyzed into a 
predetermined set of movement and position elements 
for the purpose of scoring verbal codability, The 
judges rated the degree (from 0 to 3) to which ae 
scribed elements corresponded to those contained in — 
the modeled responses. Interjudge reliability was 
high, the correlation coefficient being 88 for the — 
responses included in the formal experiment, < 
Ten motoric responses were chosen on the basis 
of their verbal codability scores. Five of these re- — 
sponses, referred to as the low verbal set, were rela- 
tively difficult to describe accurately. The mean 
accuracy score for this set was 40%, with a range 
of scores from 31% to 54%. The remaining five — 
responses, designated as the high verbal set, were 
chosen because they could be easily described in 
verbal terms, the mean accuracy being 69%, with a 
range of 61%-80%. Illustrative examples of re- 
sponses in these two subgroups are provided in 
Figure 1. ^ 
The two sets of motoric items were then presented 
to 10 additional subjects, who were asked to repro- 
duce behaviorally each response 1 minute after 
presentation. During the 1-minute delay period, 
which was instituted to approximate procedures that 
would be employed in the actual experiment, the 
subjects were required to count backward by threes 
from a designated number. These data were col- 
lected to ensure that the response items were not 
so readily reproducible that there would be little 
possibility for the coding operations to exert an in- 
fluence, nor so difficult as to preclude any improve- — 
ment through coding. Low verbalizable responses | 
were reproduced at a mean accuracy of 38%, with 
scores ranging from 23% to 35%. The accuracy 
scores for the high verbal set ranged from 42% to 


81%, with a mean value of 66%. The modeled re- 
sponses were thus demonstrated to be sufficiently — 
difficult to permit the experimental procedures to 
exert an influence. r 
The 10 motoric responses were recorded on black 
and white 16-millimeter silent flm. The model, who 
was photographed in a sitting position with only her P 
torso visible, performed each response twice, with — 
the break signaled by a brief dropping of her hands E 
zd 
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Fic. 1. Examples of high and low verbalizable test stimuli employed. 


to her lap. Each sequence required 6-10 seconds to 
complete. 

The film was organized with four practice items 
at the beginning, followed by the high verbal and 
low verbal responses presented in an alternating se- 
quence. Each item was preceded by 4 seconds of 
black leader, followed by 14 seconds of opaque leader, 
and then another 1 second of black leader. The 
opaque leader warned the subject that the sequence 
was about to appear, and the immediately following 
black leader was used to provide contrast for the 
lighter modeled displays. 


Apparatus 


The filmed responses were projected on a wall 
approximately 15 feet from the subject. The sub- 
ject’s behavioral reproductions were recorded on a 
remotely controlled videotape recorder located out- 
side the experimental room, The videocamera, 
equipped with a telephoto lens for close-in re- 
cording, was positioned facing the subject and at 
the same distance as the movie projector, so as not 
to be intrusive. The subject's verbal productions 
were recorded simultaneously on the videotape and 
an independent audiotape recorder. 


Experimental Sequence 


Subjects were told that the experiment was con- 
cerned with the way people remember material pre- 
sented visually. They were informed that they would 
be shown a series of modeled Tesponses, one at a 
time, immediately after which they would be asked 
to engage in a designated symbolic activity for 1 
minute. At the end of this period, they were told 
that they would be asked to reproduce the modeled 
response as accurately as they could. 

The four practice items were then presented in 
the same manner as the subsequent test items, except 
that the subjects were again shown each modeled 


response aíter their behavioral reproduction was 
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completed. This procedure allowed the subjects to 
become familiar with the type of movements em- 
ployed and to gain some facility in utilizing the 
coding system that they would be required to employ 
with the experimental items. 


Coding Procedures 


Summary labeling. Subjects in this condition were 
instructed to develop summary labels which would 
encompass the constituent elements of the modeled 
movements in easily remembered form, Subjects were 
told that each movement or partial movement might 
call to mind some action of persons, animals, or 
some object which they might use to remind them of 
the movement. Further, they were told that if a 
summary label or phrase did not come to mind, 
they should try to construct one and continue to 
repeat it during the time provided and that they 
could change labels during the 1-minute coding 
period if they happened to think of one more fitting. 

Imaginal coding. Subjects in this group were told 
to close their eyes after the stimulus presentation 
and to visualize the modeled responses in vivid and 
detailed imagery. They were also instructed to con- 
tinually reinstate these images during the entire 1- 
minute coding period. In order to monitor their 
visualization activities, the subjects were instructed 
to signal, by raising a finger, each time a modeled 
response was imaginally reactivated. In addition, 
after the acquisition test for each item, they rated 
the clarity of their visualization on a 7-point rating 
scale. 

Verbal description. Subjects in this condition were 
instructed to describe aloud the specific movements 
and positions of the modeled responses as concretely 
and accurately as possible and to continue to repeat 
aloud these descriptions throughout the 1-minute 
Postexposure coding period. They were cautioned 
not to use analogies or similes—for example, “It 
looks like she is making a square” or “She seems to 
be covering a pot"—but rather to describe the re- 
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sponses in terms of exact movements and positions of 
the arms, hands, and fingers. 

Controls. During the I-minute interval following 
exposure, the subjects in the control group were 
rejuired to count, to the beat of a metronome, 
either backward or forward by 7s or 15s from num- 
bers supplied by the experimenter. This cognitive 
activity was sufficiently demanding to prevent the 
subjects from engaging in any symbolic coding, but 
it was highly dissimilar to, and thus would not 
interfere with, retention of the modeled contents. 


Immediate Reproduction 


The 10 test items were presented 1 at a time. 
Aíter a modeled response was portrayed twice, the 
projector was stopped, and the subject was in- 
structed to engage in his assigned coding activities 
for 1 minute. He then reproduced the movements 
he had seen modeled, and his performances were 
recorded on videotape. This procedure was repeated 
for each item. 


Delayed Reproduction 


After the 10 items were completed, the subjects 
were informed that they would be administered a 
different task, ostensibly designed to measure their 
capacity to retain verbally presented material. The 
material employed was an essay describing the life 
and work of Gauss, “The Prince of Mathematicians” 
(Newman, 1956). They were instructed to read -this 
essay aloud as rapidly as possible, consistent with 
comprehension, and were informed that they would 
be subsequently asked questions about the material, 

The choice of relatively complex verbal material 
and the announcement of a comprehension test were 
designed to concentrate the subject’s attention on 
this distracting task and thus prevent him from 
engaging in any symbolic rehearsal of the modeled 
responses during the delay period, In this way, the 
long-term retention of the modeled material could be 
measured without being confounded by unknown 
amounts of covert rehearsal. 

Subjects engaged in this task for 15 minutes, and 
at the end of this period, they were asked to repro- 
duce as many of the previously observed modeled 
responses as possible, within 34 minutes. This time 
period was selected because it was found that pre- 
test subjects exhausted the total number of responses 
which they could perform within this interval. Sub- 
jects were informed that they could reproduce the 
responses in any order desired, that they should 
utilize the coding strategy previously employed to 
aid their recall, and that they should use the full 
34-minute period to recall as many movements as 
possible. These performances were recorded on video- 
tape for later scoring. 


Scoring Procedure 


Each modeled response was analyzed into its com- 
ponent elements, which included the starting position 
of the fingers, hands, and arms, as well as their 
subsequent movements and final positions. For each 


TABLE 1 


MEAN PERCENTAGE OF MopeLen Responses, RE- 
PRODUCED AS A FUNCTION OF CODING ACTIVITY, 
VERBALIZABILITY OF THE MODELED RESPONSES, 
AND TiwE-REPRODUCTION TEST 


Immediate Delay 
Condition ov 
High | Low | High | Low 
ver verbal verbal verbal 
Summary labeling 78.86 | 72.18 | 43.22 | 42.82 | 59,27 
Imaginal coding 81.43 | 72.05 | 41.76 | 28.31 | 55,89 
Verbal description 74.54 | 60.19 | 38.40 | 29.61 | 50.68 
Control 61.74 | 53.80 | 35.51 | 22,98 | 43.53 
M 74.14 | 64.58 | 39.72 | 30.93 
Overall M 69,36 35.33 


response the scorable elements were noted and en- 
tered in the appropriate places on a master scoring 
grid. Each response was analyzed into its separate 
elements, and a quantitative score was assigned to 
each “Position” element in terms of whether it was 
omitted (0), a fragmentary match (1), a substantial 


Watch (2), or a perfect match (3). Elements in the 


*Movement" column were assigned scores of either 
0, 2, 4, or 6, depending upon the fidelity of the re- 
production. These movement scores are twice the 
value of the position scores, because it was consid- 
ered desirable to weight the movement and position 
scores equally. The total number of points that the 
subjects received for each response were summed 
separately for the high verbal and low verbal items 
and divided by the total points possible to provide 
the percentage of behavioral reproduction scores. 

Two judges each rated reproductions of one-half 
of the total sample, randomly drawn from the four 
groups. In addition, they rated independently 20 
protocols in common to assess interjudge reliability. 
The judges scored the immediate reproductions with 
reliability of r= 90 for the high verbal items and 
y=.91 for the low verbal items. The corresponding 
reliability coefficients for their scoring of delayed 
reproduction were .92 and .96, for the high verbal 
and low verbal items, respectively. 


RESULTS 


Data for males and females were pooled 
since tests revealed no significant differences 
on any of the dependent measures. Table 1 
presents the mean percentage of modeled re- 
sponses that were reproduced by the subjects 
in the different treatment conditions. 

Results of the analysis of variance per- 
formed on these data disclosed that the main 
effects of treatment conditions (F = 11.47, p 
< .001), time of the reproduction test (F = 
726.35, p < .001), and verbalizability (F = 
41.83, p< .001) were highly significant 
sources of variance. In addition, it was found 
that the treatment conditions had a differen- 
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Fic. 2. Total percentage of reproduction scores for 
all groups. 


tial effect, depending upon the time at which 
the test for behavioral reproduction was con- 
ducted (F = 4.49, p< .01). However, the 
hypothesized interaction between type of cod- 
ing activity and verbalizability of the modeled 
responses did not emerge as a significant 
source of variance (F — 1.57). These findings 
are discussed separately below. 


Symbolic Coding Effects 


The treatment effects are presented graphi- 
cally in Figure 2. Comparisons of pairs of 
treatment conditions by ¢ tests revealed that 
the subjects in the summary labeling condi- 
tion did not differ from the imaginal coding 
group, but the labeling group did achieve su- 
perior reproduction when compared to sub- 
jects who utilized verbal description (4 = 
4.22, p < .001) and those in the control group 
(t= 7.74, p < .001). Imaginal coding was 
likewise superior to verbal description (¢ = 
2.56, p < .02) and to the performance of the 
controls (t = 6.08, p < .001). Subjects who 
used verbal description, in turn, achieved a 
higher level of observational learning than the 
controls (t = 3.52, p < .01). 


Time of Reproduction 


All groups performed significantly more 
matching behaviors 1 minute after exposure 


than following a 15-minute interval, during 
which they could not rehearse previously 
learned modeled responses. The imm: ate 
reproduction scores for all groups com! ned 
was 69.3696, whereas the subjects were able 
to perform 35.33% of previously moucled 
behavior during the delayed test. This tempo- 
ral variable was involved in a significan: two- 
way interaction and is discussed more fully 
later. 


Verbalizabilit y 


As might be expected, all groups acquired 
and retained significantly more easily de- 
scribed modeled responses than those nol 
readily subject to verbal coding. The cogr 
of reproduction of high and low verbalizable 
responses was 56.93% and 47.76%, respec- 
tively. In the delayed recall period, all groups 
except the summary labeling group showed 
significantly less retention of low verbalizable 
than high verbalizable responses. The ¢ values 
for these comparisons were as follows 
mary labeling £< 1; verbal descripti 
t = 6.67; imaginal coding, t = 10.21; c 
t— 9.51. 

However, since the three-way interaction 
was not of statistically significant magnitude, 
the differences can only be considered as sug- 
gestive evidence that coding in summary 
labeling form enables observers to retain both 
difficult and easily verbalized material 
equally well. Thus, it would be desirable to 
examine this phenomenon further in future 
investigations. 


Interaction Effect of Coding and 
Temporal Factors 


Figure 3 graphically presents the significant 
two-way interaction between treatment condi- 
tions and the time of behavioral reproduc- 
tion, indicating that the coding operations 
employed in the present investigation are 
differentially powerful when behavioral repro- 
duction is attempted shortly after exposure 
and after a delay of some length. 

In order to determine which differences 
contributed to the significant interaction, 
pairs of conditions were compared sepa- 
rately with regard to immediate and delayed 
reproduction scores. Intergroup comparisons of 
matching performances in the immediate re- 
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juction test yielded the following pattern 

fferences: summary labeling and imaginal 

ig groups did not differ from each other 

1). Subjects in the summary labeling 

idition achieved higher reproduction scores 

an those in the verbal description group 

= 241, p < .05) and in the control group 

(t = 5.23, p < .001). Similarly, the subjects 

who employed imaginal coding were superior 

to the verbal description group (t= 2.77, 

p< 01) and the controls (¢= 5.51, p< 

.001). And finally, the verbal description group 

exceeded the controls (t = 2.82, p< .01) in 
accuracy of matching behavior. 

In the delayed reproduction test, however, 
the summary labeling group retained more of 
the observed behaviors than the subjects in 
the imaginal coding group (t = 2.36, p < 05), 
in the verbal description condition (¢ = 2.66, 
p< 02), and those in the control group 
(£— 4.07, p< .001). These latter three 
groups, on the other hand, did not differ 
significantly from one another. The magnitude 
of the decrement from immediate to delayed 
reproduction for the imaginal coding group, 
when compared to the other groups, showed 
this coding operation to be the most vulner- 
able to temporal loss (f = 4.41, p< .001). 


Integrated Reproduction of Modeled 
Responses 


It was assumed that coding of modeled 
responses into meaningful summary labels 
facilitates observational learning because such 
memory codes organize diverse elements of 
a modeled display into an easily stored form. 
Consequently, appropriate responses can be 
more easily retrieved from this type of mem- 
ory code than from representational systems 
that rely on the coding of individual ele- 
ments. This assumption was tested by tabu- 
lating the number of modeled responses that 
the subjects in the different treatment condi- 
tions were able to reproduce accurately at 
the delayed test in complete integrated form, 
that is, containing all the elements enacted in 
the correct sequence. For the different treat- 
ment conditions, the mean number of re- 
sponses reproduced in their entirety were as 
follows: summary labeling, 4.17; imaginal 
coding, 1.83; verbal description, 1.44; con- 
trol, 1.94. Analysis of variance of these 
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Fic. 3, Immediate and delayed reproduction of 
modeled responses as a function of symbolic coding 
activities. 


data revealed a significant treatment effect 
(F = 9.76, p < .01). Intergroup comparisons 
revealed that the labeling treatment was sig- 
nificantly different (p < .001) from each of 
the other conditions, which did not differ from 
each other. 

It is conceivable that the above differences 
may be partly influenced by the total amount 
of modeled behavior retained, Therefore, to 
examine this possibility, the percentage of the 
total items reproduced in the delayed test 
which were of well-integrated form was calcu- 
lated. The mean percentages for the different 
treatment conditions were as follows: sum- 
mary labeling, 63%; imaginal coding, 3766; 
verbal description, 27%; control, 42%. 
Analysis of variance performed on these 
scores yielded a highly significant difference 
(F = 67.63, p< .001). As in the previous 
analysis, the labeling condition differed sig- 
nificantly from the other groups, which did 
not differ from each other. 


Labeling at Delayed Reproduction 


During the delayed reproduction test, none 
of the subjects in the verbal description con- 
dition could recall their concrete descriptions 
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TABLE 2 
CORRELATIONS BETWEEN MEASURES OF VIVIDNESS 
Or IMAGERY AND BEHAVIORAL 
REPRODUCTION SCORES 
Imaginal coding* r 
Mean clarity vs. immediate reproduction 3l 
Mean clarity (low verbal) vs. immediate low 
mean reproduction ; | 
Mean clarity (high verbal) vs. immediate high 
mean reproduction 2 .08 
Mean clarity vs. delayed reproduction 42* 


IE 
Mean clarity (low verbal) vs. delayed low | 
mean reproduction 24 
Mean clarity (high verbal) vs. delayed high 
mean reproduction 


of the modeled responses, while all of the 
labeling subjects were able to recall their 
summary codes for at least some of the 
modeled items. To test whether retention of 
summary codes facilitates delayed behavioral 
reproduction, matching scores were compared 
between items for which summary codes were 
and were not retained. Modeled responses, for 
which the subjects could produce the sum- 
mary codes were performed at a much higher 
level of accuracy (52%) than those for which 
the code was lost (7%), a difference that is 
highly significant (£ = 11.52, p < .001). 


Clarity of Imagery and Behavioral 
Reproduction 


One would expect that clarity of imagery 
and behavioral reproduction would be posi- 
tively related, since vivid imaginal represen- 
tation provides a better guide for subsequent 
behavioral reproduction. To test this, product- 
moment correlations were computed between 
ratings of the clarity with which imagery was 
achieved and behavioral reproductions at 
immediate and delayed time periods. It can 
be seen from the correlations presented in 
Table 2 that the hypothesized relationship 
received some support. Imaginal clarity and 
accuracy of reproduction are positively cor- 
related, although only two of the coefficients 
are of statistically significant magnitude. The 
benefits of vivid imagery are most apparent 
with modeled responses that are difficult to 
code verbally (clarity versus immediate low 
reproduction) and in retention of previously 


acquired modeled responses (clarity versus 
delayed reproduction). 


Discussion 


Findings of the present experiment provide 
evidence for the hypothesis that symbolic 
coding operations play an important role in 
observational learning. This is shown by the 
fact that all three coding systems investigated 
facilitated initial behavioral reproduction, as 
compared to the performance of the subj 
who had no opportunity to code the modeling 
stimuli. 

The hypothesis that summary labeling 
would prove superior to the other codim 
systems also received qualified support. Al- 
though the summary labeling and imaginal 
conditions did not differ in immediate response 
reproduction, the subjects in the former group 
achieved a higher level of response acquisition 
than did the verbal description and control 
groups. Moreover, the subjects who coded the 
modeling stimuli into meaningful, concise 
labels retained significantly more modeled 
responses overall than the subjects in all other 
conditions. 

The relative superiority of the summary 
labeling scheme is shown even more clearly 
when matching performances are scored ac- 
cording to a stringent criterion, requiring all 
of the response elements to be reproduced in 
the exact sequence in which they were origi- 
nally modeled. Subjects who coded the model- 
ing stimuli into concise labels were able to 
reproduce approximately twice as many well- 
integrated responses in the retention test, as 
compared to the matching performances of 
the other groups. As noted previously, the 
efficacy of summary codes is most likely at- 
tributable to the fact that they permit ob- 
servers to organize relatively complex stimuli 
into meaningful, concise, and easily remem- 
bered verbal codes. Consequently, modeled re- 
sponses can be conveniently stored in this 
form and later used to reproduce the cor- 
responding responses. 

When persons represent relatively meaning- 
less patterns of stimulation in terms of mean- 
ingful activities of familiar objects, the concise 
label is also likely to evoke relevant imagery. 
Subjects may therefore utilize both verbal and 
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visual guides for later motoric reproduction. 
Ihe imagery-producing effects of stimulation 
nay also partly explain why the control sub- 
jects were able to achieve some acquisition 
and retention of modeled contents. Although 
the subjects in this group were prevented from 
coding the modeling stimuli into verbal forms, 
they nevertheless may have utilized imaginal 
residues of the initial external stimulation. It 
would be of considerable interest to study 
observational learning under conditions 
where the development of both verbal and 
imaginal representations of modeling stimuli 
is effectively impeded. 

It is noteworthy that imagery proved supe- 
rior to concrete verbalization in facilitating 
observational learning. This finding may be 
interpreted in several ways. Perhaps modeled 
responses can be more easily and frequently 
reinstated imaginally than they can be de- 
scribed verbally. To check this possibility, the 
number of times that imaginal subjects visu- 
alized the modeled responses and the verbali- 
zation subjects described the responses was 
scored. For the imaginal coding group, the 
mean number of visualizations was 11.1. For 
the verbal description group, the mean num- 
ber of descriptive repetitions was 2.8. The 
difference between these means is highly sig- 
nificant (f= 8.10, p < .001). A second pos- 
sible interpretation is that imagery reinstates 
modeled responses in a more organized and 
integrated form, whereas concrete verbaliza- 
tion restores the responses in a fragmentary 
fashion. More frequent and better organized 
representation of modeled events would aid in 
their acquisition. Whatever the explanation 
may be, the findings of this study provide 
further evidence for the influential role of 
imaginal mediators in learning processes. 

The results of this study failed to support 
the subsidiary hypothesis that the efficacy of 
different coding systems would partly depend 
on the ease with which the modeled responses 
could be described verbally. While the de- 
scriptive properties of modeled responses af- 
fected observational learning, imagery did not 
emerge as superior to nonverbalizable re- 
sponses, and, conversely, verbal encoding did 
not prove most effective with highly describ- 
able responses. Two factors may be oper- 
ative here, First, different criteria for cate- 
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gorizing responses and a wider range of values — 
may be required to determine whether these 
findings are attributable to problems of item 
selection. Second, it may be that verbaliz- 
ability and visualizability of the responses 
employed are positively correlated; that is, 
modeled responses which are easy to describe 
may also be easy to visualize. 

To test this hypothesis, the mean number 
of visualizations and the mean clarity of the 
visualizations reported by the subjects in the 
imaginal condition were compared for high 
and low verbalizable items. Results of this 
analysis show that high verbal items were 
visualized more often (X = 11.87) than low 
verbal items (X = 10.41), a difference that is 
highly significant (¢ = 3.48, p < .01). A com- 
parable analysis of clarity with which the two 
sets of modeled behaviors were visualized 
similarly revealed that the subjects visualized 
the high verbal items (X = 2.80) more viv- 
idly than the low verbal items (X — 2.01). 
This difference is also highly significant 
(t = 6.13, p < .001). 

The preceding findings suggest that both 
imaginal and verbal coding could be similarly 
affected by the item properties, but on the 
basis of the operation of different processes. 
A definitive test of the predicted interaction 
effect between item verbalizability and type 
of coding system would, therefore, require the 
use of high verbalizable items that evoke weak 
imagery and low verbalizable items that 
produce high and vivid imagery. 

The hypothesis that the summary labeling 
code would prove to be superior for memory 
reproduction was partially corroborated. How- 
ever, the subjects in all groups displayed a 
substantial loss of modeled responses after a 
period in which their attention was focused 
on entirely different activities. This response 
decrement probably reflects the operation of 
at least three factors. First, the modeled re- 
sponses were relatively meaningless, which 
would hinder both acquisition and retention. 
Second, the different responses were composed 
of various combinations of a relatively small 
set of common elements. Such high common- 
ality of component responses can have dele- 
terious effects on retention processes through. 
associative interference. In the acquisition 
phases, of course, associative interference was 
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minimal because the responses were repro- 
duced one at a time in sequential modeling 
trials, whereas in the delayed test the various 
responses could be easily confused. Finally, 
the subjects were prevented from rehearsing 
what they had learned, while ordinarily, indi- 
viduals stabilize and strengthen their acquired 
responses by engaging in either covert or 
overt rehearsal. 

It is interesting that clarity of imagery was 
positively associated with reproduction of low 
verbalizable responses immediately following 
exposure and with total reproduction scores 
in the delayed test. These relationships pro- 
vide further suggestive evidence that the sub- 
jects in the imaginal coding condition were 
engaging in imaginal activities and that these 
activities had a functional relationship to 
observational learning. 

The findings of the present experiment also 
have bearing on nonmediational theories of 
imitative learning. The Skinnerian analysis of 
modeling phenomena (e.g., Baer & Sherman, 
1964; Baer, Peterson, & Sherman, 1967; 
Skinner, 1957) assumes that modeled behav- 
ior is acquired as a result of differential rein- 
forcement of matching responses. This analy- 
sis of modeling phenomena does not distin- 
guish between acquisition and performance, 
since it is believed that the acquisition pro- 
cess cannot be directly observed and thus is 
iruitless to investigate. In a recent paper, 
Gewirtz and Stingle (1968) asserted that 
mediational processes can only be inferred 
irom the very behavior it is purported to 
explain, and thus, one gains no new informa- 
tion from such an analysis. This criticism 
might be valid when applied to theories that 
postulate hypothetical internal constructs as 
the major determinants of overt behavior. In 
the present study, however, the symbolic 
activities were independently manipulated, 
rather than inferred from motor matching re- 
sponses which constituted the independent 
events. Results of this experiment and related 
studies discussed in the introductory section 
demonstrate that symbolic activities are not 
only manipulable, but they can exercise strong 
influence over behavior. 

The findings of this study further illustrate 
the need to analyze the variables governing 
observational learning, the effects of which 
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occur before any matching responses are per- 
formed. After responses are acquired ob-erva- 
tionally, reinforcement variables may py a 
decisive role in regulating the perfor:nance 
of matching responses. An adequate theory 
of modeling must account not only íor the 
appearance of matching behavior, but also 
for the absence of matching behavior ‘0 low- 
ing exposure to .modeling stimuli, for the 
specificity of matching performances, : for 
variations in retention of modeled behavior. 
A multiprocess theory that encompasses at- 
tentional, coding and organization, anc moti- 
vational processes is perhaps best suited to 
account for modeling phenomena. 
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AN EXAMINATION OF THE “OTHER VARIABLES” 
EXPLANATION OF ATTITUDE-BEHAVIOR 
INCONSISTENCY * 


ALLAN W. WICKER # 


University of Illinois 


Verbal and overt behavioral responses of 152 church members toward their 
local churches were examined. Three verbal measures, in addition to attitude 
toward the church, were obtained: perceived consequences of behavior, evalu- 
ation of behavior, and judged influence of extraneous events on behavior. The 
four measures were used in multiple-regression equations to predict three 
behavioral criteria: Sunday worship service attendance, monetary contributions 
(both from church records), and serving in responsible roles in church activities 
(self-report). Consistent with the “other variables” position, addition of other 
verbal predictors to attitude did significantly improve predictions of the be- 
haviors. The best single predictor of the behaviors was judged influence of 
extraneous events (mean r= .36), followed by evaluation of behavior (mean r 
= .26), attitude (mean r= .22), and perceived consequences of behavior (mean 
7 = .15). When all verbal predictors were employed, approximately 75% of the 
behavior variance remained unaccounted for (mean R — 46). The following 
possible implications of attitude-behavior inconsistency are discussed: (a) re- 
focusing the attitude from the object of the behavior to the behavior itself, 
(b) assessing variables other than attitudes, and (c) directly studying overt 


behaviors and the conditions which affect behavior. 


The relationship of attitudes to overt be- 
haviors has been given relatively little atten- 
tion by social psychologists. Yet, the question 
is important for several reasons: it has impli- 
cations for how attitudes are best conceived 
(DeFleur & Westie, 1963), it directly bears 
on the validity of attitude scales (McNemar, 
1946), and it must be considered when at- 
tempting to apply social science findings to 
social problems (Deutscher, 1966). 

Wicker (1969) recently reviewed 46 stud- 
ies in which subjects’ verbal and overt be- 
havioral responses to attitude objects were 
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obtained on separate occasions. The studies 
included a wide range of verbal attitude 
measures (e.g., Thurstone, Likert, semantic 
differential, interview), overt behavioral mea- 
sures (e.g., willingness to be photographed 
with a Negro, attendance at organization 
meetings, cheating on examinations), and 
subject populations (e.g., maternity ward pa- 
tients, labor union members, oil field workers, 
college students). Measured attitudes were 
often unrelated or only slightly related to 
overt behaviors, and rarely were attitude- 
behavior correlation coefficients above .30; 
that is, only very infrequently did attitudes 
account for more than 10% of the variance 
in the overt behavioral measures. 

The repeated failures to demonstrate a 
strong consistency between attitudes and be- 
haviors have had little impact on most atti- 
tude researchers. Often they argue that atti- 
tudes are but one of several variables which 
influence behavior, and that if the other 
variables were taken into account, better be- 
havioral predictions could be made (cf. 
Weissberg, 1965). A large number of personal 
and situational variables have been sug- 
gested: other attitudes held by the individ- 
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ual; his activity level; his verbal, intellec- 
tual, and social abilities; the presence of 
other people; social norms; the range of al- 
ternative behaviors available; the degree of 
specificity of the attitude object; the occur- 
rence of unforeseen events; the expected and/ 
or actual consequences of various actions; 
and others. (See Wicker, 1969, for a more 
complete list and discussion of these varia- 
bles.) 

Uníortunately, there are very íew studies 
in which such variables have been operation- 
alized and empirically tested in attitude-be- 
havior research (Davis & Triandis, 1965; 
DeFriese & Ford, 1969; Ewens & Ehrlich, 
in press; Green, 1968; Warner & DeFleur, 
1969). One research strategy has been to 
create experimentally two or more levels of 
a variable, and then to examine the attitude- 
behavior relationship at the different levels. 
For example, in Warner and DeFleur's study, 
high- and low-prejudice subjects were given 
the alternatives of signing a statement agree- 
ing to interact with Negroes in a number of 
ways or signing a statement saying they 
would not interact with Negroes in those 
ways. The visibility of the behavior was ex- 
perimentally manipulated by telling one 
group of subjects that their signed statements 
would be published in the student newspaper, 
while another group was told that the signed 
statements would be kept confidential. The 
study was conducted in a border state in 
which the prevailing social norms were against 
integration. High-prejudice subjects behaved 
more consistently with their attitudes (signed 
statements that they would not interact with 
Negroes) when their behavior was to be made 
public than when they were to remain anony- 
mous; low-prejudice subjects behaved more 
consistently (signed agreement to interact) 
when they were to remain anonymous than 
when their behavior was to be made public. 

An alternative strategy for testing the con- 
tribution of a variable is to measure it di- 
rectly, along with attitude and overt behavior. 
Attitude and the additional variable can then 
be treated as independent variables in a 
regression equation, with the overt behavior 
as the dependent variable. Using this ap- 
proach, Ewens and Ehrlich (in press) mea- 
sured subjects’ attitudes toward Negroes and 


their expectations about whether each of a 
number of important people in the subjects’ 
lives (reference others) would approve or dis- 
approve of their allowing a televised statement 
of their racial views to be broadcast and of 
their participating in a number of civil rights 
activities. Eight weeks later, the subjects 
were asked to give a brief talk before a video- 
camera on their attitudes toward Negroes. 
After the talk, they were asked to sign re- 
lease statements permitting the videotape to 
be publicly broadcast and to sign other com- 
mitments to participate in a number of civil 
rights activities. The activities were the same 
as those used in the previous questions on 
support of reference others. Predictions of 
signed agreement to participate in civil rights 
behaviors were consistently better when based 
on both the attitude and the reference-others 
support measures than when based on atti- 
tudes alone, showing that perceived opinions 
of other people may influence the attitude- 
behavior relationship. 

The present study used the multiple- 
regression approach to assess the contribu- 
tions of attitude and three other variables 
to the prediction of members’ behaviors 
toward their church. The first additional vari- 
able was the perceived consequences of be- 
havior, that is, what a person believed would 
follow from his engaging in a given action. 
A number of writers have stressed the im- 
portance of such expectancies on overt be- 
haviors (Fishbein, 1967; Rotter, 1954; 
Vroom, 1964). The second additional pre- 
dictor was the subject’s evaluation of various 
levels of church behavior, for example, the 
degree to which he approved or disapproved 
of a member’s spending 30 hours a week in 
church activities, spending 20 hours a week, 
and so on. More specifically, the predictor 
was based not only on the level of behavior 
which the subject most approved, but also on 
the pattern of his approval and disapproval of 
other levels. The measure was derived from 
Jackson’s (1965) return potential model, and 
is closely related to his concept of intensity. 
It is also similar to Sherif and Hovland’s 
(1961) latitudes of acceptance and rejection. 
The third additional predictor represents an 
attempt to measure the influence of un- 
planned, extraneous events on subjects’ be- 
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haviors. Lewin (1951) believed that it was 
not possible to predict overt behavior from 
knowledge of the life space, because extrane- 
ous events not represented in the life space 
at the time of assessment subsequently in- 
trude to affect the behavior. The subject was 
asked to judge the influence of a number of 
plausible extraneous events (e.g, a heavy 
snowíall on Saturday night) on his church 
behavior (e.g., attending church on Sunday). 

The three variables described, in addition 
to attitudes, were used to predict three sepa- 
rate behaviors: attendance at Sunday worship 
service, monetary contributions to the church 
(both from church records), and number of 
responsible positions held in church activities 
(from selí-report). 

In addition to the major goal of assessing 
the contributions of several variables to the 
prediction of church behaviors, the present 
research had a secondary goal of examining 
the behavioral validity of several measures 
of attitude. One of the earliest attitude scales 
was Thurstone and Chave's (1929) scale of 
attitudes toward the church; it was origi- 
nally “validated” by asking subjects to indi- 
cate whether or not they attended church 
frequently and whether or not they were 
active members of a church. Subsequent 
generations of attitude scales have been 
validated by correlating them with Thurstone 
scales. For example, Osgood, Suci, and 
Tannenbaum (1957, p. 194) offer as evidence 
of the validity of the semantic differential, 
the fact that ratings of “The Church” on 
evaluative scales correlate approximately .75 
with the Thurstone and Chave scale. More 
recently, the semantic differential has become 
a standard for validation. For example, 
Fishbein’s (1963) summative procedure of 
assessing attitudes toward concepts associated 
with a target attitude object was validated 
by showing that it correlated highly (.80) 
with the semantic differential. 

The present research permits behavioral 
validation of the original Thurstone and 
Chave scale, the semantic differential scale, 
and a modification of the Fishbein procedure, 
in this case measuring the subjects’ attitudes 
toward the concepts which they associate 
with their own local church. 


WICKER 
METHOD 

Subjects 

Subjects were 152 members of three United 
Methodist Churches in Milwaukee, Wisc: » All 
were married, and their spouses were also nbers 
of the church, Eighty-seven of the subj were 
randomly selected from church members rolls 
without regard to length of membershi; their 
present church, while 65 were selected from lists 
of members who had joined two of the churches 
between 34 and 30 months before they > con- 
tacted by the researchers. The same v u and 
overt behavioral measures were obtained on -both 
groups of subjects, although there was a ear lag 


between the two data-collection periods. 


Subjects were first contacted by a personal le 
from their church pastor, who asked them do 


participate in the research. Questionnai 
administered in group sessions at the chur 
in the members! homes. The 152 subjects : 


were 


data constituted 82% of the 185 members who were 
contacted. 
Verbal Measures? 

Subjects were assured that their responses to all 
questionnaires would be kept confidential. Numbers 
were employed for questionnaire identification, 
rather than the subject’s names, The importance 
of giving one’s “own true feelings” was stressed in 
preliminary oral instructions, 

1, Attitude measures, All subjects were asked to 
rate several concepts on the following  7-point 


semantic differential evaluative scales: ; ant- 
unpleasant, meaningful-meaningless, good-bad, valu- 
able-worthless, and fair-unfair. Four filer scales 
were also used. Concepts rated were “The Church, 
[member's 
own local] Methodist Church,” and three of the 
subject's associations to the church. In rating the 
concept, The Church, the subjects were asked to 
think of the church in broad terms, not just their 
own local church. The associations were obtained 
by asking the subjects to think about their own 
church, and then to write down whatever aspects of 
the church came to mind. Typical associations Were 
"Reverend Doe," "worship service," * friendship," 
"budget problems." Each subject then ranked his 
associations in terms of their importance in deter- 
mining his attitude toward his church, Next he rated, 
on the semantic differential scales, the associations 


? Complete copies of the questionnaires and @ 
detailed description of scoring procedures have been 
deposited with the National Auxiliary Publications 
Service. Order Document No. 01477 from National 
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Sciences, Inc., 909 3rd Avenue, New York, New 
York 10022. Remit in advance $5.00 for photo- 
copies or $2.00 for microfiche and make checks 
payable to: Research and Microfilm Publications, 
Tnc. 
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he ranked fist, second, and third.* Attitude scores 
were obtained by summing across the five evaluative 


scales. 


Eighty-seven subjects (the first group of subjects 
tested) also completed the Thurstone and Chave 
(1929) scale of attitudes toward the church. 

2. Per t "sequences. oj behavior. Questions 
dealt with whether the subjects believed certain pre- 
sumably desirable consequences followed from three 
church behay regular attendance at Sunday 
worship service, regular contributions to the church, 
and taking an active part in church activities. The 
same nine qt s were asked for each behavior. 
Among th ns were the following: Does the 
behavior set wood example for one's children? 
Does the behavior provide inspiration and help in 
coping w jblems? Does the behavior make one 
more r d by the church pastor? Response 
alterns ^ question were “yes” (scored 2), 
“uncertain” 1), and “no” (scored 0). For 
each subject, a numerical score of perceived conse- 
quences of each behavior was obtained by summing 
across the nine items. Split-half reliability coeffi- 
cients for this measure, based on application of the 
Spearman-Brown prophecy formula to data from 
the first group of subjects tested (m= 87), were 
attendance, 80; contributions, .83; participation, .73. 

3. Evai of behavior. For each of the three 
kinds of behavior, six different levels of the 
behavior were described. The levels for attendance 


at worship service ranged from attending every 
Sunday (Level 1) to attending less than twice a 
month (Level 6); for contributions, contributing 1 
month’s income annually (Level 1) to contributing 
less than 1 day’s income annually (Level 6); for 
participation, spending more than 30 hours per week 
in church activities (Level 1) to spending less than 
1 hour per week (Level 6). Each level of behavior 
was rated on a 7-point scale, ranging from strongly 
approve to strongly disapprove. 

Conceptually, the score for evaluation of a given 
kind of church behavior was the most approved 
level, weighted for exclusiveness or range of accept- 

y. The following example illustrates the scoring 
Suppose a subject rated Level 1 for at- 
( iding church every Sunday), "strongly 
approve" (scored +3), Levels 2 and 3 (missing 
once or twice a year and missing five or six times 
a year), “slightly approve” (+1), and Levels 4, 5, 


4Two different scoring procedures were employed 
for the attitude measure based on associations to 
one’s own local church. One procedure weighted 
the associations in terms of their rated importance, 
with a weight of 3 assigned to the association 
rated by subject as most important in determining 
his attitude, a weight of 2 to the association rated 
second, and a weight of 1 to the association rated 
third. The other procedure was unweighted; that is, 
the three associations were treated equally. The two 
scoring procedures were so highly correlated (r= 
98) that only the results of the unweighted scoring 
procedure are reported.in.the present paper. 
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twice a month, missing more than twice a month), 
“strongly disapprove” (—3). The most approved 
level of behavior was attending 52 Sundays per 
year. The weight for this level is obtained by count- 
ing the number of steps each approval rating differs 
from the highest approval rating: two steps for 
Levels 2 and 3; six steps for Levels 4, 5, and 6. These 
steps are then summed (2+2+6+4+6+6=22) 
and multiplied by the level of behavior most 
approved (52 Sundays per year X 22) to yield the 
evaluation of behavior score (1,144) for attendance. 

Using this procedure, the higher the score for 
evaluation of behavior, the more likely the subject 
felt that only high levels of behavior were accept- 
able. Subjects who failed to differentiate in ap- 
proval of the various levels of a church behavior, 
that is, those who circled the same approval rating 
for all six levels, were eliminated from the analysis, 

4. Judged influence of extraneous events. For each 
of the three behaviors, eight hypothetical situations 
were described in which an unplanned extraneous 
event occurred, Subjects indicated the degree to 
which they believed that their church behavior 
would be affected by the event. For example, what 
would be the effect on a subject’s worship service 
attendance if he had weekend guests who did not 
regularly attend church? How would a subjects 
contributions be affected if the congregation voted 
to spend funds on a project of which he disap- 
proved? For each question, four alternatives were 
listed: (a) The event almost certainly would not 
influence the behavior (scored 4), (b) the event 
probably would not influence the behavior (scored 
3), (c) the event probably would reduce the likeli- 
hood of the behavior (scored 2), and (d) the event 
almost certainly would reduce the likelihood of the 
behavior (scored 1). Scores for the eight items were 
summed io yield a score of judged influence of 
extraneous events for each behavior. Split-half reli- 
ability coefficients for this measure, based on ap- 
plication of the Spearman-Brown prophecy formula 
to data írom the first group of subjects tested 
(n— 87), were attendance, .83; contributions, .84; 
participation, .77. 


Overt Behavioral Measures 


1. Attendance at Sunday worship service. Church 
attendance data were obtained from church records, 
based on sign-up sheets passed along the pews 
during each worship service, The number of Sundays 
attended by a subject during a 39-week period, Sep- 
tember through May, was the present attendance 
measure. For 11 subjects who had been members for 
less than 39 weeks, data were extrapolated from the 
date they joined the church. 

2. Contributions to the church. Church records 
were also the source of the contributions data. 
Members’ contributions were placed in special en- 
velopes with an identifying number, and a record 
was kept of contributions. In practice, contributions 
are made by the family, rather than_the husband 
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or wife independently; thus, the present measure 
of contributions was the amount of money given 
by the subject’s family during 1 year. Data were 
extrapolated for 16 subjects who had been members 
for a shorter period of time. Also available from 
church records were data on the members’ pledged 
contributions, that is, their signed commitment to 
give a certain amount during the forthcoming fiscal 
year. Subjects were not aware that the researchers 
would examine the church records on attendance 
and contributions. 

3. Participation in church activities. A complete 
list was compiled of all organized activities which 
occurred in each church in the 3} months (January 
through mid-April) preceding the testing sessions. The 
lists included such events as church school classes, 
women’s organization meetings, fund-raising projects, 
and parties. For each activity he attended, the sub- 
ject indicated whether he was a “member,” 
“worker,” or “leader.” These roles were defined for 
subjects as, respectively, having no particular job 
or responsibility (e.g, merely attending a business 
meeting), having a responsibility which contributed 
to the event but did not involve the main activity 
(e.g, serving refreshments after a business meeting), 
and having a position of control and authority in 
the main activity (eg., presiding over a business 
meeting). The number of activities in which the 
subject had either a worker or leader role was the 
present measure of participation in church activities. 
This criterion thus measures deeper involvement in 
church activities than mere attendance, 

4. Behavioral composite. Composite scores for be- 
havior toward the church were obtained by trans- 
lating the attendance, contributions, and participa- 
tion data into standard scores, and then summing 
across the three standard scores for each subject. 


Missing Data 


For a variety of reasons, complete data were not 
obtained on every subject for every variable. Occa- 
sionally, the subjects apparently inadvertently 
skipped pages of questionnaires, and a few failed 
to understand or follow instructions, The most 
missing data for any variable assessed by the ques- 
tionnaire were three. However, in the analyses of 
responses to the evaluation of the behavior question- 
naire, larger numbers of “missing” data were 
involved, since the subjects who failed to dis- 
criminate in their approval ratings of the levels of a 
church behavior were eliminated. The missing data 
for this questionnaire were attendance, 9; contribu- 
tions, 19; and participation, 14. 

Also, church records were incomplete for some 
subjects. For example, members of the choirs did 
not pass sign-up sheets at worship services, so 
their attendance data were unavailable. Figures on 
worship service attendance were not available for 9 
subjects, and contributions data were missing for 
6 subjects. Due to the above reasons, the ms in the 
analyses to be reported vary, depending on the 
number of subjects for whom complete data were 
available on the particular set of variables being 


examined. Disregarding the analyses involving 
Thurstone and Chave scale, the smallest m for an 
analysis was 125, or 8296 of the 152 subjects 

the study. 


RESULTS | 


The means, standard deviations, and cor- 
relation matrix of the attitude scales and 
overt behavioral measures are given in 
Table 1. The verbal measures have moder- 
ately high positive relationships with one 
another (top triangle) as would be expected - 
on the basis of earlier “validation” studies. - 
The behavioral measures also intercorrelate. 
within the same range (bottom triangle). The 
relatively low behavioral validity coefficients 
(rectangle) are similar in magnitude to most | 
other reported attitude-behavior coefficients | 
(Wicker, 1969). However, examination of the - 
coefficients suggests that the attitude scales" 
differ in their behavioral validities: the Th 
stone and Chave.scale is the best behavioral 
predictor, followed by the semantic differential | 
ratings of one’s own local church, and ratings | 
of associations. to one's own local church. 
Semantic differential ratings of The Church ~ 
did not significantly correlate with any of 
the behavioral measures. / 

It should be pointed out that the distribu- 
tions of the attitude scale scores had some- 
what restricted ranges, as can be seen from 
examination of means and standard devia- 
tions in Table 1. For example, the present - 
statistics for the Thurstone and Chave scale 
are quite similar to those obtained for divin- 
ity students (X = 2.82, SD = 96; Thurstone | 
& Chave, 1929, pp. 67, 69). However, it — 
may be noted that behavioral validity does 
not directly covary with the standard devia- 
tions of the attitude measures, For example, — 
the summed ratings of associations to one's 
own local church has the largest standard 
deviation, but is third out of four measures. 
in behavioral validity. 

The restricted range question was further 
examined as follows: Subjects whose attitude 
Scale scores were in the top and bottom 
quartiles of each attitude scale distribution 
were identified, and their mean scores on the 
behavioral measures were compared using 
two-tailed ¢ tests. It was expected that the 
subjects with the most favorable attitudes 
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TABLE 1! 


Means, STANDARD DEVIATIONS, AND CORRELATION MATRIX OF 
ATTITUDE SCALES AND ST BEHAVIORS 


—— a) TE 


Scale | s SD 


1. Semantic differential, the church 
2. Semantic differential, own church) — 2.21 88 
3. Semantic differential, 3 associa- 


tions to own church 2.09} 108 
4. Thurstone and Chave scale, the 

church 2.39 80 
5. Worship service attendance 18.39} 11.16 
6. Contributions 141.80 | 140.07 
7. Responsible positions in church 

activities 2.56) 3.54 


8. Behavioral composite 02 2.51 


60"* 


55** 
87** 


42** 
go 


Note.—ns range from 138 to 150, except for coefficients involving the Thurstone and Chave scale, for which ns range from 
81 to 87. Decimal points have been omitted in the correlation matrix. 


*p <05 
** p c 0l. 


would show higher means on the behavioral 
measures than those with the least favorable 
attitudes. This expectation was confirmed for 
the Thurstone and Chave scale on all behav- 
ioral measures: attendance (p < .01), con- 
tributions (pf < .05), participation (p < .01), 
and composite (p < .01). Three of the four 
comparisons of top and bottom quartiles of 
the semantic differential ratings of one’s own 
local church were significant and in the ex- 
pected direction: attendance (p < .01), con- 
tributions (p< .05), composite (p< .01). 
For semantic differential ratings of associa- 
tions to one’s own local church, significance 
was reached on the attendance (p< .05) 
and composite (p < .05) measures. All other 
comparisons were nonsignificant. 

Due to the fact that the Thurstone and 
Chave scale was administered to a smaller 
and somewhat different sample (longer term 
members) than the other scales, it might ap- 
pear that the higher validity of this scale 
is an artifact due to greater predictability 
of the behaviors of the longer term members. 
Available data do not support this interpreta- 
tion. Eight of nine coefficients relating the 
three semantic differential scales and the 
three overt behaviors for the longer term 
members are smaller than corresponding co- 
efficients for the total sample. Moreover, the 
other verbal predictors are not consistently 
better when applied to the longer term mem- 


bers than when applied to the total sample. 
Thus, it appears that the relatively higher. 
validity of the Thurstone and Chave scale — 
is due to some internal feature of the scale . | 
itself, for example, its content, format, length, 
or reliability. 

The major goal of the present research was 
to test the contributions of three variables; 
in addition to attitude, in predicting overt 
behaviors. Before examining the multiple- 
regression predictions of behavior, however, i 
it is necessary to show that the new pre- 
dictors are empirically distinct from one 
another and from the attitude measure. 
Using the Campbell and Fiske (1959) multi- — 
trait-multimethod approach, the three predic- ! 
tors (perceived consequences of church be- — | 

S 


havior, judged influence of extraneous events 

on church behavior, and evaluation of church X 
behavior) can be considered as “traits,” and 
the three behaviors (attendance, contribu- ^| 
tions, and participation in church activities), 

as “methods” for assessing the traits. (It — 
will be recalled that for each predictor there 1 
were three separate scores, one based on 7 
each kind of church behavior.) Table 2 shows P 
the multitrait-multimethod matrix (within 1 
the triangle) and, in addition, the coefficients 1 
relating the three predictors with the atti- : 
tude measure and the overt behavioral cri- : 
teria. The attitude measure employed in the : 
regression equations was the semantic differ- E 
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TABLE 2 
CORRELATION MATRIX OF VERBAL PREDICTORS AND OVERT BEHAVIORS 
Attendance Contributions 
Criterion and predictor 
PC | EB | ji PC EB | JI 

Attendance 2 

1. Perceived consequences of behavior (PC) 

2. Evaluation of behavior (EB) 26 

3. Judged influence of extraneous events (JI) 14 39 
Contributions 

c ós tt 12 

EB 09 20 34 16 

Jt 11 07 50 07 26 
Participation 

"PC $ 29.007 01 60 03 01 

EB —06 23 15 07 20 -02 —04 E 

H 1390132 06471 2014116 22. 08 02 
Attitude toward own church TL 31 15 05 39 i$ 01 26 
Overt behavior 

Attendance 19531 6. 42d 30. a 20 0r 16 

Contributions —03 17 29 05 37 45 —04 05 Hn 

Participation 03 18 35 03 13 18 20 06 

Note,—ns range from 125 to 152. Decimals are omitted, When » = 127, rs of .17 or larger are significant beyond the .05 level, 


and rs of .23 or larger are significant at the .01 level. Validity diagonals are indicated by italics. 


ential rating of the local church, which had 
the highest behavioral validity of the attitude 
measures obtained from all subjects. 

The coefficients in Table 2 indicate that 
the new predictor variables have both con- 
vergent and discriminant validity. The pre- 
dictors show some stability across kinds of 
church behavior, in that the validity diago- 
nals (in italics) are relatively high (median 
r= 47) and all are significantly different 
from zero. Discriminant validity is shown 
by the low correlations relating the different 
predictors within the same kind of behavior 
(median r = .16). For example, scores for 
perceived consequences of contributing regu- 
larly to the church are not highly related to 
scores for evaluation of contribution behav- 
ior (r= .16) or to scores for judged influ- 
ence of extraneous events on contributions 
(r = .07), nor are the latter two variables 
closely related (r — .26). Moreover, correla- 
tions of the new predictors with attitude are 
also rather low (median r = 47), thus meet- 
ing the desideratum of empirically distinct 
predictors. 

While a given verbal predictor should not 
be highly correlated with other verbal pre- 
dictors, it should correlate more highly with 
the overt behavior with which it deals than 


with other overt behaviors. For example, the 
Scores for perceived consequences of church 
attendance should correlate more highly with 
actual attendance than with contributions 
or participation. The last three figures in 
the first column of Table 2 show that this 
is the case (.19 > —.03 and .03). Similar 
expectations exist for the other predictors, 
and are confirmed in five of the other eight 
cases. It is also expected that the correlation 
between a given predictor and the behavior 
with which it deals will be higher than the 
correlation between that behavior and pre- 
dictors which deal with other behaviors. For 
example, the correlation between scores for 
evaluation of attendance behavior and actual 
attendance should be higher than the cor- 
relation between actual attendance and scores 
for evaluation of contribution behavior or 
evaluation of participation behavior. The 
second, fifth, and eighth figures in the at- 
tendance row for overt behaviors (Table 2) 
show that the expected relationship is ob- 
tained (.31 > .19 or .01). This general 
expectation is confirmed in five of the other 
eight cases. 

Intercorrelations of single verbal predictors 
and overt behavioral criteria are shown at 
the top of Table 3. Averaged over the three 


TABLE 3 


CORRELATION COEFFICIENTS RELATING SINGLE AND MULTIPLE VERBAL PREDICTORS 
TO OvERT BEHAVIORAL CRITERIA 


ATrTITUDE-BEHAVIOR INCONSISTENCY 


Criterion 
Predietor " : 
| Attendance Contributions Participation | R 
E a Pow mas ». satus i^ | 
Single | 
1. Attitude toward own church (A) dre 22t* | 11 2t 
2. Perceived consequences of behavior (PC) 05 20* 15% 
3. Evaluation of behavior (EB) ol 3i** 06 | 20** 
4. Judged influence of extraneous events (JI) 42** | 45** 14 36t* 
X 32** 29** iq 25 
Multiple 
A+ PC 33** | 22* | 21* 26“ 
(A) (A) | (PC) 
A+ EB 39** | 42^ | 13 336 
(A, EB) | (A EB) | 
A + JE 45** 45** 16 3g" 
(A, JD (D | 
A+ PC + EB | 40** 42"* 22 36** 
(A, EB), | (AEB) | 
A+ EB 4 JI 49** | pone 16 43** 
| gI) | «EB, JI) 
A+PC+JI | 47** 45** 23* 4p 
(Qu Un (PC) 
A--PC^A-EB- ji Sort S3 23 46** 
GD (EB, JI) | 
| 


Note. manatee whose eertain UA ETE are significant beyond the .05 level are given in parendum 


criteria, the best single predictor was judged 
influence of extraneous events, followed by 
evaluation of behavior, attitude, and per- 
ceived consequences of behavior. This order- 
ing of predictors was also obtained for two 
of the three behavioral criteria, attendance 
and contributions. Coefficients relating pre- 
dictors and participation in church activities 
not only show a different order, but were 
generally lower than for the other two criteria. 

At the bottom of Table 3 are multiple- 
correlation coefficients relating predictions 
from multiple-regression equations and actual 
behaviors. For each criterion, seven multiple- 
regression equations were computed; attitude 
was always the first independent variable. 
The effects of adding one, two, or all three 
variables to attitude are shown in Table 3. 
Overall, the best single predictor added to 
attitude was judged influence of extraneous 
events (mean R = .38); the best pair of pre- 
dictors added to attitude were judged influ- 
ence of extraneous events and evaluation of 
behavior (mean R = .43). As was true for 
the single predictor data, multiple-regression 


from 125 to 146. Mean coefficients are determined from s transformations, Decimal points have been omitted, 


predictions of participation in church aetivi- 
ties show a somewhat different pattern than 
predictions of the other two criteria, and 
they were consistently less accurate. 

The contributions of independent variables 
(predictors) in multiple-regression equations 
tend to be related to their order in the equa- 
tions, as specified by the investigator. Each 
successive variable results in a smaller in- 
crease in the predictability of the dependent 
variable (criterion). In the present study, 
attitude was specified as the first independent 
variable in the multiple-regression equations; 
this is appropriate for the question of how 
much the predictability of overt behavior is 
increased by addition of other variables to 
attitude. But it is possible that there are 
combinations of two or three independent 
variables other than attitude which are better 
predictors than combinations which always 
include attitude. In order to examine this 
question, three stepwise regression equations 
were computed; each included four verbal 
predictors and the corresponding overt behav- 
ioral criterion. In the stepwise procedure, the 
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TABLE 4 
BEHAVIOR VARIANCE 
PREDICTORS: STEPWISE REGRESSION ANALYSES 


i eee | % of F for 
Criterion and predictor variance Noe 
Attendance 
Judged influence of extraneous | 
events 19.6 | 31.82** 
Attitude 2.8 4.61* 
Evaluation of behavior I 2.77* 
Perceived consequences of | 
behavior | 8 | 134 
Total | 249 
R = .50* | 
Contributions 
Judged influence of extraneous 
events | 202 | 31.10** 
Evaluation of behavior 7.6 | 12.92** 
Attitude aA «1 
Perceived consequences of 
behavior al «1 
Total 280 | 
R = .S3** 
Participation 
Perceived consequences of 
behavior 2.8 3.73 
Evaluation of behavior 1.3 1.77 
Judged influence of extraneous 
events 11 1.40 
Attitude 3 «1 
"Total 5:5 
R-.23 


, Note.—ns for the attendance, contributions, and participa- 
tion ees are 132, 125, and 131, respectively. 


** p < 01. 


best single predictor becomes the first inde- 
pendent variable in the regression equation; 
the second independent variable selected is 
the predictor which will result in the greatest 
increase in predictability of the criterion over 
the first predictor, and so on. 

Results of the stepwise analyses are shown 
in Table 4. It is seen that attitude entered 
the equations for attendance, contributions, 
and participation, second, third, and fourth, 
respectively. Only for attendance did addi- 
tion of attitude significantly increase the per- 
centage of behavior variance accounted for. 
The stepwise analyses show that predictions 
of attendance based on judged influence of 
extraneous events, attitude, and evaluation of 
behavior were not significantly improved by 
the addition of the perceived consequences of 
behavior measure. Predictions of contribu- 
tions based on judged influence of extraneous 
events and evaluation of behavior were not 
significantly improved by addition of atti- 


ACCOUNTED FOR BY VERBAL 


tude or perceived consequences of behavior. 
The strongest predictor of participation in 
church activities was perceived consequences 
of behavior, which was of borderline sivnifi- 
cance (p < .06). 

The attitude measure employed in the 
present regression equations, semantic dif- 
ferential ratings of one’s own local church, 
had the highest behavioral validity of the 
attitude scales completed by all subjects. 
However, the Thurstone and Chave scale, 
which was administered to a subsample of 
subjects (longer term members), was higher 
in behavioral validity than the semantic dif- 
ferential measure for own church. This raises. 
the question of whether the use of the Thur- 
stone and Chave scale would result in higher 
predictor—criterion relationships and perhaps 
different conclusions about the value of the 
attitude concept, relative to other predictors. 

To answer this question, all multiple-re- 
gression analyses, including the stepwise 
analyses, were recomputed using only those 
subjects (n = 87) who had completed the 
Thurstone and Chave attitude scale. 
analyses for attendance and contrit 
resulted in multiple correlations very 
to those shown in Table 3. The grea 
crepancy between comparable analyses for 
those two criteria was .04. All coefficients 
for the participation criterion were higher 
when the Thurstone and Chave scale was em- 
ployed, although they were still consistently 
lower than those for the other two criteria. 
The multiple correlations for participation 
are as follows (compare with the third col- 
umn, bottom part of Table 3): attitude to- 
ward own church + perceived consequences 
of behavior, .29; attitude toward own church 
+ evaluation of behavior, .27; attitude to- 
ward own church + judged influence of ex- 
traneous events, .31; attitude toward own 
church + perceived consequences of behavior 
+ evaluation of behavior, 30; attitude to- 
ward own church + evaluation of behavior + 
judged influence of extraneous events, 32: 
attitude toward own church + perceived con- 
sequences of behavior + judged influence of 
extraneous events, .35; and attitude toward 
own church + perceived consequences of be- 
havior + evaluation of behavior + judged in- 
fluence of extraneous events, .35. All of the 
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"ultiple correlations are significant beyond 
he .08 level, The larger multiple correlations 
‘iting when the Thurstone and Chave 
le was employed appear to be primarily 
iue to the scale's higher correlation with the 
jarticipation criterion (r= .24). In the 
;bove regression equations, the only predictor 
with significant (p < .05) beta weights was 
attitude. Finally, it may be noted that the 
mean multiple correlations summed across 
the three criteria (last column, bottom part 
of Table 3) were from .01 to .03 larger when 
the Thurstone and Chave scale was used. 

When the Thurstone and Chave scale was 
used as the attitude measure in the stepwise 
analyses of attendance and contributions, re- 
sults were very similar to those reported in 
Table 4. In the analysis of participation, 
however, the following differences were noted: 
attitude (Thurstone & Chave scale) entered 
as the first independent variable, accounting 
for 6.5% of the variance in participation (F 
= 5.51, p< 01), and judged influence of 
extraneous events entered second, accounting 
for 3.5% of the variance (F = 3.03, p< 
05). 

The consistently poorer predictability of 
the participation criterion (number of re- 
sponsible positions held in church activities) 
could be due to several factors. The range is 
more restricted than for the other criteria; 
many subjects reported no positions of re- 
sponsibility during the 15-week period as- 
sessed. A second and related factor has to do 
with the criterion itself: to take a responsible 
position usually requires that the individual 
have some degree of ability, commit himself 
in advance, have the time, and perhaps even 
the transportation necessary to serve. The 
restrictions on the other two criteria are prob- 
ably not as great. Moreover, the length of 
time covered by the participation measure 
(15 weeks) was considerably shorter than 
the attendance (39 weeks) and contributions 
(52 weeks) measures. To summarize, for the 
durations employed, the situational threshold 
(Campbell, 1963) for taking a responsible 
job was higher than for contributing or at- 
tending: 1% of the subjects did not con- 
tribute, 6% did not attend church worship 
service, and 35% did not serve in a responsi- 
ble position. 


Discussion 


The present findings on the attitude-be- 
havior relationship are consistent with most 
other studies, in that measured attitudes 
were relatively poor predictors of overt be- - 
haviors. Across four attitude measures and 
three overt behaviors, the median amount of 
behavior variance accounted for by attitude 
responses was 4%, and the highest amount 
was 13%. 

According to DeFleur and Westie (1963), 
the repeated failure of attitudes to predict 
overt behaviors has implications for how at- 
titudes should be conceived. The most popular 
conception of attitudes, they suggested, is a 
relatively enduring, transsituational hypo- 
thetical variable which gives “both direction 
and consistency to the person’s responses [p. 
21]." It follows from this view that verbal 
expressions of attitude and overt behaviors - 
directed toward the same object should co- 
vary, since both are presumably mediated by — 
the same latent attitude. DeFleur and Westie 
labeled this expectation “the fallacy of ex- 
pected correspondence." They suggested that 
rather than conceiving of attitudes as general 
response tendencies influencing different kinds 
of responses, "attitudes appear to be most 
usefully conceptualized as specific, in the 
sense that they may be viewed as probabili- 


ties of specific forms of response to specific 2 
social objects, or specific classes of social ob- E 
jects [p. 30].” * 
Relatively low behavioral validity is not 4 
unique to attitude scales, but is common to 3 
most measured personality traits (Mischel, — 4 
1968; Vernon, 1964). Mischel (1968) sug- 3 
gested that “the phrase ‘personality coeffi- y 
cient’ might be coined to describe the corre- E 
lation between .20 and .30 which is found k 
"ui 


persistently when virtually any personality 
dimension inferred from a questionnaire is 
related to almost any conceivable external 
criterion involving responses sampled in a 
different medium—that is, not by a question- 
naire [p. 78]." On the question of whether 
people show transsituational consistencies, as 
assumed by trait theorists and many attitude 
theorists, Mischel concluded: “With the pos- 
sible exception of intelligence, highly gen- 
eralized behavioral consistencies have not 
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been demonstrated and the concept of person- 
ality traits as broad response predispositions 
is thus untenable [p. 146].” 

What are the implications of this state of 
affairs for the study of attitudes? It depends 
upon the degree to which the attitude re- 
searcher believes that his research bears on 
something beyond the verbal behavior of the 
test situation. For those who are interested 
in overt behaviors outside the laboratory, 
several possible implications can be suggested. 
All of the suggestions have in common a shift 
toward greater specificity of the response 
measures. 

One alternative is to redirect the attitude 
measure from the object of the behavior to 
the behavior itself (DeFleur & Westie, 1963; 
Fishbein, 1967). Thus, instead of measuring 
attitude toward the church, one would mea- 
sure attitudes toward attending, contribut- 
ing, and participating. Recent research has 
shown attitudes toward behavior in a Priso- 
ner’s Dilemma game to yield reasonably high 
correlations with the behavior itself (Ajzen, 
1971; Ajzen & Fishbein, 1970, 1971). 

A second alternative to the use of global 
measures of attitude is to employ other verbal 
measures which deal with the specific behavior 
of interest (DeFleur & Westie, 1963; Fish- 
bein, 1967; Wicker, 1969). The measures 
could be used instead of, or in addition to, 
attitude scales. The present study examined 
the feasibility of this approach. It was found 
that two variables, judged influence of ex- 
traneous events and evaluation of behavior, 
were generally better behavioral predictors 
than attitude, while the measure of perceived 
consequences of behavior was poorer than 
attitude. The relatively poor performance of 
this last measure is somewhat inconsistent 
with the importance placed on expectancies 
by a number of theories of behavior (e.g., 
Fishbein, 1967; Rotter, 1954; Vroom, 1964). 
However, the present study may not have pro- 
vided a good test of the expectancy notion. 
Next to attitude, the perceived consequences 
measure was the most general, in that the 
same questions were asked for each behavior. 
Moreover, the possible consequences listed 
were not obtained from church members, but 
were constructed by the investigator and an 
assistant. 


The evaluation of behavior measure, which 
was superior to attitude, could in some ways 


be construed as a measure of attitude toward 
behavior, as discussed above. Scores on eval- 
uation of behavior were based on the pattern 
of responses in such a way that the score in- 


cluded an exclusiveness or intensity com- 
ponent. The relatively high behavioral valid- 
ity of the measure raises the question of 
whether this procedure for assessing attitudes 
toward specific kinds of behavior might be 
superior to the more conventional measures 
of affect toward an object (cf. Ajzen, 1971; 
Ajzen & Fishbein, 1970, 1971). 

The best single predictor in the present 
study was based on the subjects’ judgments of 
the degree to which unplanned, extraneous 
events would influence their church behav- 
iors. The greater the reported susceptibility 
to extraneous influences, the lower the level 
of actual church behavior. This measure 
is clearly applicable to overt behaviors oc- 
curing over time outside the laboratory, but 
also should be useful in laboratory situations 
in which a series of events not controlled by 
the investigator are allowed to occur (e.g, 
group discussion, interaction on competitive 
tasks). Some measure of judged influence of 
extraneous events may prove to be useful in 
field applications of Fishbein’s (1967) exten- 
sion of Dulany’s (1968) theory of proposi- 
tional control. The theory holds that there is 
a close relationship between behavioral in- 
tentions to perform a specific act and the ac- 
tual performance of the act, and some sup- 
porting evidence for this postulated relation- 
ship has been obtained in the laboratory 
(Ajzen, 1971; Ajzen & Fishbein, 1970). 
However, Davis and Triandis (1965) found a 
relatively low relationship between personal 
intentions and overt behaviors when a group 
decision intervened. In studies in which ex- 
traneous events can intervene to disrupt the 
relationship between intentions and acts, 
judgments of the influence of such events may 
improve the prediction of behavior. 

Parenthetically, it may be noted that under 
some conditions, the assumed close relation- 
ship between behavioral intentions and overt 
behaviors obtains in nonlaboratory situations. 
In the present study the correlation between 
the subjects' pledged contributions and the 
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amounts actually given over 1 fiscal year was 
92. However, this coefficient is undoubtedly 
higher than most intention-act relationships 
outside the laboratory. The coefficient is also 
somewhat misleading, since many people con- 
tribute less than they pledge, and very few 
contribute more, so that the mean pledge is 
higher than the mean contribution. It is 
interesting to note the rather extensive pro- 
cedures employed in churches to achieve a 
high relationship between pledges and con- 
tributions. Members are asked to make real- 
istic pledges which they can afford. They sign 
their names to the pledges, and in some cases 
the pledges are obtained on visits to mem- 
bers’ homes by other members. Moreover, 
members are sent quarterly statements indi- 
cating the amounts they have pledged and 
contributed. Periodic statements of the 
church’s financial condition also appear in 
church bulletins and newsletters. And the at- 
mosphere is one in which contributions are 
represented as a moral obligation of the 
members. 

No single verbal measure in the present 
study was so strong a predictor that it could 
not be improved by addition of other predic- 
tors. This suggests that use of multiple pre- 
dictors, including attitudes in some cases, 
may be a desirable strategy for predicting be- 
havior. In this sense the “other variables” ex- 
planation of attitude-behavior inconsistency 
was supported in the present research. How- 
ever, it should also be noted that even when 
a number of verbal predictors are employed, 
a large proportion of behavior variance re- 
mains unaccounted for. For attendance and 
contributions, all four predictors accounted 
for approximately 25% of the variance, leav- 
ing 7596 to be explained; for participation, 
94% was unaccounted for. 

A third possible implication of the weak 
attitude-behavior relationship is that it may 
be desirable to abandon the attitude concept 
and other verbal predictors in favor of direct 
study of the overt behavior of interest and 
the variables which affect that behavior. 
Mischel (1968) has cogently argued the case 
for the study of situational determinants of 
overt behavior. 

Different researchers will undoubtedly 
evaluate and react to the above alternatives 
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in different ways, depending upon the nature 
of their other information, their level of in- 
volvement and identification with one or an- 
other type of research, and so on. But per- 
haps it is premature to make a firm decision 
regarding any of the alternatives at the pres- 
ent time, pending more research. Certainly 
the present study is limited in a number of 
ways: it dealt with overt behavior in only one 
domain; although based in part on a review 
of the literature, choice of attitude scales and 
other predictors was somewhat arbitrary; and 
scoring procedures for some predictors were 
arbitrary. Also, only linear relationships 
were studied. Quadratic, cubic, or other sys- 
tematic functions might better fit the data. 

Additional research is clearly needed. Fu- 
ture investigations might most profitably in- 
clude measures of social norms (cf. Davis & 
Triandis, 1965; DeFriese & Ford, 1969; 
Ewens & Ehrlich, in press; Fishbein, 1967; 
Warner & DeFleur, 1969), habits (Triandis, 
Vassiliou, & Nassiakou, 1968), and ability, 
in addition to some of the variables in the 
present study. The investigation of attitude— 
behavior relationships provides an oppor- 
tunity for innovative and creative efforts: 
unobtrusive measures can be employed (cf. 
Webb, Campbell, Schwartz, & Sechrest, 1966), 
theories can be tested in field settings (cf. 
Fishbein, 1967; McGuire, 1967), and data 
can be obtained on socially significant behav- 
iors. 
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A 2X3 design was used to explore the effects of complexity and importance 
of decision on conservatism in processing information. One hundred and twenty 
undergraduate males considered either two or three alternatives for one oí 
three problems differing in perceived importance. Upon receiving certain infor- 
mation, the subjects indicated which of the several alternative explanations for 
their problem was more or most probable. These probability statements were 
compared with "optimal" probabilities, arrived at by using Bayes's theorem. 
The more important and complex the decisions were perceived to be, the more 
conservative the subjects were when compared to optimal Bayesian values (f$ 
< 01). The differences were more pronounced as the number of alternatives 
to be considered increased (f —.01). Seven individual difference variables 
measuring adequacy in processing information were unrelated to conservatism. 
A process analysis suggests that strongly held initial opinions can limit the 
judged relevance of subsequent information and, in turn, affect the optimal 
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Bayesian decision. 


x 
Edwards and his associates (Edwards, 
Lindman, & Phillips, 1965; Edwards & Phil- 
lips, 1964; Phillips & Edwards, 1966; Phil- 
lips, Hays, & Edwards, 1966) have shown 
that human subjects are generally conserva- 
iive in making decisions when their decisions 
are compared to those resulting from the ap- 
plication of Bayes's theorem. Whether decid- 
ing about the percentage of red and blue poker 
chips in one of two bookbags or judging if 
information on a radar screen signifies an 
enemy attack, men appear to be suboptimal 
information processors. Subjects seem unable 
to extract “all the certainty from information 
that Bayes’ theorem indicates is in that in- 
formation [Edwards et al., 1965, p. 303].” 


1 This paper was written and the research was car- 
ried out during the coauthors’ tenure as National In- 
stitute of Mental Health Postdoctoral Fellows at the 
Educational Testing Service, Princeton, New Jersey. 
The research was supported by Research Grant 1 
POL HD01762 from the National Institute of Child 
Health and Human Development. Portions of the 
study were presented at the meeting of the American 
Psychological Association, Washington, D. C, Sep- 
tember 1967. Thanks are due to Charles Hermann, 
Nathan Kogan, and Melvin Novick for their com- 
ments on an earlier draft of this paper. 

2 Requests for reprints should be sent to Margaret 
G. Hermann, Mershon Center for Education in 
National Security, 199 W. 10th Avenue, Columbus, 
Ohio 43201. 


in... 

— As well as noting subjects’ conservatism, 
recent experiments have examined how a va- 
riety of procedural variables affected the 
magnitude of this conservatism. Such varia- 
bles have included the addition of payoff 
functions (Phillips & Edwards, 1966), use of 
different response modes (Phillips & Edwards, 
1966), variations in sample size (Peterson, 
Schneider, & Miller, 1965), and variations in 
the ambiguity of the information presented 
(Phillips et al., 1966). The present study was 
also concerned with the extent of man’s con- 
servatism. An attempt was made to ascertain 
the effect of different types of problems and 
the effect of skill in processing information on 
conservatism, 

The representativeness of the typical prob- 


lem used in studying human versus Bayesian 


information processing was the primary focus 
of the present experiment. Most research 
comparing persons with the Bayesian model 
of decision making has employed problems 
of the bookbag and poker chip variety. The 
question can be raised as to whether subopti- 
mal information processing will occur for 
problems of greater consequence to the sub- 
jects. It was hypothesized that degree of con- 
servatism is related to type of decision—the 
more important and complex the decision, the 
more conservative a subject's behavior is, rela- 
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tive to the Bayesian model. As problems be- 
come more lifelike in nature and more repre- 
sentative of “real-world” decision making, de- 
cisions will become more conservative. A posi- 
tive, rather than a negative, relationship was 
postulated because one can argue that the con- 
servatism in the case of the bookbag and 
poker chip studies results from the Bayesian 
model being an artificially risky representation 
of man's natural decision processes. There- 
fore, applying Bayes's theorem to problems 
more illustrative of ordinary decision making 
should reveal even greater conservatism. As 
concern for the consequences of an action 
increases and, consequently, the influence of 
past experience, of prior opinions, and of 
present needs increases, extracting certainty 
from available information may no longer be 
the central issue in the decision-making pro- 
cess. How consistent the information is with 
one's prior opinions and present needs may be 
more important in making a decision than the 
content of the information. 

Also of interest was the type of person who 
exhibits the conservative behavior. Do indi- 
viduals more competent at information pro- 
cessing deviate less from the Bayesian predic- 
tion? In other words, are people ordinarily 
considered rational and intelligent decision 
makers less conservative than those lacking 
such qualities? Given that the Bayesian model 
of decision making indicates how an ideal 
person would make a decision (see Becker & 
McClintock, 1967; Edwards, 1968), the de- 
cisions of more rational subjects should show 
the closer approximation to the dictates of 
the model, 

A 2 X 3 design, manipulating decision im- 
portance and complexity, was employed in the 
present study. Decision complexity was varied 
by giving the subjects either two or three 
alternatives to consider in making their de- 
cions. Three different kinds of problems were 
used in varying decision importance—a regu- 
lar poker chip problem, a problem concerning 
an increase in birth defects, and a problem 
involving the occurrence of an atomic explo- 
sion. In addition, a variety of individual dif- 
ference measures were obtained on the sub- 
jects. Measures were selected because of their 
relevance to the adequacy with which indi- 
viduals process information. In order to com- 


pare the subjects' decisions with predictions 
from Bayes's theorem, the subjects were re- 
quired to indicate how plausible each alterna- 
tive was (prior probability), how helpful 
different kinds of information were in deciding 
among the alternatives (probability o! the 
data given the hypothesis), and, in light of 
certain specific pieces of information, what 
their decisions would be. The prior probability 
and the probability of the data given the 
hypothesis were combined to determine the 
Bayesian optimal outcome. These results were 
contrasted with the subjects! decisions. 


METHOD 
Subjects 
The subjects were 120 undergraduate men from an 
Eastern state university, who were recruited through 


a student employment agency, They were paid $1.50 
an hour for spprosimately 3 hours of testing 


Procedure C 


All of the subjects were provided with a booklet 
entitled “Decision Process Questionnaire.” It bega 


In what follows you are going to be given a 
hypothetical problem to solve which has several 
possible answers. You will have three tasks to per- 
form in solving the problem. First, you will be 
asked to indicate how plausible you think each of 
the answers is. Second, you will be asked to judge 
how helpful different kinds of information are in 
deciding between the answers. Third, you will be 
asked to revise your initial opinion in the light of 
certain specific pieces of information. 


An example designed to help the subjects become 
acquainted with and interested in their problem fol- 
lowed this introductory statement. This practice ex- 
ample required the subjects to think through the 
three steps necessary for the application of Bayes’s 
theorem. These steps were estimating prior proba- 
bilities, estimating probabilities of certain data given 
the hypotheses, and judging final probabilities 

The practice example concerned the disappearance 
of an espionage agent. Two alternative hypotheses 
were presented to the half of the subjects who subse- 
quently would be in the two-alternative condition 
Three alternative hypotheses were specified for the 
remaining subjects who would be in the three- 
alternative condition. Questions about the procedure 
were solicited from the subjects at this time. 

The experimental problems followed the example 
in the questionnaire booklet. The problems were 
randomly assigned to the subjects, with every sixth 
subject receiving the same problem. As in the ex- 
ample, the subjects were asked to indicate the prior 
probabilities of specific mutually exclusive hypothe- 
ses, to estimate the relevancy of particular pieces 
of information to selecting among these hypotheses, 
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ind Lo revise their original probabilities in light of 
factual information which was to be considered true. 
/hen the subjects had completed work on their 
, they were asked to describe their reactions 
Specifically, the subjects indicated on 7-point 
how difficult the problem was, the importance 
e problem, and how concerned they were about 
the consequences of making a mistake. They also 

ted how skillful they were in using probabilities 
ind how many probability courses they had had. 
In addition, the subjects were asked if they knew 
Bayes’s theorem, If their answer was affirmative, 
they were requested to state what it was. The indi- 
vidual difference measures followed these questions. 


Experimental Conditions 


Decision importance. Decision importance was de- 
fined as involving both the import of a problem and 
the concern for the consequences of one's actions 
which any decision arouses. 

The subjects were presented with one of three 
problems which the experimenters thought differed 
in these two qualities. One problem dealt with the 
poker chip composition of a bookbag. Subjects were 
asked to determine which of several mutually exclu- 
sive and exhaustive hypotheses was more probable 
on the basis of a series of simulated draws from a 
bookbag. The second hypothetical problem concerned 
a rapid increase in the number of children with birth 
defects. In this problem the subject was asked to 
take the role of a science reporter for a leading New 
York newspaper in evaluating several mutually ex- 
clusive and exhaustive explanations for this phe- 
nomenon. His conclusion was to be drawn from a 
series of research findings bearing on the proposed 
explanations. The third problem posited the occur- 
rence of an atomic explosion in the San Francisco 
area. Subjects in this condition were asked to assume 
the role of chief intelligence officer for the Strategic 
Air Command in deciding which of several mutually 
exclusive and exhaustive alternatives was the more 
probable. Their decisions were to be made on the 
basis of a series of intelligence reports which pre- 
sented information on the various alternatives. 

It was assumed that the poker chip problem would 
be perceived as less important than the birth defect 
problem and, similarly, that the birth defect problem 
would be perceived as less important than the atomic 
explosion problem. To check the correctness of this 
ordering, the subjects were asked after completing 
work on their problem how important they thought 
it was and how concerned they were about the 
consequences of their actions. 

Decision complexity. Decision complexity was 
defined as the number of specified alternative hy- 
potheses to be considered in reaching a decision, 
namely, two or three. Subjects having to deal with 
three alternatives were thought to have a more 
complex decision to make than those with two 
alternatives, 

Half of the subjects working with each of the 
three problems (ie, 20 subjects) had two alternative 
hypotheses; the other half (ie. 20 subjects) had 


three alternative hypotheses. In all cases the subjects 
were instructed to consider only the alternatives 
explicitly presented to them. They were also told 
that the alternatives could not all be true. If one 
were true, the other(s) must be false. These require- 
ments were necessary to enable a test of the Bayesian 
model. For the model to apply, the hypotheses 
which are evaluated must be mutually exclusive and 
exhaustive. 

Subjects with the poker chip problem considered 
the following two alternatives: (a) 'The bookbag 
contains 70% red chips and 30% blue chips, and 
(b) the bookbag contains 30% red chips and 70% 
blue chips. Those subjects working with three alter- 
natives were also told that the bookbag could contain 
5096 red chips and 5096 blue chips. 

The two alternatives which all subjects with the 
birth defect problem were instructed to consider 
were: (a) The increase in birth defects is a side 
effect of the increased use of birth control pills, 
and (b) some condition in prenatal or postnatal 
medical care has changed and inadvertently produced 
an increase in birth defects. The additional alterna- 
tive specified for the subjects in the three-alternative 
condition was that the large number of children 
with birth defects merely reflects the rapid increase 
in the number of babies born in the last several 
years. In other words, the percentage of children 
with birth defects has remained the same. 

The two alternatives considered by all subjects 
working with the atomic explosion problem were: 
(a) The atomic explosion is the result of a careful 
and deliberate plot by Russia to provoke nuclear 
retaliation on Communist China, and (b) the explo- 
sion is the result of a careful and deliberate plot by 
Communist China to provoke nuclear retaliation on 
Russia. The additional alternative which only con- 
cerned those subjects in the three-alternative condi- 
tion stated that the explosion was the result of an 
accidental detonation of fissionable materials already 
in the San Francisco area. 

To check if the subjects perceived any difference 
in complexity when dealing with two or three alter- 
natives, they were asked after finishing their problem 
to indicate how complicated they had found it to be. 


Dependent Variable 


'The dependent variable in the present study was 
the signed deviation of each subject's decision from 
what was determined in Bayesian terms as the 
posterior probability of the hypothesis given the 
data. To calculate such posterior probabilities (or, 
more simply, the optimal Bayesian decision), one 
must know the prior probability of the hypothesis 
and the probability of the data given the hypothesis. 
A description of how the prior probability of the 
hypothesis and the probability of the data given the 
hypothesis were obtained will precede discussion of 
the calculation of the posterior probability. 

Prior probability. Prior probability refers to the 
subject's initial opinion about a hypothesis. For the 
bookbag and poker chip problem, uniform prior 
probabilities (50-50 in the two-alternative condition, 
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33-33-33 in the three-alternative condition) were 
assigned to the subjects as has typically been done 
for such problems in previous experiments. In order 
to ascertain prior probabilities for the hypotheses in 
the birth defect and atomic explosion problems, the 
subjects were asked to assign a number from 1 to 
100, indicating their first reactions to the probabili- 
ties of the several alternatives. To emphasize that 
only the alternatives specified were to be considered, 
the probability estimates were required to sum to 
100, All subjects complied with this requirement. 

Probability of the data given the hypothesis. The 
concept of probability of a datum given a hypothe- 
sis is clearest in the bookbag and poker chip problem. 
For this problem, simple statistical theory suffices 
to determine the appropriate value. If, for example, 
the subject is trying to guess whether he is drawing 
chips from Bag A (70% blue, 30% red chips) or 
Bag B (30% blue, 70% red chips), the probability 
of a red chip given Hypothesis A is .30 and given 
Hypothesis B is .70, In other words, for a red chip 
P(D|H.) = 30, and P(D|Hz) =.70, where P refers 
to probability, D refers to datum, and H refers to 
hypothesis. The present subjects were asked to esti- 
mate ratios such as P(D|H4)/P(D|Hs) because 
Edwards et al. (1965) reported that subjects find 
ratio estimation easier and give more reliable re- 
sponses than they do in simply estimating P(D|H.) 
by itself. 

In performing the ratio-estimation task, the sub- 
jects assigned numbers from 1 to 10 to the specified 
alternatives for each of 12 (in the two-alternative 
condition) or 18 (in the three-alternative condition) 
hypothetical pieces of information. The information 
included six items which a pretest? had shown to 
be most relevant to each alternative. Two sample 
pieces of information from the birth defect problem 
are: 


Families who wait more than 4 years before having 
children seem to have more children with birth 
defects than those who have children immediately 
alter marriage. 


Recent studies of currently used incubators have 
indicated some malfunction in the gauges con- 
trolling oxygen supply and carbon dioxide removal. 


3 The pretest consisted of asking eight staff mem- 
bers at the Educational Testing Service (primarily 
research assistants) to indicate prior probabilities for 
the alternatives in the birth defect and atomic ex- 
plosion problems and to note the relevance of 30 
pieces of information in choosing among these al- 
ternatives. Four of these subjects considered two 
alternatives, while the other four subjects had three 
alternatives. In the case of the two alternatives 
present in both the two- and three-alternative condi- 
tions, items of information were included in the 
study when six of these eight subjects agreed on 
which alternative was most probable; three of the 
four subjects with three alternatives had to agree 
for an item of information to be included for the 
third alternative. 
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Two sample items from the atomic explosion problem 
are: d 
Population distribution in the USSR and Com. 


munist China has changed over the last several 
years so that nuclear retaliation would injure 70% 
of all Russians and only 40% of all Communist 
Chinese. 


The area destroyed by the explosion was what 
would be expected from the maximum megaton 
capabilities of Russia. ` 


Judgments were made so that the numbers indicating 
the probability of the information given each alter- 
native formed ratios. Thus, if the subject thought 
that the first piece of above information for the 
birth defect problem was highly likely given the 
pill alternative, he might assign a 9 as his answer 
for that particular alternative. If he thought the in- 
formation was unlikely given the other alternative(s), 
then it (both) might be assigned a 1. In such a 
case the subject was informed that his judgment 
suggested that the particular piece of information 
could provide only one-ninth as much supporting — 
evidence for this alternative(s) as it could for the 
pill alternative. These likelihood ratios were used as 
the inputs into Bayes’s theorem for the birth defect 
and atomic explosion problems. 

Asking the subjects with the poker chip problem 
for this information on each of 12 or 18 draws, when 
the same answer was expected for each draw, could 
easily have antagonized or insulted them. Therefore, 
these subjects were simply informed what the proba- 
bilities of the data given the hypotheses were for 
their problem. 

Posterior probability. As noted earlier, to obtain 
the posterior probability—or, more simply, the 
optimal Bayesian decision—one must first know the 
prior probability of the hypothesis and the prob- 
ability of the data given the hypothesis. The calcula- 
tion of the posterior probability involves the fol- 
lowing formulas which are given for the two- 
alternative condition for reasons of simplicity. Bayes's 
theorem may be stated as 


P(D|E) PCH) 
P(D) : j 


ee E 


pa ce ll ll A ne 


P(H|D) = 


where P (H|D) = the posterior probability of Hy- | 
pothesis H, P(D|H) =the probability of a particular 
Datum D given that H is true, P(H) =the prior 
probability of H, and P(D) =the probability of 
Datum D. Rewriting this formula separately for each 
of the two-alternative hypotheses, Ha and Hp, and | 
dividing the resultant expression for P(H4|D) by the 
expression for P(H»|D) yields the computationally 
convenient equation 


P(HA|D) _ P(D|Ha) 


( e ) y PA) \ 
P(H»|D)  P(D|Hs) 


P(Hs) i 
The ratio of the two posterior probabilities is easily 


translated into the separate values for these terms | 
since P(H,|D) + P(H5|D) —1. When more than | 
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one datum is being considered at a time, as in the 
. present study, P(D|H,) is the product of the prob- 


2 abilities of cach datum considered given the hy- 


r, 


. The same is true of P(D|H;). Further 
tational discussion and numerical examples are 
provided in Edwards et al. (1965). 


‘The decision-making stage of the subject's par- 


4 


F ticipation came after the probabilities of the data 


- given the hypotheses had been obtained. Each sub- 
ject was presented with a sampling of eight pieces 
of information drawn from those he had previously 
judged. Six of the eight pieces of information were 
those that favored the alternative which the pretest 
suggested would be considered leas! likely at the 
outset. In the two-alternative condition, the remain- 
ing two pieces of information favored the second 
alternative. In the three-alternative condition, one 
piece of information favored each of the two remain- 
ing alternatives. In both conditions, these two pieces 
of information were selected from the total at ran- 
dom. The information was divided into six and 
two pieces, respectively, to encourage as large a range 
as possible over which opinion revision could occur. 
The alternative favored most by the information in 
the atomic explosion problem (considered least likely 
at the outset) was that Russia was unleashing the 
attack. The comparable alternative for the birth 
defect problem was that hospital conditions were 
responsible. For the poker chip problem, the 70% 
red-3096 blue alternative was arbitrarily selected as 
the correct one. Each of these alternatives appeared 
in both the two- and three-alternative conditions. 
The eight pieces of information were presented on 
separate pages of the questionnaire. After reading 
all eight pages of information, the subject was 
asked to indicate his decision of how probable he 
now thought each of the alternatives was. Once 
again the subjects were required to make the prob- 
abilities sum to 100, on the assumption that one of 
the alternatives was true. 

Since Peterson and DuCharme (1967) have found 
a primacy effect when examining subjective prob- 
ability revision—information appearing early in a 
sequence influencing the subjects’ probability esti- 
mates more than information appearing later—the 
eight pieces of information were presented in a fixed 
order. The second and seventh pieces of information 
always provided the subject with data not conducive 
to a decision in the anticipated direction. In this 
manner, information more likely to be in agree- 
ment with the subject's original opinion received 
less potential benefit from either a primacy or 
recency effect, 

In sum, each subject was given one decision to 
make. Of interest was how much this decision dif- 
fered from that predicted by Bayes's theorem. To 
determine the Bayesian decision, the subject was 
asked to indicate an initial reaction to the problem 
before knowing much about it (prior probability of 
the hypothesis). He was also required to note how 
certain pieces of information would affect his decision 

if they proved to be true (probability of the data 

given the hypothesis), Although each subject made 


these latter judgments for 12 or 18 pieces of infor- 
mation, depending on the number of alternatives he 
was considering, only the judgments for the 8 pieces 
of information indicated as being true and employed 
by the subject in making his final decision were used 
in determining the Bayesian decision. 

The present procedure differs somewhat from other 
studies comparing Bayesian and human decisions: 
(a) The subjects only made one decision, rather 
than a series of decisions; (b) the subjects’ own 
prior probabilities and probabilities of the data given 
the hypotheses were used, rather than uniform values 
specified by probability theory or by the experi- 
menters, Probability theory has difficulty specifying 
such probabilities for the more lifelike problem. 
Moreover, since the present research employed sev- 
eral problems on which individuals have definite 
views, it seemed important to get the subjects’ own 
opinions, rather than imposing those of the experi- 
menters upon them. This process involved a more 
detailed focus on each step in applying the Bayesian 
formula, limiting the number of problems which the 
subjects could consider in the time available for the 
experiment. 


Individual Difference Measures 


Six measures of individual differences were com- 
pleted by each subject. Five of the measures were 
selected because of their relevance to the adequacy 
with which individuals process information. These 
five measures were (a) a composite measure of 
rational risk taking (Kogan & Wallach, 1964); (b) a 
measure of integrative complexity (Schroder, Driver, 
& Streufert, 1967); (c) a short, timed test of verbal 
intelligence—the Advanced Vocabulary Test (V4) 
from the Kit of Reference Tests for Cognitive Factors 
(French, Ekstrom, & Price, 1963); (d) the human- 
istic (“Left”) and normative (“Right”) ideology 
scores from the Tomkins (1964) Polarity Scale 
(see also Tomkins, 1963); and (e) the 40-item 
Rokeach (1960) Dogmatism Scale (see also Long & 
Ziller, 1965). Previous research (that already listed) 
with these measures has shown that rational risk 
taking, integrative complexity, verbal intelligence, and 
humanistic ideology relate positively to adequacy in 
processing information; normative ideology scores 
and dogmatism have been found to relate negatively 
to adequacy in processing information. 

The sixth individual difference measure, the Kogan 
and Wallach (1964) Choice Dilemmas instrument, 
assesses propensity to take risks. This measure was 
included not so much because it bears on adequacy 
in information processing as because it is indicative 
of a stylistic approach to decision making. 


RESULTS 

Manipulation Checks 
To check that the subjects perceived de- - 
cision importance and complexity as intended 


by the experimenters, the subjects’ post- 
decision questionnaire responses were ana- 
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TABLE 1 


MEANS AND ANALYSIS OF VARIANCE OF POSTDECISION QUESTIONNAIRE RESPONSES } 


M cell values | 


“Complexity” "Importance" Concern over consequences" | 
Type of problem Number of alternatives 
2 3 2 3 2 3 
Poker chi 2.25 2.90 3.15 3.05 3.20 3.10 
Atomic alsin 3.70 3.45 3.90 3.75 4.75 4.45 
Birth defect 3.20 3.20 4.70 4.75 4.65 | 5.00 
JURE of variance 

Source df MS E df MS F af | M | LA 
Ty 33.06 | 13.05" 
Type of problem (A) 2 10.21 5.46* 2 26.51 10.99* 2 33.06 | 13: 
Number of alternatives (B) 1 58 «1 1 13 «1 1 OL <1 | 
AXB 2 2.16 1.15 2 Af «1 2 131 [599 

Error 114 1.87 114 241 114 2.37 | 

n * p <.01, ay 


lyzed. Table 1 summarizes the analysis of 
variance results for these questions. The sub- 
jects perceived the problems to differ in im- 
portance and in complexity, regardless of the 
number of alternatives to be considered. 
Individual comparisons among the means 
(Winer, 1962, pp. 207-211) indicated that for 
perceptions of complexity and for percep- 
tions of concern over consequences, the differ- 
ence was between the poker chip problem and 
the other two problems. The means for the 
birth defect and atomic explosion problems 
were not significantly different on these two 
questions, For the question on importance, 
however, the comparisons among the means 
showed that the birth defect problem was 
seen as significantly more important than the 
atomic explosion problem (F = 6.75, (AES 
1/114, p < .05), which, in turn, was viewed 
as significantly more important than the poker 
chip problem (F = 4.32, df= 1/114, p< 
.05). Contrary to the experimenters’ assump- 
tion, the subjects perceived the birth defect, 
rather than the atomic explosion, decision as 
the most important. Consequently, the sub- 
jects’ ordering of the decisions on importance 
was used throughout the rest of the analysis. 
There were no significant main effects or 
interactions for postdecision questions con- 
cerned with prior experience with probability. 


In other words, the groups did not differ 
significantly in the number of probability 
courses they had had or in their perceived 
skill in using probabilities. Moreover, no sub- 
ject was able correctly to identify Bayes’s 
theorem. 

The data from a pretest were used in deter- 
mining which alternatives in the birth defect 
and atomic explosion problems were to be. 
considered correct and favored by the eight 
pieces of information on which the subjects 
were to base their decisions, The supposedly 
least favored alternative was selected —in the 
case of the birth defect problem, the alterna- 
tive indicating hospital conditions was re 
sponsible; for the atomic explosion problem, 
the alternative indicating Russia was unleash- 
ing the attack. The average prior probabili- 
ties for the subjects by condition were: birth. 
defect, two alternatives—birth control pills. 
= .62 (SD = .16), hospital conditions = 38 
(SD = .16); birth defect, three alternatives— 
birth control pills = .42 (SD = .18), hospital. 
conditions = .17 (SD = 15), baby increase 
=41 (SD -— 21); atomic explosion, two. 
alternatives—Russia did it = .38 (SD =.10), 
China did it = .62 (SD = 10); atomic explo- 
sion, three alternatives—Russia did it = 35. 
(SD = .22), China did it = .30 (SD = .24), 
accidental = .35 (SD = .26). An examination 
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of the subjects’ prior probabilities shows that 
for three of the four conditions the least 
favored alternative was indeed selected. In 
the fourth condition, the atomic-explosion- 
three-alternative problem, the average prior 
probabilities were practically the same, 


Effects of Decision Importance and 
Com plexity 


Table 2 presents the results of the analysis 
of variance for the deviation of the subjects’ 
decisions from the optimal Bayesian decision. 
As hypothesized, the more important the 
decision, the greater was the conservative 
deviation from optimality. 

Even though the subjects did not perceive 
a difference in complexity among the two- 
and three-alternative conditions, the number 
of alternatives did affect their decision- 
making behavior, As Table 2 shows, there was 
a significant interaction between type of prob- 
lem and number of alternatives. With three 
alternatives to consider, deviation from the 
Bayesian decision became larger as problem 
importance increased. In the two-alternative 
condition, there was little difference between 
the three types of problems. Problem impor- 
tance appears to be more salient as the num- 
ber of alternatives to be considered increases. 
Interestingly, in the three-alternative-poker- 
chip condition, the subjects decisions aver- 
aged very close to that predicted with the 
Bayesian statistic. 


Effects of Individual Differences 


None of the correlations between the indi- 
vidual difference variables and deviation from 
the Bayesian decision were significant, The 
correlations ranged from .15 to —.07. There 
seems to be little relationship between ade- 
quacy in processing information as measured 
in the present study and deviation from 
optimality. 


Process Analysis of Bayesian Decision 


Because two of the problems in this study 
required the subjects to indicate the rele- 
vance of certain information to the alterna- 
tives under consideration, it is possible to 
examine how variations in one part of the 
Bayesian decision process affect another. 


TABLE 2 


MEANS AND ANALYSIS OF VARIANCE OF 
DEVIATIONS FROM BAYESIAN DECISION 


M cell values 


Type of problem 


Poker chip .22 
Atomic explosion 25 


Birth defect 26 


Analysis of variance 


Source df MS F 
"Type of problem (A) 2 -33. | 10e 
Number of alternatives (B) 1 .01 «1 
2 19 6.87* 
Error 114 .03 


+p <01. 


One such analysis involves the relation be- 
tween the subjects’ initial opinions and their 
judgments of the information. In most re- 
search on the Bayesian model of decision 
making, quantitative values are provided for 
these variables by the experimenter or prob- 
ability theory. When, however, these values 
are not available and subjects must make the 
estimates, do their initial opinions influence 
their judgments of how much impact any 
information can have in distinguishing among 
the alternatives? Do subjects pay attention 
to the content of the information, or do they 
merely rate it in line with their prior 
opinions? In effect, how consistent are the 
subjects’ prior probability estimates and their 
ratings of the probability of the information 
given the hypothesis when the information 
supports the nonfavored alternative? Such an 
analysis is particularly germane to the focus 
of the present paper because as problems 
become more representative of real-world 
situations, it is highly likely that judgments 
of relevance and prior probabilities will be 
made by the individual decision makers. 
Moreover, here may be an important source 
of conservatism. If a subject has a definite 
prior opinion, he may be less responsive to 
new information and, in turn, be more 
conservative. 

The consistency between the subjects’ in- 
itial opinions and their judgments of the 
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TABLE 3 


MEANS AND ANALYSIS OF VARIANCE FOR 
PREDECISION CONSISTENCY 


M cell values 


Number of alternatives 
Type of problem ——— zi 
| 2 | 3 
Atomic explosion OL | 10 
Birth defect —.10 | —.30 
j Analysis of variance 
Source df MS F 
Type of problem (A) 1 119 | 18.35* 
Number of alternatives (B) 1 .005 «1 
AXB 1 .038 5.94* 
Error 76 .006 


Note.—A positive score means the subject viewed the in- 
formation in line with his prior opinion; a negative score means 
the subject did not view the information in terms of his prior 
opinion, 

p> «.0l. 


probability of the information given the hy- 
pothesis was found by examining whether 
their ratings of the information deviated from 
neutrality in the same way and to the same 
degree as their prior opinions. Such a score 
was derived by determining the average de- 
viation from neutrality (.50 for the two- 
alternative condition and .33 for the three- 
alternative condition) for each of the eight 
pieces of information considered in the 
decision-making stage of the procedure. Simi- 
larly, how much the subject's prior opinion 
deviated from neutrality was calculated. These 
two deviation scores were multiplied together 
and by 10 to form the measure. A positive 
score indicated that the subject viewed the 
information in line with his initial opinion; 
a negative score indicated that the subject 
did not consider the information in terms of 
his prior opinion. 

The results of an analysis of variance using 
this preopinion revision or predecision con- 
sistency measure as the dependent variable 
are shown in Table 3. Both the problem 
main effect and the interaction were signifi- 
cant. In considering information for the 
atomic explosion problem, the subjects tended 
to see it as supporting their original positions. 
The opposite was true of the birth defect 
problem. Both effects were more exaggerated 
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when three, as opposed to two, alternatives 
were considered. Previous positions had more 
of an impact in the atomic explosion than the 
birth defect problem. Probably because stu- 
dents have thought less about the causes of 


birth defects than about their reactions to | 
Russia and China, their opinions were less 
firm on this issue. Hence, the subjects with 
the atomic explosion problem were more com- - 
mitted to their prior positions and tended 
to interpret information so as to confirm 
rather than disconfirm these opinions. On the 
other hand, the subjects with the birth defect 


problem could be more open-minded and give | 
strong weight to the data, even though such 
weighting constituted negative feedback to 
their initial reactions. 

The results with predecision consistency are 
symmetrical with, though in the opposite di- 
rection from, the results with deviation from 
optimality. Subjects who considered the birth 
defect problem were more open-minded than 
those with the atomic explosion problem, yet 
they were more conservative in comparison 
with the Bayesian decision. In order to ex- 
plore this relationship directly, the correla- 
tion between predecision consistency and 
deviation from optimality was calculated. This 
correlation was —.14. Although mot signifi- 
cant, the relationship is in the direction sug- 
gested by the analysis of variance results. 

Predecision consistency was significantly i 
associated (r = —.22, df = 78, p < .05) with 
the Bayesian decision. Apparently the 
“optimal” Bayesian decision can be affected — 
by how responsive a subject is to information 
relevant to his decision. This finding has im- 
plications for the subject’s decision. If a sub- 
ject is heavily influenced in a consistency- 
maximizing direction when rating the infor- - 
mation’s relevance to the alternatives, the | 
Bayesian decision resulting from his subjec- 
tive judgments will be less extreme than if 
he is not (or little) influenced by his prior 
opinion. In other words, the more open an 
individual is to the possible negative feedback 
in the information he is rating, the further 
he will have to go in his decision to match 
the optimal decision. 

An examination of the individua] difference 
variables shows that propensity to take risks 
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“as measured by the Kogan and Wallach 
(1964) Choice Dilemmas instrument is sig- 
nificantly correlated with predecision consist- 
ency (r= 23, df=78, p< .05). Low risk 
takers were more likely to be influenced by 
their initial opinions than high risk takers. 
Subjects who had a predilection to view in- 
formation selectively seemed to have a sty- 
listic tendency to make conservative decisions. 
The other individual difference variables were 
virtually unrelated (rs ranged from .06 to 
—.08) to predecision consistency. 

A reminder is in order here. The preceding 
interpretations are limited, as is the process 
analysis, to problems where (a) the data on 
which the Bayesian prediction is based cannot 
be objectiñed without using individual judg- 
ments, and (5) the information which the 
subject must consider in making his decision 
lends support to the nonfavored alternative. 


Discussion 


The results indicate that subjects were more 
conservative as the decisions to be made be- 
came more lifelike in nature. The more com- 
plex and important the decisions were per- 
ceived to be and the more concern over conse- 
quences they aroused, the more conservative 
the subjects were when compared to predic- 
tions using Bayes’s theorem. These differences 
were more clear-cut as the number of alterna- 
tives to be considered increased. When faced 
with more representative, nontrivial problems, 
individuals appear to function even less like 
Bayesian decision makers than previously 

| thought. 
The Bayesian model of decision making is 
a normative or prescriptive model (cf. Becker 
& McClintock, 1967; Edwards, 1968), that 
is, indicating how an ideal person would make 
a decision. Therefore, a subject's behavior 
does not necessarily have to match the Bayes- 
ian prediction. If one's interest is in how 
decisions are actually made or deriving a 
descriptive model from a normative model, 
the conditions under which behavior deviates 
" from optimality and the conditions under 
which it approaches optimality become im- 
portant. Descriptive models suggesting how 
people do make decisions can be based on 
amendments to normative models required by 
nonoptimal behavior. The present results 
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indicate that such descriptive models should 
decsion to the indivdisls toed UR 
decision to the individuals involved in 
decision-making task and the number of 4 
natives to be considered in reaching the 
decision. Moreover, in the Bayesian model, 
the prior probabilities are assumed to be 
independent of the judgments of the prob- 
abilities of the data given the hypotheses. - 
Our predecision consistency measure suggests — 
that psychologically this assumption of inde- 
pendence cannot be taken for granted. It is 
precisely the amount of independence which 
may be changing as problems become less like 
mathematical puzzles and more like “real- 
life" decisions. 

Edwards and his associates ( Edwards, 1962, 
1968; Edwards et al., 1965) have proposed — 
that a man-machine system employing Bayes- 
ian analysis replace inherently conservative 
human decision makers. Called a probabilistic 
information-processing system (PIP), such . 
a system uses computer-calculated Bayesian 
posterior probabilities based on experts’ judg- — 
ments of the value of available information, 
Studies (Kaplan & Newman, 1966; S 
Goldstein, & Southard, 1966) comparing 
man-machine system to other possible sys- 
tems (e.g, men estimating posterior odds 
directly) indicate PIP's greater efficiency. 
The present results for predicision consistency 
raise some questions about PIP's outcome. 
If the decisions submitted to the man-machine — 
system are of any consequence to the experts 
making the judgments of the relevance of 
available information, the experts! prior 
opinions about the plausibility of the various 
alternatives may influence such judgments. 
Moreover, where such experts have strong 
dispositions toward a specific weighting of 
alternatives, the Bayesian decisions resulting 
from the experts’ judgments will be lower or 
more conservative than would be the case 
were the opinions not as strongly held. In 
effect, the range over which the alternatives 
can change has decreased. Training and ex- 
perience in separating prior opinion and judg- 
ments of data relevance may help to minimize 
this problem. A measure of the strength of the 
prior opinions is even more desirable. The 
range of possible Bayesian solutions for 
strongly, moderately, and weakly held prior 
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opinions could be determined and examined 
before a decision was made. 

Many extralaboratory decision situations 
require subjective judgments like those in- 
volved in the atomic explosion and birth de- 
fect problems. Objective indicators (e.g., 
radar, dice) are not available for the prior 
probabilities and probabilities of the data 


given the hypotheses. When subjective 
judgments are involved in determining the 
Bayesian decision and the information 


provided supports the nonfíavored alterna- 
tive, one is, in a sense, studying attitude 
change. As in the present study, the sub- 
ject is asked to indicate an advance opinion 
on a problem and how relevant various infor- 
mation is to that opinion. He is then given 
certain pieces of data which are supposedly 
true but do not support his view. The subject 
is faced with changing his position. The influ- 
ences on the degree of change should be simi- 
lar to those found in research on attitude 
change. One such variable is the consistency 
which the subject perceives between his 
opinion and the persuasive communication. 
The more consistent the new information, the 
less need there is for change (cf. Cohen, 1964). 
Our predecision consistency measure provided 
an index of the subjects’ perceptions of how 
the persuasive communications matched their 
initial opinions. The investigation of other 
variables involved in the attitude change pro- 
cess, such as the consistency variable, may 
aid in modifying the Bayesian model to 
describe how men actually use information in 
making day-to-day decisions. 

The lack of relationship between the indi- 
vidual difference variables and deviation from 
the Bayesian decision is at first glance 
puzzling. Ability to adequately process in- 
formation was not related to approximating 
the Bayesian decision. Phillips et al. (1966) 
have proposed that conservatism is the result 
of intellectual deficiencies. If this is the case, 
variables such as rationality, verbal intel- 
ligence, and integrative complexity should 
have been related to deviation from optimal- 
ity—more rational, intelligent, and complex 
individuals should have shown less conserva- 
tism. The lack of relationship raises questions 
about the type of ideal decision maker the 
Bayesian model denotes. Perhaps these cogni- 


Marcaret G. HERMANN 


tive individual difference variable e less 
important than those associated with attitude 
change (e.g., self-esteem, tolerance for ambi- 
guity) or motivation (e.g., need for certainty, 
fear of failure). Such variables affec! how a 
subject will approach a decision-making task 
(his "set") and, thus, may influence his 
decision. An alternative explanation for the 


lack of relationship is provided by the analy- 
sis of predecision consistency. Conservatism 
may be specific to the problem i in a 
way that broad-gauged personality measures 
are unlikely to detect. In the two problems 
in the present study where consistency could 
be ascertained, prior opinion affecte] infor- 
mation processing in one way for one problem 
and in the opposite way for the other prob- 
lem. In effect, the influence of prior opinion 
may be so powerful that it leaves little room 
for the operation of individual diñerence 
variables. | 
The present paper has focused on whether 
men can be intuitive Bayesian statistici 
when faced with relatively real-life problem 
The results suggest some hitherto un 
sidered influences on the Bayesian model o 


decision making. By employing problems of 


the bookbag and poker chip variety, previous 
research on the Bayesian model of decision 
making has not had the need to elucidate the 
systematic relations between probabilities. 
The process analysis in the present study 
indicates that where more lifelike problems 
are involved, such relations become impor- 
tant. Further investigation of the systematic 
relationships among the probabilities or 
phases of the decision process for more repre- 
sentative problems may hold the key to why 
men generally deviate in a conservative di- 
rection from the Bayesian optimal decision. 
With such knowledge, we will be better able 
to determine whether men are inherently con- 
servative in processing information or whether 
for real-life problems Bayesian statistics are 
intrinsically rash. 
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IN THE PRISONER’S DILEMMA GAME: 
A REPLY TO GUMPERT, DEUTSCH, AND EPSTEIN * 
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A recent study by Gumpert, Deutsch, and Epstein indicated that subjects 
played the Prisoner’s Dilemma game more cooperatively when imaginary money 
was at stake than when real money was used. Since these findings are at 
variance with the majority of published studies, two experiments were under- 
taken in an attempt to isolate the variables responsible for their results. The 
first experiment employed a control group which received neither an initial 
stake nor real money, as well as an initial stake and a real money group. The 


initial experiment. In both studies, subjects playing for real money were more 
cooperative than those playing for imaginary money, but in neither study did 
the differences reach significance. A combined analysis for both studies did 
yield a significant difference in favor of real money. It is suggested that the 
relationship between real money and cooperative behavior is a highly complex 


function of a number of variables. 


In a recent issue of the Journal of Personal- 
ity and Social Psychology, Gumpert, Deutsch, 
and Epstein (1969) reported the results of 
an experiment designed to assess the effects 
of real and imaginary money on cooperative 
behavior in the Prisoner’s Dilemma (PD) 
game. Subjects played for either real dollars 
or imaginary dollars, equal to the face value 
of the matrix shown in Figure 1, or for inter- 
mediate amounts of real money equal to 10%, 
5%, or 1% of the face value of the matrix. 
Overall, subjects in the imaginary dollars 
condition produced 46% cooperative re- 
sponses, compared to only 32% in the real 
dollars condition. The small money conditions 
produced a level of cooperation equal to or 
slightly greater than the amount of coopera- 
tion in the real dollars condition. 

The results of the Gumpert et al. experi- 
ment are inconsistent with the findings re- 
ported by Radlow, Weidner, and Hurst 
(1968), using the PD game; Gallo (1966), 
using a matrix version of the Deutsch and 
Krauss trucking game; and McClintock and 
McNeel (1966, 1967), using the Maximizing 
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differences game. All of the foregoing studies 
found greater cooperativeness with ‘large 
monetary rewards. The results are also in- 
consistent with the observations of Gallo and 
McClintock (1966), later revised by Gallo 
(1968), that large monetary rewards will in- 
crease cooperative behavior in mixed-motive 
games that are structured such that competi- 
tive behavior will result in monetary loss, 
rather than just reduced monetary gain. On 
the other hand, Wrightsman (1966) and 
Willis and Joseph (1959) failed to find dif- 
ferences between real and imaginary money, 
and Messé, Bolt, and Sawyer (1970) found a 
nonsignificant difference in favor of small 
monetary rewards, as compared to large ones. 
Since the Gumpert et al. experiment found 
a significant difference in favor of imaginary 
money and did so with a matrix containing 
negative values, it seemed relevant to examine 
closely the method of that study. A detailed 
examination of the method and instructions 
(obtained by personal communication with 
D. Gumpert) indicated three factors that 
could possibly account for their results. 


| 
1 
second experiment involved modifying the instructions that were used in the 
i 
E 
i 
} 
| 
| 
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1. Subjects in the imaginary dollars and 
real dollars conditions were recruited through 
an ad in a weekly New York newspaper which 
promised financial compensation for experi- 
mental participation. Thus, the subjects were 
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self-selected on the basis of earning money, 
and might differ significantly from subjects 
who are recruited to fulfill an experimental 
participation requirement. 

2. Subjects in the imaginary dollars condi- 
tion were unconditionally given $4 at the 
start of the experiment. A gift of $4 could 
alter the frame of mind of the subjects, and 
does not seem comparable to an experiment 
in which people just play a game for imag- 
inary money. 

3. The instructions contained segments 
which appeared to the present authors to 
suggest competition. Such instructions could 
have mitigated or altered an effect of real 
money.* 


Two experiments were conducted to explore 
the effects of the above-mentioned factors. 
Since San Diego State College requires its 
introductory psychology students to take part 
in 5 hours of experimentation, it was decided 
to recruit subjects from the introductory 
classes, rather than through ads in newspapers 
or on bulletin boards. The subjects were not 
advised that money would be available. 
Otherwise, the method was identical to that 
employed in the original experiment, with the 
exception that Experiment I employed an 
additional control group that played for imag- 
inary money but did not receive the initial 
$4 stake, and Experiment II employed in- 
structions that did not contain the presum- 
ably competitive segments of the original 
instructions. 


METHOD 


Experiment I 

Subjects 

Sixty male and 60 female subjects were recruited 
by means of a sign-up sheet placed on a bulletin 
board in the Life Sciences Building at San Diego 
State College. Subjects always played in like-sex 
pairs and were assigned equally to the experimental 
conditions: Participation in the experiment partially 
fulfilled a 5-hour experimental participation require- 
ment, No subjects were advised that money was 
involved in the experiment until they arrived at 
the laboratory. 


Procedure 


The subjects arrived at the laboratory in pairs 
and were seated at the experimental table. Plywood 

3 Complete copies of the instructions may be ob- 
tained from the senior author. 
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. choices. The matrix shown in Figure 1 was sul 
imposed on the game board. The subjects played | 


PLAYER ! 


an 


Fic. 1. The PD matrix. 


partitions prevented them from seeing each other or 
the experimenter once they were seated. Each sub 
was provided with an electric game board, consisting — 
of two response buttons, marked red and green, and - 
four lights which indicated the outcome of the joint. 


the game for 20 trials, 

Subjects were randomly assigned to either an imag- 
inary dollars condition, an imaginary dollars condi- 
tion with a $4 real money initial payment, ora real. 


game. In the ake iis dollars with $4 initial pay= 
ment condition, subjects were given $4 to take home | 


dollars condition, the subjects were given $10 in real. 3 
money to begin the game. b 
In order to keep conditions as identical to the | 
Gumpert et al. study as possible, the subjects were 
asked to indicate the expected choices of the other 
player before each trial began. At the conclusion 
of each trial, the subjects were informed of the 
choices and outcomes of both players and the amount — 
of money earned by each. 


Instructions 


The instructions were given to the subjects via a 
tape recorder. Except for slight changes made to take 
into account minor differences in equipment function 
these instructions were identical to those on the tape — 
used by Gumpert et al. 


Experiment II 
Subjects 


Forty male and 40 female subjects were recruited il 
in the same fashion as in Experiment I. Again, the 
subjects were not advised of the fact that money 
would be available, and all subjects played in like- 
sex pairs. 


TABLE 1 


PERCENTAGE OF COOPERATIVE 
Cuorces Map BY Borg 
PLAYERS 


Trial block 
Condition 


1-5 | 6-10 | 11-15 | 16-20 | 1-20 


Imaginary dollars, $4 
stake, Gumpert et al*| 57 | 42 | 38 | 47 | 46 
Real dollars, Gumpert 


et al^ 7|31]| 26 22-132 
Imaginary dollars" 42 | 32 | 31 24 | 32 
Real dollars" 51 | 44 | 38 30 | 41 

*N = 10, à p 

bN = 40. 

Procedure 


Subjects were randomly assigned to either an imag- 
inary dollars condition with no real money initial 
payment or a real dollars condition. As in Experi- 
ment I, sexes were assigned equally to the experi- 
mental conditions. Except for instructions, the 
method was identical to Experiment I and, hence, 
to the Gumpert et al. experiment. 


Instructions 


The instructions were given to the subjects via a 
tape recorder. They were identical to the instructions 
used in Experiment I, with the exception that certain 
portions were omitted. The omitted portions referred 
to the fact that if a player knew that the other 
would choose green, he could win more by choosing 
red; that they would never see each other again ; 
that they did not care about the other's feelings 
nor did the other care about theirs; and that they 
did not care what happened to each other, 


RESULTS 
Experiment I 


The dependent variable for all analyses was 
the percentage of green (cooperative) re- 
sponses made by each dyad. The imaginary 
dollars condition produced 28.75% coopera- 
tive responses, the imaginary dollars with $4 
initial payment condition produced 32.63% 
cooperative responses, and the real dollars 
condition produced 38.63% cooperative re- 
sponses. A one-way analysis of variance ap- 
plied to these results yielded no significant 
differences (F = 1.94, df = 2/57). 


Experiment II 


The imaginary dollars condition produced 
35.50% cooperative behavior, whereas the 
real dollars condition produced 43.38%. An 
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analysis of variance again failed to indicate 
a significant difference (F = 2.81, df = 1/38). 


Combined Analysis 


Since the imaginary dollars and real dollars 
groups in Experiment I differed from the 
imaginary dollars and real dollars groups in 
Experiment II only in terms of the instruc- 
tions, the $4 initial payment group of Experi- 
ment I was dropped from the analysis, and a 
2X2 X2X 4 mixed-design analysis of vari- 
ance was performed on the remaining four 
groups. In this analysis, size of payoffs, 
instructions, and sex served as the indepen- 
dent factors, and blocks of five trials served 
as the repeated factor. The results of the 
mixed-design analysis indicated that subjects 
in the real dollars condition were significantly 
more cooperative than subjects in the imag- 
inary dollars condition (F = 6.30, df = 1/72, 
P < 025). The usual decrement in coopera- 
tive responses over trial blocks was also found 
(F= 24.51, df=3/228, p<.001). The 
effect of the instructions reached borderline 
significance (F = 2.97, df = 1/72, p < .10). 
Although male pairs were somewhat more 
cooperative than female pairs, this effect did 
not reach significance, nor did any of the 
interactions approach significance. An over- 
all summary of the cooperative behavior noted 
in the present experiments and in the Gumpert 
et al. experiment is presented in Table !. 


Discussion 


The results of the present experiments ap- 
pear to stand in direct contrast to those 
reported by Gumpert et al. Real money served 
to increase, rather than decrease, the amount 
of cooperative behavior produced by the sub- 
jects. However, the fact that significant dif- 
ferences were not found until the number of 
dyads reached 80 indicates that the effect, 
though reliable, was not particularly large in 
the present situation. 

In attempting to reconcile the differences 
among these experiments, the authors dis- 
covered that several hypotheses can be readily 
discarded. Experimenter bias as a confounding 
variable does not appear to be a viable expla- 
nation of the differences. In both the present 
experiments and the Gumpert et al. study, 


EFFECTS OF INCENTIVE MAGNITUDE ON COOPERATION 


the experimenter had minimal interaction 
with the subjects, and all instructions were 
given via a tape recorder. 

rhe hypothesis that the instructions af- 
fected the results can also be discarded. Al- 
though there is marginal evidence that the 
original instructions tended to increase com- 
petitiveness, there is absolutely no evidence 
to indicate that they affected the imaginary 
lolars or real dollars groups differentially. 
Similarly, the present experiments provide no 
support for the hypothesis that the $4 initial 
stake caused the reversal. In Experiment I, 
the subjects who received the initial stake 
performed at a level midway between the real 
dollars group and the imaginary dollars group 
that did not receive a stake. 

The remaining differences among the experi- 
ments have to do with the nature of the sub- 
jects and the method of recruitment. 
Gumpert’s subjects were New Yorkers, who 
knew that money was available prior to com- 
ing to the experiment, and who were presum- 
ably students, though of unknown character- 
istics. The subjects in the present experiment 
were San Diego residents, who did not know 
money would be involved in the experiment, 
and who were students in Psychology I classes. 
Oskamp and Perlman (1965) have pointed out 
that cooperativeness in the PD game can be a 
function of size and location of colleges, aca- 
demic involvement produced by small classes, 
and belief in the legitimacy of psychological 
research. Thus, the possibility that the differ- 
ences among the experiments are a function 
of different subject populations cannot be 
ruled out. 

Another possible explanation is that dif- 
ferences in recruitment procedures are re- 
sponsible for the different results. Subjects 
who come to the laboratory in order to earn 
money may be angry and frustrated when 
they learn that the amount of money they 
can earn will be directly influenced by the 
responses of another person. When negative 
payoff values are present in the matrix, as 
they were in these experiments, the possibility 
exists that they will earn no money at all. 
Such subjects may approach the game with a 
rather negative and punitive set. The subjects 
who have been given the $4 initial stake have 
already earned their money and may ap- 
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proach the game with a much more positive 
set. An entirely different set of motives may 
be generated in subjects who participate in 
the experiment for credit and do not know 
that money will be available. Those in the 
imaginary dollars condition may consider it 
just another boring experiment, whereas those 
in the real dollars condition may be delighted 
at the prospect of earning money in addition 
to experimental credit, In this case, it could 
be the real dollars subjects who would 
approach the game with a positive set. 

Although this explanation seems reasonable, 
there is no evidence in the literature to support 
or disconfirm it. In the study by Radlow et al. 
(1968), the experimenters recruited their sub- 
jects through ads and found greater coopera- 
tiveness with real money. The unpublished 
study by Messé et al. (1970) directly tested 
the difference between subjects recruited 
through ads and subjects recruited for credit 
and found that ad-recruited subjects were 
more cooperative. Unfortunately, both studies 
used payoff matrices containing all positive 
payoffs and are thus not comparable to the 
present studies. 

The most important conclusion from the 
present studies seems to be that there is a 
complex interaction among size of reward, 
recruitment procedures, matrix payoffs, ex- 
perimental conditions, and cooperative be- 
havior. It seems clear that simple generaliza- 
tions concerning the effects of real money on 
cooperative behavior are not justified. Until 
more sophisticated theoretical positions are 
developed, each experimental situation must 
be approached individually, and the effects, 
if any, of real money must be determined 
empirically for that specific situation. 
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DISCOVERY-ORIENTED BEHAVIOR AND THE 
ORIGINALITY OF CREATIVE PRODUCTS: 
A STUDY WITH ARTISTS 


M. CSIKSZENTMIHALYI * axp J. W. GETZELS 


University of Chicago 


In order to examine the significance of the “problem-formulation” stage of 
creative activity, 31 advanced art students were observed in a quasi-naturalistic 
setting of an art school while carrying out an assignment to produce a still- 
life drawing. Observations of “discovery-oriented” behavior were recorded for 
each subject from the time he began organizing the still-life objects until he 
completed the actual drawing. The finished art work was independently evalu- 
ated by an expert panel on three dimensions: overall value, originality, and 
craftsmanship. A positive relationship was found between discovery-oriented 
behavior at the problem-formulation stage and the originality (but not the 
craftsmanship) of the creative product. The study aífirms the theoretical and 
empirical importance of the problem-formulation stage of the creative process 
and suggests a method for observing and analyzing behavior at this stage. 


Despite much recent research on creativity, 
perhaps the most critical aspect of the prob- 
lem has eluded systematic inquiry: the process 
of creative production itself. Although the 
literature contains numerous self-reports by 
people engaged in creative tasks (Arnheim, 
1948; Guitar, 1964; Ruitenbeek, 1965; 
Tomas, 1964) or interviews with creative in- 
dividuals regarding their psychological states 
while at work (Barron, 1969; Halasz, 1966), 
and despite the fact that the importance of 
understanding the creative process rather than 
merely recording the characteristics of crea- 
tive people has also long been recognized 
(Ghiselin, 1959; Taylor, 1959), the diff- 
culties involved in obtaining reliable observa- 
tions of such elusive phenomena have left us 
essentially without indicators of the relevant 
variables, to say nothing of empirical data, 
regarding the process of creative production. 

The present report, part of a larger study of 
artistic creativity (Getzels & Csikszentmihalyi, 
1964, 1965, 1966b), attempts to fill this lack 
by (a) delineating a theoretically derived set 
of behavioral variables for the observation 
and measurement of a creative process in a 
“real-life” setting and (5) examining the rela- 
tion of these measures to the quality of the 
creative product. Although the study was 
limited to artistic creativity, it seems likely 

1 Requests for reprints should be sent to Mihaly 
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that the concepts and methods should yield 
useful results in other areas of creative en- 
deavor as well. 

Previous work in creativity has suggested 
the fruitfulness of a conceptual distinction be- 
tween discovered problem situations and pre- 
sented problem situations (Getzels, 1964; 
Getzels & Csikszentmihalyi, 1965, 1966a, 
1967). Problem situations can be meaning- 
fully distinguished in terms of how much of 
the problem is clearly given at the start, how 
much of the method for reaching a solution is 
already at hand, and how general the agree- 
ment is as to what constitutes a good solution. 
At one end of the continuum there are pre- 
sented problems, that is, situations where the 
problem, the method, and the solution are 
already known, and the problem solver needs 
only adopt the “correct” procedural steps to 
arrive at the satisfactory solution. At the other 
end, there are discovered problems, that is, 
situations where the problem itself has not 
yet been formulated but must be identified 
and the appropriate method for reaching a 
solution and the nature of the satisfactory 
solution are unknown. 

It would seem that creative work is the 
outcome of a more or less pure form of the 
discovered problem-solving process. Outstand- 
ing instances of creative achievement involve 
solutions to problems which were not even 
formulated as such, but first had to be identi- 
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fied as problems (Kuhn, 1962). In the words 
of Einstein and Infeld (1938): 


The formulation of a problem is often more essential 
than its solution, which may be merely a matter of 
mathematical or experimental! skill. To raise new ques- 
tions, new possibilities, to regard old problems from 
a new angle, requires creative imagination and marks 
real advance in science [p. 92]. 


This appears to be the situation in art as well. 
The importance of discovering the problem in 
the creative problem-solving process has also 
been noted by Dewey (1910), Ecker (1963), 
and Macworth (1965). 

If the discovery of problems is central to 
the creative process, then a crucial dimension 
of creativity is the disposition of individuals 
to perceive problems as either discovered or 
presented. In a preceding interview study, dif- 
ferences between art students were observed 
in terms of their reported perception of the 
artistic task. This dispositional variable was 
tentatively identified as concern for discovery, 
and it was found to be significantly related 
to the originality of the solutions that the 
subjects produced (Csikszentmihalyi & Get- 
zels, 1970). 

'The present study was designed to answer 
the following question: Is discovery-oriented 
behavior in a real-life situation involving crea- 
tive production related to the assessed crea- 
tivity of the product? 

The form of creative production observed 
was the completion of a drawing. The process 
of producing any work of art implies the 
same questions that any other problem-solving 
task does: What is to be done? How is it to 
be done? When is it completed? These ques- 
tions can be approached either as discovered 
or presented. Furthermore, the manifest proc- 
ess of conceiving and executing a graphic work 
of art can be observed more easily than almost 
any other processes involving creativity; the 
steps in the process lend themselves readily to 
observation and analysis in terms of the dis- 
covery model; and the outcome of the process, 
that is, the finished work of art, can be as- 
sessed as to its creative value with reasonable 
reliability. In view of the theoretical con- 
siderations, it was expected that there would 
be a positive relationship between discovery- 
oriented behavior and the assessed creativity 
of the product. 


METHOD 


Subjects and Procedure 


The sample consisted of 31 male juniors 
seniors at one of the foremost art schools in the 
country. Although the subjects were all in sch 
some had already won prizes in competition wi 
nonstudent artists, some were exhibiting their work 
regularly in public and private galleries, and son: 
had won national scholarships; all had survived at 
least 3 highly competitive years in the art schoo! 
at considerable financial and personal sacrifice. 

Of the data collected in the larger studies (Getze 
& Csikszentmihalyi, 1964, 1965, 1966b), three items 
are relevant to the questions raised in this paper. 
These are (a) an art work from each subject with 
a variable degree of creativity which could be as 
sessed, (b) an assessment by well-known artist 
critics of the degree of creativity of the art worl 
and (c) a set of behavioral measures relating to 
discovery orientation taken while the subject was 
working on the artistic product. 

Although Categories a and b above include the de 
pendent variables of the present study and c con- 
tains the independent variables, for the purpose of 
greater clarity, it will be more expedient to describe 
the dependent variables first. , 

1. The creative product. Each of the 31 subjects 
was asked to produce a drawing in a studio of the 
art school under initially constant conditions. The 
conditions were as follows: Twenty-seven object: 
were placed on a table; among the objects were à 
small human figure, an antique book, a bunch oí 
grapes, a polished gear shift, a floppy velvet hat, etc. 
Each subject was asked—1 at a time—to set up on a 
second table in the same studio a composition that 
suited him with any of the available objects. He was 
then to work on a drawing until he felt that it was 
completed, using a variety of dry media that were 
also made available. No time limit was given. While 
the subjects were involved with their task, an ob- 
server was present with them in the studio, taking 
notes and photographs. 

Subsequent interviews suggested that the subjects 
saw the situation as differing very little from the 
free creative conditions to which they were accus- 
tomed. Twenty-three of the 31 subjects (72%) said 
that the difference from their usual free working 
conditions, if any at all, was minimal. They declared 
that once they began to work, they proceeded in 
their customary way. Four of the subjects (14%) 
declared that the experimental conditions resembled 
a typical art school assignment rather than a free 
creative situation. Only 4 of the subjects (14%) 
reported that they felt uncomfortable in the research 
setting. Given their straightforward responses to 
other questions of the interview, there is no reason 
to assume that the process of producing the experi- 
mental drawing was seen as essentially different from 
their way of working either in their own studios or 
in the studios of the school, 

2. The assessment of the relative creativity of the 
products. The 31 drawings produced by the subjects 
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under the conditions outlined above were given to 
wn artists and artist critics for inde- 
The judges were unaware of the 
. Each judge was asked to make 

first, on the craftsmanship or 
f the product (regardless of its origi- 
on its originality or imaginativeness 
its craftsmanship); and finally, on its 


overall cesthetic value. (How would you rate these 
drawings if you were to award prizes in a competi- 
tive art show?) The ratings were made on a 1-9- 


point scale with a “prenormalized” distribution. 
It was cted that the three dimensions of eval- 
uation would overlap considerably, and this, in fact, 
proved to be the case. The correlation between the 
combined ratings of the five artist critics on overall 
aesthetic value and the combined ratings on origi- 


J)O (product-moment correlation); the 
between overall value and craftsmanship 
:nd the correlation between originality and 
inship was .76. However, as will be seen 
below, the three dimensions are sufficiently distinct 
to yield meaningful insights into the problem of 
creativity. The agreement among the five judges, 
ie reliability of the ratings on each dimen- 
letermined by intraclass correlation. The 
for overall aesthetic value was .29 with 
-of-sum coefficient of .67; for originality 
ficients were, respectively, .31 and .69; 
ip they were .22 and 58 (Getzels & 
lyi, 1969). These correlations, although 
all signifi statistically, are obviously not high. 
The reasons for the lack of closer agreement are dis- 
cussed in detail elsewhere (Csikszentmihalyi, 1965). 
Part of the unreliability is due to the purposely 
open instructions given to the judges and some to 
the deviant ratings of one of the judges, who had a 
very personal interpretation of what was original and 
aesthetically valuable. (He equated both with “hon- 
esty" and "naivete.") In any case, it must be noted 
that the combined score based on the five ratings 
correlated very highly with each of the five individual 
ratings, and thus it may be considered a representa- 
tive measure of each artist's definition of the dimen- 
sion, 


3. The behavioral measures: Discovery orientation. 
The process of discovered problem solving could be 
observed as beginning right after the task was ex- 
Plained to the subjects upon their arrival to the ex- 
perimental studio, Three sets of behavioral variables 
were recorded between the time the task was ex- 
plained and the time that the subjects began to 
draw. These behaviors were called discovery at the 
stage of problem formulation, since they preceded 
actual attempts at solution. The three separate ob- 
servations were categorized and quantified as follows. 
(a) Number of objects manipulated (41): the score 
on this process variable was a count of how many 
of the 27 objects were touched by a subject prior to 
the beginning of drawing. The count was based on 
the record taken by the observer. (b) Uniqueness of 
the objects chosen (A2): this score was based only 
On the objects selected by any given subject and 
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actually transferred by him írom the first to the 
second table. Each object was given a score corre- 
sponding to the rank-order frequency with which it 
had been used by the sample as a whole. Thus, the 
book, the most frequently chosen object, was as- 
signed a score of 1; the lens, chosen by the fewest 
subjects, had a score of 25.5. A subject’s score on 
this variable consisted of the averaged rank score of 
all the objects he selected. (c) Discovery-oriented be- 
havior during selection and arrangement (A3): this 
variable was scored as follows. A score of 1 was 
given if the subject just picked up the objects from 
the first table and placed them on the second table. A 
score of 2 was given if the subject was observed 
holding an object up to his eyes, feeling its weight, 
texture, etc. A score of 3 was given if the subject 
experimented with the objects, for instance, by 
looking through the prism and lens, folding the hat 
into different shapes, trying to work mechanical 
parts, etc. A score of 5 was given when actions for 
both Scores 2 and 3 applied. (d) Total problem- 
formulation score (AA). Whenever a subject was 
above the median for the sample on one of the three 
preceding variables, he was given a score of 1. By 
adding these scores, the total problem-formulation 
score was obtained. A constant of 1 was added in 
order to give a numerical value to subjects who 
otherwise would have had a score of zero. The total 
problem-formulation score had, therefore, a range 
of 14, 

The next set of three observations was made after 
the subject began working on his drawing. These 
variables may therefore be said to represent dis- 
covery at the stage of problem solution. They were 
quantified as follows: (a) Openness of problem struc- 
ture (B1). To score this variable, the sequence of 
photographs taken at 6-minute intervals during the 
experiment was examined. For each subject, that 
photograph was scored which for the first time re- 
vealed the final structure of the drawing in all its 
essentia] elements. The score consisted of the number 
of minutes which elapsed between the beginning of 
the experiment and the taking of the given picture, 
divided by the total time in minutes taken for the 
drawing as a whole. The score is, therefore, the per- 
centage of the total drawing time elapsed before the 
final structure of the drawing was essentially com- 
pleted. (b) Discovery-oriented behavior while draw- 
ing (B2). This was scored as follows: The subject 
received a score of 1 if he just drew without inter- 
ruption. The subject received a score of 2 if he 
changed paper or switched from one medium to 
another. The subject received a score of 3 if he 
changed the arrangement of objects, substituted, or 
manipulated the objects. A score of 5 was obtained if 
both 2 and 3 applied. (c) Changes in problem struc- 
ture and content (B3). A score of 1 was received if 
the subject just copied the arrangement. A score of 2 
was received if he introduced changes in perspective. 
A score of 3 was received if he changed the relative 
magnitude of objects, or left one or more of the ob- 
jects out. A score of 4 was received if he changed the 
position of objects on the paper. A score of 5 was 
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TABLE 1 
CORRELATION MATRIX OF Six Discovery-Process VARIABLES AND Two SUBTOTALS 
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Problem formulation Problem solution 
Item 
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| 
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Problem formulation 
Al 
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A3 48*** 22 
AA 64**** .60**** 77**** 
Problem solution F 
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B2 .05 14 18 A3 —41 
B3 .30* .34* .56*** .48** .02 Ade 
BB 26 14 35" .34* .34* Aut rg 43** 
Note.—N = 31. 
* > «.05. 
** 5 < 01. 
we p c .005. 


e p < .0005. 


received if there was an addition of nonexistent ob- 
jects or major visual elements. A score of 9, if both 
actions for Scales 4 and 5 applied. (d) Total problem- 
solution score (BB). Whenever a subject was above 
the median for the sample on one of the three pre- 
ceding variables, he was given a score of 1. By 
adding these scores, the total problem-solution score 
was obtained. A constant of 1 was added in order 
to give a numerical value to subjects who otherwise 
would have a score of zero. The total problem-solu- 
tion score had, therefore, a range of 1-4. 

The reliability of the scoring of the six process 
variables and the two subtotals was tested by having 
two raters assess each protocol; the product-moment 
correlations between the two raters ranged from 82 
to .98; none of the significance levels rose above a 
percentage of more than .0005. It is to be noted that 
scores on Variables Al, A3, and B2 were based 
exclusively on the record compiled by the observer. 
Reliability in recording the observations was not 
established; however, the variables in question are 
of such self-evident nature that little or no error in 
their recording seems to have been possible. 

The internal correlations between the six process 

variables and two subtotals are presented in Table 1. 
It is clear from the pattern that while some of the 
variables (A1, B3) have a relatively high degree of 
commonality with the others, the overall overlap is 
not so large as to make any variable redundant. 
In other words, it is possible for a subject to en- 
gage in what appears to be discovery-oriented be- 
havior at one stage of the problem-solving process 
without doing so at either an earlier or at a later 
stage. 

There are two main reasons why these six variables 
were chosen to measure the theoretical construct of 
discovery orientation. In the first place, they are 
genuine variables that occur “naturally” whenever 
artists approach their task. Second, each variable 
was derived from the model of discovery orientation, 
as follows. The sucessful envisagement of a discov- 
ered problem would require that (a) a large number 


of problematic elements be considered before 
problem becomes structured (A1); (b) either ur 
usual elements are to be selected as foci of the prob- 
lem (A2), or (c) a more usual element is selecied, 
but it is thoroughly explored through multiple sen- 
sory channels before it becomes a parameter oí ihe 
problem (A3). 

The successful solution of a discovered problem is 
seen as requiring the following processes: (a) After 
the problematic elements are selected, the problem 
begins to be stated, and solutions are attempted. It 
seems important that the problem solver should not 
view his problem as having achieved structure too 
early, lest the problem fail to develop along original 
lines (B1). (b) As the problem is stated, various 
forms of solutions are tried out, some of which 
involve old methods, others involve the discovery of 
new methods (B2). (c) During the solution of the 
problem, the problematic elements lose their ac- 
cepted interpretation—their structure and symbolic 
content undergo hitherto unforeseen transformations 
which recombine into a new meaningful whole (B3). 

These steps, of course, are not intended to exhaust 
the requirements for a successfully discovered prob- 
lem-solving process. 


RESULTS 


The data in Table 2 present the relation- 
ship between the amount of discovery-oriented 
behavior observed during the process of ar- 
tistic problem solving and the evaluation of 
the ensuing product. The results follow, in 
general, the expected direction. More specifi- 
cally, five of the six behavioral variables were 
significantly related to the “originality” of the 
resulting product, four of the six variables 
were related to its “overall aesthetic value,” 
and two of the six reached significance when 
correlated with the “craftsmanship” dimen- 


TABLE 2 


CORRELATIONS BETWEEN DiscCVERY-PROCESS 
/ VARIABLES AND ÉVALUATION OF THE 
Artistic PRODUCTS BY Five 
ARTIST CRITICS 


Dimension of evaluation 
| 
= 7 
- Process variable Overall | oripinatty | Crafts- 
71 aesthetic Originan manship 
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5 raters) re qx 5 raters) 
_ Problem formulation | 
[ Al—Manipulation Age sneer | 16 
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A3—Exploration ET d | .5geeeek .34* 
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— Problem solution 
— Bi—Structure time .09 .08 —.18 
— B2— Exploration 22 ry bad 01 
y p oe dae | o eee 37** 
_-BB—Tot 21 sae 12 


Note.—N = 31. 


ween discovery-oriented behavior during the 
- formulation of the problem and the originality 
- of the drawing, as rated by the judges. Of the 
_ six variables, the only one that did not support 
_ the expectation even in part was the one we 
have called “openness of problem structure" 
(B1). 
Tn view of the results reported in Table 2, 
and keeping in mind the fact that the three 
2 dimensions of evaluation—overall aesthetic 
value, originality, and craftsmanship—share 
a great amount of common variance, one 
‘might question the real relationship between 
_ the discovery variables and each dimension of 
evaluation taken individually, shorn of the 
Variance held in common with the others. 
- Table 3 shows rather clearly that when the 
overlapping variance of the three dimensions 
_ of evaluation is artificially removed through 
- the statistic of partial correlations, the only 
- significant positive correlations that remain 
. are those between the discovery-process varia- 
= bles (in particular at the stage of problem 
formulation) and the originality of the prod- 
. uct. The high negative correlation (r = —.50) 
between discovery-oriented behavior at the 
stage of problem formulation and the rating 
- of overall aesthetic value is probably due to 
| the near-perfect correlation between the rat- 
ings of overall aesthetic value and originality 
- (7 = .90). Once the commonality of the two 
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variables is artificially removed by means of ; 
partial correlation, the small residual variabil- — | 
ity of the rating of overall aesthetic value is 
negatively related to discovery orientation. 
The small amount of residual variance may 
reflect rater error almost entirely, thus pro- 
ducing meaningless correlations. Alternatively, 
the negative correlation might suggest that in 
the present cultural climate, originality is a 
more important criterion of creativity than 
what was measured by the rating of overall 
value. 


DISCUSSION AND CONCLUSION 


The operational measures of discovery ori- 
entation introduced in this study appear to be —— 
useful tools for the study of creative produc- 
tion. Of special importance is the operation- 
alization of the concept of discovery at the — | 
stage of problem formulation, the importance 
of which has been previously recognized in —— 
theory, but has remained inaccessible to em- — 
pirical observation. A 

A question raised by the present study is 
whether the same relationship between discov- — 
ery orientation and the originality of the 
product obtains in other fields of creative en- 
deavor, such as mathematics, the sciences, 
poetics, statesmanship, and so on. While the 
present study cannot shed light on this issue, 
the method employed here allows for an em- 
pirical answer to it, once the measurement of 
discovery is appropriately reformulated, so as 
to be applicable to problem solving in dif- 
ferent fields. 

Perhaps a word should be added to clarify 
our view as to the nature of the discovery- 
process variables. The experimental conditions 


TABLE 3 


SECOND-ORDER PARTIAL CORRELATIONS BETWEEN 
THE DisCoVERY-PROCESS-VARIABLE TOTALS 
AND THE EVALUATION OF THE PRopUCT 
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used in the setting of this study seem to have 
brought out in the open and made observable 
a process which normally goes undetected in 
the artist’s awareness whenever he is involved 
in the production of a work of art. Under 
ordinary circumstances the artist does not in- 
spect, touch, or manipulate concrete objects 
before he decides what to paint, but he does 
consider, weigh, and analyze feelings or sen- 
sory material in his awareness. We have as- 
sumed for the purposes of this study that the 
two processes—the manifest and the latent— 
follow similar laws, and we used the former 
as an index of the latter. A more complete 
elaboration of this point has been presented 
elsewhere (Csikszentmihalyi, 1965; Csikszent- 
mihalyi & Getzels, 1970). In any case, surely 
one should not extrapolate from the observed 
relationships any implication that touching 
many objects, choosing unique objects, etc., 
“causes” originality. 

A less basic, but unresolvable, question con- 
cerns the universality of the findings in time 
and space. Would discovery orientation be 
related to the originality of the painting of 
artists living a hundred years in the past or 
in the future? Cultural trends wax and wane 
with regard to the importance allotted to origi- 
nality as a component of the creative process. 
Tt could perhaps be safely said that in history 
originality has more often been seen as hinder- 
ing rather than as fostering great accomplish- 
ment. 

An awareness of cultural fluctuations in the 
value attributed to originality helps to point 
out the strengths and limitations of the con- 
cept of discovery. The attitude of discovery 
orientation in problem solving might not 
eventuate in a “successful” product except 
under sociocultural conditions that reward 
originality. The findings of this study suggest 
the conclusion that in a cultural milieu which 
strongly equates originality with overall value, 
an attitudinal and behavioral approach to 
problem solving characterized by discovery 
orientation leads to results which will be con- 
sidered valuable; and it presents a method by 
which such orientation can be empirically 
measured. 
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This study examined the effects of having models dissimulate different levels 
of pain susceptibility on observers pain thresholds. Subjects and a confederate 
model ostensibly received identical electric shocks, increasing from an unde- 
tectable level in increments of .5 milliampere up to the level at which the 
subject reported pain. Making responses contingent upon the subject's ratings, 
the model either delayed or hastened advances along a 5-point rating scale 
of the severity of the experience. Observing a model tolerate pain led to pain 
thresholds better than three times greater than the thresholds reported by 
subjects observing a model who was less tolerant. The findings are discussed 
in the context of the effects of social influences on pain reports and theories 


of vicarious experience. 


has accumulated to show that 
and verbal reports of noxious 


perc 

stimuli are the consequences of complex physi- 
ological, subjective, and social cues (Stern- 
bach, 1968). Interpretation and judgment rep- 
resent "portant determinants of reports of 
pain : dis omfort, to e extent that in- 
variant psychophysical relationships do not 


st between the physical parameters of 
aversive stimuli and the subjective experi- 
ences of their recipient. 

Studies of modeling and vicarious rein- 
forcement processes have suggested the possi- 
bility that reports of pain may be amenable 
to control through exposure to variably tol- 
erant models (Bandura, 1969; Kanfer & Phil- 
lips, 1970). Changes in personal discomfort 
measures previously demonstrated to be 
amenable to control through the use of the 
modeling paradigm include yariations in hos- 
tility (Ditrichs, Simon, & Greene, 1967), 
phobias and childhood aversions (Bandura, 
Blanchard, & Ritter, 1969; Bandura & Men- 
love, 1968; Geer & Turteltaub, 1967), auto- 
nomic changes (Craig & Lowery, 1969), avoid- 
ance learning (Craig, 1967), and other verbal 
and motoric skills. Consequently, it was postu- 
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lated that exposing subjects to models mani- 
festing either high or low levels of tolerance 
ior noxious stimulation would influence the 
subjects in similar directions. Electric shock 
was chosen as the noxious stimulus because it 
could be administered in discrete trials along 
continuous, readily controllable parameters. 


 Mxzruop 

Several alternative methodologies suggest them- 
selves for varying the temporal relationship between 
the subjects’ and models’ responses. To maximize 
modeling effects, it was decided to have the subject 
and model ostensibly receive identical shocks on each 
presentation, with the model indicating his subjective 
reaction immediately after the subject responded. 
Reactions were limited to the selection of one of five 
switches graded along a scale varying from “unde- 
tectable” to “pain.” Selecting the latter switch ended 
each trial, Subjects were randomly assigned to one 
of three groups, with the groups varying according 
to whether the model rated the shock to be more 
intense than the subject, less intense than the sub- 
ject, or in a manner that was not contingent upon 
the subject’s response. 


Subjects 


The subjects were 30 male volunteer students from 
the University of British Columbia, who did not 
have prior knowledge concerning the conditions of 
the study. The mean age was 18.7 years, and the 
range was from 18 to 38 years. The same paid male 
model and female experimenter were used for all 
groups. 


Apparatus 


Electric shock was generated by a constant-current 
stimulator and delivered to the volar surface of the 
subject’s right forearm through concentric electrodes 
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(Tursky, Watson, & O'Connell, 1965). Gritty elec- 
trode paste was used to remove dead epidermis and 
to reduce the variability in the subjects' skin re- 
sistances. The electrode was positioned so that the 
resistance between the two poles of the electrode on 
the surface oí the arm was reduced to 5,000 ohms 
prior to each series of shock administrations. The 
resistance was returned to this level, if necessary, at 
the end of each series of shocks. There were no sig- 
nifücant differences between groups in resistance 
changes over series. 

All subjects received initial shock intensities of 0 
milliamperes, with each subsequent shock uniformly 
increasing by .5 milliampere. Shock durations were 
.5 second. Subjects rated shock intensities by depress- 
ing one of five microswitches mounted on a small 
console placed in front of the subject and confederate. 
The confederate simulated ratings using the same 
switches. The switches, when depressed, illuminated 
red jewel lights above printed labels reading from 
left to right: (5) undetectable; (4) slightly de- 
tectable; (3) detectable; (2) questionably painful; 
(1) painful. 


Procedure 


The confederate arrived immediately after the sub- 
ject, and both were queried as to age, academic year, 
and major, with the confederate posing in all in- 
stances to be a 21-year-old second-year business 
student, changing his major to English. The subject 
was seated to the confederate’s right. A wooden 
screen restricted their view of each other in such a 
way that each could see only the other’s left hand 
as he selected a switch. Both were advised of the 
experimenter's interest in how “mild electricity in- 
fluences your perception of when a shock becomes 
painful.” The shocks were to be increased in in- 
tensity “from levels at which you cannot feel the 
shock through to the level at which you report 
pain,” at which point no additional shock would be 
administered, Participation in the study was then 
described as completely voluntary, and all subjects 
were given the opportunity to withdraw then or at 
any subsequent time during the experiment. No sub- 
jects withdrew. Electrodes were then attached, and 
the subjects were requested to not move their arms. 

They were then advised: 


You'll notice that each of your electrodes is con- 
nected to this box. This apparatus regulates the 
voltage, amperage, frequency, and other physical 
characteristics of the electric shock. On each trial 
I will press a switch, you will hear a click, and 
both of you will receive the same shock at the 
same time, The initial shocks may be so slight as 
to be undetectable; some you may feel as a warm 
tickle, some may be stronger and somewhat pain- 
ful. All I want you to do is to evaluate the in- 
tensity of each shock and indicate your response 
by means of one of these light switches. 


The labels were then read and pointed out, and 
they were advised that the recording equipment re- 


KENNETH D. CRAIG AND STEPHEN M. WEISS 


quired the subject seated on the right to respond 
first. Six series of shocks were to be administers! 
ostensibly to study variations in shock characteri 
A minimum of 10 seconds was interposed bet 
each shock administration, At the end of each 

of shocks, the experimenter advised them that 
physical characteristics of the shock were t: 
altered, and she simulated the alteration by moy 
irrelevant dials on the shock stimulator. The < 
federate’s electrode was not attached to the stimu- 
lator. A questionnaire examining the subject’s ex 
periences was administered after threshold determina- 
tions were completed. 


Confederate’s Instructions 


Group treatment differences were defined solely b 
different relationships between the confederat 
switch selection and that of the subject. Onlv the 
confederate had prior knowledge of the subject 
group assignment. Pretesting to produce realistic pro- 
cedures led to the following rules for the confederat 
responses. 

For the shock-intolerant group, the confederate 
endeavored to reduce the pain thresholds by dis- 
simulating intolerance for shock by selecting swit 
that indicated a higher rating of shock intensity t 
those made by the subject. Switch 5 was matched 
for the first shock, followed by repeated selections 
of Switch 4 until the subject matched this selection 
At that time the confederate remained there for one 
shock and advanced to Switch 3. This procedure was 
continued until Switch 1, If the subjects reversed to 
less intense switches, the confederate remained at the 
same level until the subject again matched him, and 
then the confederate advanced as above. If the sub 
ject were to skip above the confederate, advancement 
would proceed at a rate of one switch per shock until 
the confederate again led the subject, at which time 
the above rules were followed. 

For the shock-tolerant group, the confederate en- 
deavored to increase the pain thresholds by selecting 
switches at less intense levels than the subject. His 
switch selections remained on Switch 5 until the sub- 
ject selected Switch 3, at which time he remained at 
Switch 5 for one shock and then advanced to Switch 
4, He remained there until the subject advanced to 
Switch 2. After the subject advanced, the confederate 
would remain at the switch that had been selected 
for one shock and then advance. If the subjects were 
to reverse to a switch indicating less intensity, the 
confederate would remain at his previous level until 
the subject moved ahead again, and would then 
follow the above procedure. If the subjects were to 
advance after only one trial at a given switch, the 
confederate would advance one switch as usual, re- 
main there for two shocks, advance, and remain 
there for two shocks until he was again one switch 
behind the subject, After the subject selected the 
pain switch, shock was no longer administered to him, 
and the confederate moved ahead at the rate of two 
shocks per button until the pain switch was reached 


e 
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A noncontingent relationship between the sub- 
cts and confederate's responses was programmed 
the control group. In an effort to have the con- 
te respond in a manner that would represent 
; ical, uninfluenced, naive subject, the mean 
"number of switch selections for each switch on each 
series was calculated for all prior subjects. The con- 
— federate then used this to determine the frequency 
with which switches were selected. 


RESULTS 


Analyses of variance were performed over 
the following variables: experimental group, 
_ the four categories of rated intensity prior to 
— the rating of pain, and changes over time as 
‘they occurred during the six series of shock 
presentations. Dependent variables comprised 
the frequency of switch selections and two 
indexes of response certainty—the frequency 
$ of reversals from higher to lower level switches 
and the number of switch selections out of 
_ the expected sequence of responding from 
lower to higher levels of experienced pain. 
Table 1 presents the summary of the analy- 
i, sis of variance for the frequencies of responses 
- to the different switches across groups and 
= series, Significant differences were associated 
— with the effects of different group treatments 
© (p< 01), the switch selected (p < .01), and 
the Groups X Switches interaction (p < .01). 
"The mean shock intensities received before the 
"painful" switch was selected on each of the 
six series of shocks are shown in Figure 1. 
The model’s behavior had a powerful ef- 
- fect on the pain-threshold determinations. 
— Over all series, the control, shock-tolerant, 
and shock-intolerant groups accepted mean 


TABLE 1 
ANALysis OF VARIANCE FOR FREQUENCIES OF SWITCH 
SELECTION ACROSS GROUPS AND TRIALS 


Source df MS F 
Groups (A) 2 577.68 9.59* 
Error between 27 60.24 
Series (B) 5 3.43 «1 
AXB 10 3.75 <A 
Error B 135 5.08 
Switches (C) 3 505.64 40.26* 
AXC 6 43.28 3.45* 
Error C 81 12.56 
Exc 15 11.10 1.61 
BAXBXC 30 4.91 «1 
Error residual 405 6.90 


* p «ot, 
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Fic. 1, Current intensities received on the last shock 
of each series. 


shock intensities of 6.35, 8.65, and 2.50 milli- 
amperes, respectively, before selecting the 
pain switch. Multiple comparisons revealed - 
that differences between individual pairs of. 
the three groups were significant ($ < .01) 
on all trials. Pain thresholds did not change 
significantly over time, despite, as Figure 1 
suggests, the apparent tendency for the shock- 
intolerant group’s thresholds to decline over 


the six trials. 
The significant Groups X Switches interac- 
tion indicated that the three groups differed 


4Tn the interests of an independent replication of 
the findings, one of the authors (KDC) served as the 
experimenter for an additional 10 University of 
Calgary male undergraduates, who were run through 
the shock-tolerant procedures previously outlined. A 
different 25-year-old male undergraduate served as 
the confederate. He was not apprised of the study’s 
purpose or expectations, but was required to mem- 
orize and practice the confederate's role in advance 
and volunteered an understanding of the nature of 
his role after 1 subject had been run. Table 2 pre- 
sents the frequency with which each individual sub- 
ject selected a switch before progressing to each sub- 
sequent switch throughout one series of shocks and 
the level of shock received on the final shock admin- 
istration prior to the subject’s selection of the pain 
switch, The mean level of shock accepted on the Jast | 
shock presentation was greater for the replication 
group than for the original shock-tolerant group. 
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Shock 
Tolerant 


SELECTION 


Shock 
Intolerant 


MEAN FREQUENCY OF SWITCH 


5 4 3 2 


Undetectoble Stightly Detectoble Questionably 


Detectobie Paintul 
SWITCHES 


Fic. 2. Mean frequencies for the selection of switches. 


in their preferences for the four separate rat- 
ings of shock intensity. Figure 2 presents the 
group mean frequencies for the selection of 
the different switches. Since all series of shock 
presentations ended for subjects with the 
selection of the switch labeled painful, this 
switch (Switch 1) was selected only once for 
each series, and data referring to it were not 
plotted in the figure. The undetectable switch 
was used infrequently by all groups, suggest- 
ing that low shock-current levels are detectable 
by most subjects. The shock-tolerant and con- 
trol groups were markedly different from the 
shock-intolerant group in their frequencies of 


TABLE 2 


FREQUENCIES OF SWITCH SELECTIONS FOR 
INDIVIDUAL SUBJECTS AND FINAL 
SHOCK INTENSITIES 


Switch 
Subject 3 
Unde. | Slightly | Detect- | Qu Final 
tectable | detect- able, tlonably intensity 

1 1 1 6 8 | 80 

2 1 1 21 11 17.0 

3 3 8 8 15 | 17.0 

4 1 1 6 5 65 

5 2 12 10 4 | 140 

6 1 6 13 13 16.5 

7 3 6 20 3 16.0 

8 4 2 3 9 9.0 

9 0 1 4 123885 
10 1 3 13 5 | 110 
Group M 1.7 4.1 10.4 8.5 12.35 

| 


f 
| 
| 


selecting the switches labeled “detectable” and 
“questionably painful.” 

Analyses of variance of the incidence of re- 
versals in responding that were contrary to 
the expected sequence of moving from the 
undetectable switch to the pain switch re- 
vealed no effects for the different switches or 
trials; however, a small but nonsignificant 
tendency appeared (F = 2.72, df — E 
< .10) for the control group (X = 1.6 per 
series) to reverse the sequence more often 
than the shock-tolerant (X — 0) or shock- 
intolerant groups (X — .7). Similarly, the 
groups tended to differ on the total number of 
responses out of the expected sequence of 
moving from the undetectable to the painful 
switch (F — 3.24, df — 2/27, p « .10), with 
the groups having means of 0, 1.9, and 3.3, 
respectively. 


iy 


Postexperimental Questionnaire 


Subsequent to the pain-threshold determ :a- 
tions, a questionnaire was administered iat 
required ratings of (@) the severity of pain 
experienced on the last series; (b) the degree 
of difficulty involved in making the judgment 
as to whether the shock was painful or not; 
(c) the subject's estimate of his everyday 
sensitivity to pain; (d) the extent to which 
the partner was more or less tolerant of the 
pain than the subject; (e) the extent to which 
their pain ratings were influenced by the 
partner; and (f) the extent to which their 
partner was influenced by their own responses. 
All rating scales had anchoring phrases or 
adjectives indicating extreme responses. 

No differences appeared between the groups 
on their estimate of the severity of the pain 
experienced on the last series, despite the 
large differences in shock intensity. Mean rat- 
ings tended to be toward the middle of the 
7-point scale, varying from "mot painful" to 
“very painful” (for the shock-tolerant group, 
X = 3.9; for the control group, X = 4.3; and 
for the shock-intolerant group, X = 4.5). 

Estimates of the difficulty involved in judg- 
ing the painfulness of the shock differed be- 
tween groups (F = 8.04, df = 2/27, p < .01), 
with the shock-tolerant and shock-intolerant 
groups not differing from each other and 
tending toward the “very easy" end of the 
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scale (X — 2.9 and 3.0, respectively), but 
differing significantly (p < .01) from the con- 
trol group (X — 5.1). 

s of everyday sensitivity to pain 
er among the groups and were ap- 


parently not influenced by the experimental 
variables 

The zroups acknowledged differences in the 
“par " pain tolerance on a 15-point rat- 
ing varying from “less tolerant" at the 
low end, to “same” at the middle, to “more 
tolerant” at the upper extremes (F = 21.70, 


2/27, p< O1). The shock-intolerant 
cribed limited tolerance (X = 2.9), 
ol group tended to suggest similarity 
elves (X = 8.6), and the shock-tol- 
oup described considerable tolerance 
.1). Differences between all pairs of 
were significant (p < .05). The order- 
ched the model’s portrayed levels of 
arance and demonstrated the subjects’ abil- 
scriminate model characteristics. 

ts tended to disavow any influence of 
el on their ratings, with all groups 
g toward the “not at all” end of the 
scale; however, the groups did differ 
significantly (F = 6.21, df = 2/27, p < .01), 
with the shock-intolerant group indicating the 
least influence (X = 1.7), the shock-tolerant 
group admitting to slightly but insignificantly 
more influence (X = 2.3, p > .10), and the 
control group acknowledging moderately more 
influence than either of the other groups (X 
=3.4, p< .05). This contradicted expecta- 
tions since the model’s behavior for this group 
was independent of the subject’s performance; 
however, the potency of the modeling effects 
suggested that even in this group, some in- 
fluence was highly probable. 

Although the subjects tended to feel that 
their behavior had little influence on their 
partner, differences between groups appeared 
(F =3.64, df = 2/27, p< 05), with the 
subjects in the control group attributing more 
influence to themselves (X — 3.6) than those 
in the shock-tolerant (X — 3.0) or shock-in- 
tolerant (X — 2.0) groups, despite the ab- 
sence of contingencies between the subject's 
and the model's behavior in the control group. 

The differences between group ratings on 
the latter two questions provided an index of 


7-poin 


ry mM INPUNE SITE 


the extent to which the subjects felt that they 
had control over the confederate’s ratings, as | 
opposed to the confederate controlling their 
own ratings. The direction of the mean rat- | 
ings for all groups, as they appear above, b 
suggests that the subjects felt that they had 
more influence than the confederate, with the — 
shock-tolerant group acknowledging more in- 
fluence (Mair = .7) than the shock-intolerant — 
(Maye = 3) or the control groups (Matt = | 
.2); however, differences between groups were 
not significant. 


Discussion > 
l 

Having a model ostensibly react to the - 
identical electric current intensity as if it were | 
more or less painful than the subject’s reported 3 
experience had a potent influence upon the 
subject's subsequently reported experience of 
pain. When the model moved rapidly along 
the rating scale toward the switch labeled 
*Pain," the subjects followed suit; whereas, 
when the model delayed advancing along the 
scale, the subjects delayed advancing as well. 
The shock-tolerant group reported 
thresholds at substantially higher levels than 
the noncontingent control group, whereas the 
shock-intolerant group reported pain thresh- 
olds at a substantially lower level than the 
control group. The social context and the — 
example of another dictated the reported pain 
thresholds to a considerable extent. 

The findings were consistent with work 
from other laboratories, indicating that verbal 
reports of pain are not linearly related to 
electric shock intensity but represent com- 
plex reactions to physiological, experimental, 
and social cues. Social factors and situational 
cues related to pain reports and tolerances in- 
clude situational stress levels (Kornetsky, 
1954), attentional distractors (Kanfer & 
Goldfoot, 1966), social manipulations of the 
perceived origins of the pain (Nisbett & 
Schacter, 1966), variations in the instructions 
for reporting pain (Blitz & Dinnerstein, 1968), 
variations in responsibility for controlling the 
treatment of the pain (Moss & Meyer, 1966), 
differences in reference-group standards (Buss 
& Portnoy, 1967; Lambert, Libman, & Poser, 
1960), prior cognitive rehearsal of an imagined 
noxious experience (Neufeld, 1969), and 
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instructions designed to produce cognitive 
dissonance (Zimbardo, Cohen, Weisenberg, 
Dworkin, & Firestone, 1966). 

The present findings are consistent with 
the work of others who have shown that pro- 
viding phobic individuals or children with 
strong aversions and the opportunity to ob- 
serve others in nonharmful or pleasurable con- 
tact with feared objects leads to subsequent 
approach behavior (Bandura et al., 1969; 
Bandura & Menlove, 1968; Geer & Turtel- 
taub, 1967). In the present study the vicarious 
experience elicited not only approach to events 
usually deemed to be aversive but also gen- 
erated avoidance of normally tolerated events. 

It may be suggested that witnessing the ab- 
sence of anticipated aversive consequences to 
another would extinguish the observer’s previ- 
ously established emotional responses (Ban- 
dura, 1969). Since the model and the subjects 
were putatively receiving the same levels of 
shock, it was difficult to hypothesize the ac- 
tion of vicarious extinction processes. The ef- 
fects of shock generally would be considered 
to be incompatible with the extinction of con- 
ditioned emotional arousal. However, a physio- 
logical model for vicarious extinction still 
would appear to be possible, since the physio- 
logical components of vicarious arousal and 
arousal generated through direct noxious stim- 
ulation apparently are not coextensive, Craig 

(1968) and Craig and Lowery (1969) have 
indicated that in some instances the patterns 
of vicarious and directly instigated autonomic 

arousal do not correspond to each other. 

Another interpretation of the findings would 
suggest that the task was one of discrimina- 
tion learning, whereby immediate feedback 
regarding the model’s perception of the shock 
provided information specifying standards of 
conduct appropriate to this ambiguous lab- 
oratory situation. Although the model was not 
endowed with any special legitimacy or au- 
thority through instructions or his appear- 
ance, his actions conveying tolerance or in- 
tolerance for the shock probably provided him 
with a rather dramatic social image. Attending 
to the model could have elicited a variety of 
cognitive responses, including reassurance that 
the shocks were not as aversive as anticipated, 
a feeling of competitive challenge, or an 


unwillingness to deviate strongly from the 
model’s standards because of the risk of the 
model’s or experimenter’s disapproval. The 
presence of the experimenter who knew the 
study’s hypotheses would be expected to con- 
tribute to the demand characteristics of ‘he 
situation (Rosenthal, 1966). 

Moderate degrees of information regarding 
the physical characteristics of the shock aj 
parently made the judgmental task more dif- 
ficult, since reversals and out-of-sequenc 
switch selections were most frequent at the 
moderate levels of shock intensity. Modeling 
effects were strongest at these levels of in- 
tensity as well. Changes in the experienced 
aversiveness of the shock apparently did not 
occur through repeated exposures to the 
model’s standards, since the pain thresholds 
were influenced strongly on the first trial and 
remained relatively constant over trials for 
all groups. 

In naturally occurring pain situations, imi- 
tative effects may be even more striking. 
Stress in the present situation may have been 
reduced through the subject's knowledge that 
severely painful shock would not be admin- 
istered. Further, the shock levels were under 
the control of the subject. The subject was 
to concern himself solely with the psycho- 
physical judgment, rather than with the many 
other variables that would be present in natu- 
ral situations. Finally, in normal situations, 
models of enhanced status may be present, 
whereas in the present study the model was 
represented as a peer without outstanding 
qualities. Indeed, the modeling effects are all 
the more striking, since the model did not 
have any remarkable characteristics beyond 
his tolerance or intolerance for shock. 

A final question relates to whether the 
verbal reports of pain thresholds were reflec- 
tive of the subjective intensities of the pain. 
Conceivably, the subjects may have sup- 
pressed reporting their actual experiences so 
as not to contradict the model, and they may 
have fabricated a series of tolerant or in- 
tolerant responses in order to be consistent 
with the model. The questionnaire provided 
strong confirmation of the supposition that 
the verbal reports were indicative of the sub- 
jective experience. The groups did not differ 
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in their reports of pain severity. Threshold 
differences cannot be attributed to purposeful 
decisions to conform unless the subjects were 
fabricating several questionnaire responses, 
including ratings of the pain and the general 
denial that the model’s responses had been in- 


fluential. If there had been an attempt to 
dissimulate, it would seem improbable that 
the subjects would deny conforming but admit 


to perceiving the different levels of model 
tolerance. Hilgard (1969) argued that in the 
absence of acceptable, alternative physiologi- 
cal or behavioral indexes of pain intensity, 
verbal reports may suffice, since they conform 
to power functions observed in other psycho- 
physical-perceptual modalities. 
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AVAILABILITY OF AVOIDANCE BEHAVIORS IN 
MODULATING VASCULAR-STRESS RESPONSES’ 


JACK E. HOKANSON,? DOUGLAS E, DeGOOD, MARVIN S. FORREST, 
axp THOMAS M. BRITTAIN 
Florida State University 


Two studies are reported which evaluated the arousal-reducing function of 
several cognitive factors in stressful situations. In the first study, subjects 
underwent an aversive 4-hour shock-avoidance procedure, in which the experi- 
mental subjects had optional control over rest periods, and a group of yoked 
controls had comparable rest periods imposed on them. The finding that the 
experimental group manifested substantially reduced systolic blood pressure 
levels relative to their yoked counterparts suggests that the availability of an 
avoidance response has an arousal-reducing effect. In the second study, the 
rest periods were preceded by a reliable signal. The results indicated that an- 
ticipatory systolic reductions occurred in conjunction with the prerest signal 
and that these reductions varied as a function of the length of the subsequent 


rest period. 


The last several years have witnessed an 
increasing concern with the effects of cogni- 
tive factors in the modulation of reactions to 
pain and stress. Within the framework of 
learning theory, Mowrer’s (1960) concepts of 
fear decrement and hope and Solomon’s 
(Seligman, Maier, & Solomon, 1970) safety- 
signal mechanism connote cognitive mediation 
in escape and avoidance paradigms. Melzack 
and Wall (1965) explicitly dealt with the role 
of evaluation and cognition in their treatment 
of the physiology of pain; and within the 
realm of personality theory, Lazarus (1966) 
has emphasized the functions of appraisal of 
threat and the availability of coping behaviors 
in determining stress-related responding. With 
regard to theories of emotions, Schachter’s 
(1964) model emphasizes the relationship be- 
tween cognitions, physiological arousal, and 
the labeling of affective states. In this vein, 
Ross, Rodin, and Zimbardo (1969) have re- 
cently demonstrated how the manipulation of 
cognitive cues can serve in the reduction of 
fear. 

One thesis which is generally common to 
each of these approaches is the hypothesis 
that the anticipation of relief from noxious 
stimulation is stress reducing. It would appear 

1 This research was supported by United States 
Public Health Service Grant MH 14030-01. 

2 Requests for reprints should be sent to Jack E. 


Hokanson, Department of Psychology, Florida State 
University, Tallahassee, Florida 32306. 
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that this hypothesis can be meaningfully sub 
divided into at least two parts, depending on 
the source of information regarding the po- 
tential reduction of noxious stimulation. Oe 
subhypothesis suggests that the occurrence oi. 
an environmental cue which signals removal 
of noxious events should reduce stress re- 
sponses (external cue). The other suggests 
that the availability of avoidance, or escape, 
responses, which can be invoked by the or- 
ganism, thereby removing  aversiveness, 
should also be stress reducing (internal cue). 
Taken together, one might argue further that 
internal cues should have the more poten! 
stress-reducing effect, since the utilization of 
avoidance behavior is typically under the sub- 
ject’s control, whereas the occurrence of ex- 
ternal safety signals is usually dependent on 
non-subject-controlled environmental circum- 
stances.* 

The purpose of the present paper was to 
evaluate three hypotheses which stem from 
the above discussion: (a) that the availability 
of avoidance responses reduces indicants of 
stress; (b) that the anticipation of the re- 
moval of noxious stimulation reduces indicants 
of stress; and (c) that cues of impending 
removal of aversiveness which are controlled 

3 This discussion presupposes that the reliability 
(predictability) of cues is controlled. For a discus- 
sion of this important but, for the moment, irrele- 


vant variable, see, for example, Pervin (1963) or 
Seligman (1967). 


3 


"the subject produce greater decrements in 
ress than do cues over which the subject 
has no control. These hypotheses were as- 
ied via two studies. 
Stupy 1 

— In this brief, initial investigation, human 
subjects were exposed to a }-hour aversive 
period, during which they had the option of 
^ taking as many 1-minute time-outs from the 
J'aversiveness as they desired. A group of 
^ yoked-control subjects were exposed to the 
same aversive stimulation, and each yoked 
" subject was given time-outs in exactly the 
same temporal pattern as his experimental 
counterpart. Thus, each group underwent 
comparable amounts and patterning of aver- 
—— "siveness and brief respites, with the critical 
|. difference being that the experimental sub- 
jects could control the occurrence of their 
time-outs, while the yoked subjects could not. 
The experimental hypothesis being tested was 
—— whether or not this control over an avoidance 
esponse produces a decrement in a vascular 
neasure of stress (systolic blood pressure) 
over the period of aversive stimulation. 

- This design is comparable to that used by 
_ Weiss (1968), in which it was found that rats 
— trained to make an avoidance response to 
- electric shocks had substantially reduced 
signs of physiological disturbance, relative to 
_ yoked-control rats which had no available 
avoidance response, even though both groups 


a related series of studies with humans, Hag- 
gard (1943), Champion (1950), and Pervin 
(1963) demonstrated reduced physiological 
arousal and subjective feelings of anxiety 
among subjects who felt that they had control 
over the delivery of aversive stimulation to 
themselves, In the present study, the variable 
being manipulated was control over the avoid- 
ance of aversive stimulation. 


= Method 


Subjects. Twenty-four male, college student volun- 
teers between the ages of 18 and 24 served as sub- 
jects. Prior to the start of the experiment, each sub- 
- ject was informed that electric shock was involved 
and was given an opportunity to decline serving in 
b the study, In addition, subjects with any history of 
— Serious physical illness or undue trepidation regard- 
fe ing psychological experiments were excused. 

Z Subjects were divided into an experimental and a 
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received exactly the same electric shocks. Tn. 


yoked-control group on the basis of their systolic — 
blood pressure, recorded during an initial 10-minute 
rest period. Each experimental subject had a yoked- 
control counterpart whose resting systolic pressure 
was approximately equal. As a result of this matching 
procedure, the mean systolic base lines for the two 
groups were essentially the same, thereby eliminating 
any initial values effects (Wilder, 1957) in subse- 
quent responsivity to stress. 

Apparatus, A Sidman avoidance procedure was uti- 
lized as the stressor during a 4-hour aversive period. 
This consisted of a response panel mounted in front 
of the subject’s chair. The upper portion of the panel 
contained a row of eight lights, each with a different 
geometric symbol associated with it. A similar row 
of geometric designs (but in different positions) was 
located on the lower portion of the panel, cach with 
its own response button below it. At the extreme 
lower center of the panel was a single push button 
labeled “trial.” The subjects task was to depress 
the trial button which illuminated one of the upper- 
row lights (via a 20-position stepping relay) and 
then to make the matching response by depressing 
the appropriate button on the lower row of symbols, 
thus, turning off the upper-row light. Following this 
matching response, the subject had to return immedi 
ately to the trial button, thereby activating anothe' 
upper-row symbol to be matched. In this fashion, the 
subject could work continuously for the 4-hour 
stress period. P 

The symbol-matching panel was wired, via an 4 
interval timer and electromechanical apparatus, to a 
American Electronic Labs stimulator. In order to 
successfully avoid an electric shock, the subject had 
to complete the trial sequence and return to the 
trial button within a temporal interval controlled by 
the electronic timer. The timer setting was tailored to 
each subject’s idiosyncratic matching rate, so that y 
each subject received, on the average, one shock every 17 
45 seconds. This procedure was instituted to generate 
reasonably fast matching behavior and, at the same 
time, to insure that the experimental and yoked- ` 
control groups received an equal number of shocks 
during the stress period. 

The shock-intensity setting on the stimulator was 

also tailored to each individual subject. In a pre- 
liminary psychophysical pain-threshold procedure, 
cach subject was given a series of shocks of gradually 
increasing intensity. He had to report when he felt 
any stimulation, when it became uncomfortable, and, 
finally, when it was felt as painful. This last pain 
level was the intensity used during the experiment. 
(The experimental and yoked-control groups did not 
differ in average shock intensity.) Shocks were deliv- 
ered through standard electrodes taped to the third 
and fourth fingers of the subject’s left hand. 

Two large, prominent lights were mounted on the 
wall of the subject’s booth, to the rear of the match- 
ing apparatus. A white light, labeled “Work,” was 
illuminated while the subject was working on the 
Sidman task; and a red light, labeled “Rest,” was 
illuminated during time-out periods. Both lights 
were controlled by manual switches, located in an 
adjoining experimenter’s room. 
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Systolic blood pressure was recorded at 30-second 
intervals for the entire session by an experienced 
laboratory assistant using a standard sphygmoma- 
nometer, A small, sensitive microphone embedded in 
the occluding cuff was employed to obtain Korotkoff 
sounds. The microphone output was put through an 
audio amplification and speaker system, enabling the 
recorder to obtain distinct and reliable estimates of 
systolic pressure levels. The blood pressure data 
were entered directly onto the strip chart of a four- 
channel event recorder, which also automatically 
marked each symbol match made by the subject, 
each shock delivery, and each rest period. In this 
fashion, concomitant records of behavioral rate, 
shocks received, rest periods, and systolic level were 
obtained over the entire experimental period. 

Procedure. Following the initial health screening 
and consent procedures, the subject was comfortably 
seated at the symbol-matching panel. The blood 
pressure equipment and shock electrodes were at- 
tached to his left arm and fingers. The shock pain- 
threshold procedure was then carried out. A 10-min- 
ute rest period followed, during which systolic 
pressure was recorded every 30 seconds, to allow 
adaptation to the measurement procedures and to 
assess systolic resting level. 

Instructions regarding the symbol-matching task 
were given, and were followed by the critical in- 
structions pertaining to the rest periods. Subjects in 
the experimental group received the following instruc- 
tions: 


It is important for the purpose of the study for 
you to take frequent rests or time-outs, since we 
are measuring blood pressure and do not want 
you to become overly fatigued. Whenever you 
want a time-out, indicate so by saying "Time" or 
“Stop” in a normal tone of voice. The equipment 
will shut down immediately, and the lights in front 
of you will switch from white to red. The red, 
rest light will be on for 1 minute, during which 
time you can rest; and at the end of 1 minute, 
the red light will blink for 5 seconds, meaning that 
you should prepare to begin working on the task 
again. When the white light comes on, begin work- 
ing again. 

Feel free to take as many breaks as you like, 
anytime you think it will improve your efficiency 
on the task or if you just feel like resting. You 
will in no way be penalized for taking these rests. 
The total amount of time spent on the task will 
be about 30 minutes. 


Subjects in the yoked-control condition received 
the following instructions regarding rest periods: 


It is important for the purpose of the study for 
you to have frequent rests or time-outs since we 
are measuring blood pressure and do not want you 
to become overly fatigued. Whenever the lights in 
front of you switch from white to red, the equip- 
ment will shut off and you will receive a 1-minute 
rest period. At the end of 1 minute, the red light 


4 Copies of the verbatim instructions are available 
upon request from the first author, 


will blink for 5 seconds, meaning that you should 
prepare to begin working on the task again. When 
the white light comes on, begin working 1 

There is no penalty associated with these rest 
periods, and they should help to improve your 
efficiency on the task. The total amount of time 
spent on the task will be about 30 minute 


ubject 4 
those 


As indicated earlier, each yoked-controi 
received rest periods which exactly match 
taken by his experimental group counte The 
experimental group took, on the average, rest 
periods. Thus, the two groups underwent the same 
stress-rest period patterns, with the essential differ- 
ence being that experimental subjects had the option 
of taking time-outs, whereas the yoked contro had — | 
the same time-outs imposed by the experimen! 


Results 


Figure 1 portrays the mean systolic ele 
tions for the experimental and yoked-cont 
groups over the j-hour work period, and fÍ 
three 1-minute intervals after the terminati 
of the task. The data points represent mean 
elevations per 2 minutes of work, and read- 
ings obtained during rest periods have been 
omitted. For each subject, the mean oí the 
last four systolic readings during the initial 
10-minute adaptation period represents the 
base line (0 millimeters of mercury on the 
figure; experimental group = 126.9 milli- 
meters; yoked controls — 125.4 millimeters). 

For the statistical analysis of these data, 
the systolic elevations over the j-hour work 
period were averaged into six 5-minute blocks, 
thus permitting a 2 X 6 repeated-measures 
analysis of variance. As can be noted in the 


"figure, the yoked-control groups manifested | 


consistently elevated systolic levels, over the 
entire work period, relative to the experi- 
mental group (F = 5.71, p< .05). In addi- — 
tion, both groups displayed a gradual decline 

in systolic level as the work session pro- 
gressed (main effect for blocks: F = 16.41, p 

< .001); however, the absence of a signifi- 
cant Groups X Blocks interaction (F — .97) 
suggests that the groups did not differentially 
decline over the 30-minute period. 

Considering merely the direction of systolic 
differences over the work session among the 12 
matched pairs of subjects, it was found that 
the predicted effect (higher systolic levels for 
the yoked-control subjects) occurred in 9 of i 
the pairs, with 2 pairs showing no difference, 
and only 1 pair manifesting a reversed effect. 

Naturally, these between-groups systolic 
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Fic. i, Systolic elevations over base line during the 3-hour work period and 
3 minutes oí postwork for experimental and yoked-control subjects. (Rest 
interval data are not included: Study I.) 


effects could be attributable to possible chance 
differences in either response rate on the 
A matching task and/or differences in number 
i of shocks received. However, examination of 

these measures revealed that these potential 
confounding factors were adequately con- 
trolled. In fact, the experimental group (with 
the smaller elevations in systolic level) ac- 
tually worked, on the average, somewhat 
harder (841.3 versus 832.2 symbol matches 
for the session) and received a slightly higher 
average number of shocks (49.5 versus 45.2) 
per session, than did the yoked-control sub- 
Jects. 


Discussion 

Tn sum, these data suggest that having po- 
tential control over time-outs from aversive 
Stimulation attenuates a vascular-stress re- 
Sponse, relative to subjects who undergo essen- 
tially the same pattern and intensity of aver- 
Siveness, but without such potential control. 

'These results prompt several additional 
questions regarding the mechanics involved 
in the attenuated vascular responses, namely: 
(a) What is the course of blood pressure dur- 
ing the time-out periods? (5) Does anticipa- 
tion of a time-out produce "conditioned" sys- 
tolic decreases? and (c) Does subject “con- 
trol" over such anticipatory signals affect any 
Subsequent conditioned systolic responses? 


These questions were considered in a second, 1 
more elaborate study. - : 


Srupy II 


This study tested the hypothesis that an- 
ticipation of relief from aversive stimulation 
will reduce stress-related responses. In es- 
sence, this hypothesis is the complement of 
the often-demonstrated conditioned emotional i 
response, wherein a cue paired with aversive- 1 
ness acquires arousing properties. In the pres- . 
ent case, a subject underwent aversiveness 1 
(and concomitant arousal), and a cue was i 


paired with time-out from aversive stimula- 4 
tion; thus, the cue should hypothetically ac- 1 
quire arousal-reducing properties. ; 


This issue appears to have relevance to the 
data reported in Study I, pertaining to the J 
stress-reducing effects of having avoidance 
responses available while under aversive stim- i 
ulation. If a subject has optional control over : 
when he will take a rest period, it would also 
appear that his prerest decision to take a time- 
out serves as an internal cue, which is paired 
with subsequent removal of the aversive stim- 
ulation. Thus, it might be argued that one 
process? which contributed to the reduced 

5 Another possible factor is that subjects with op- : 
tional control over time-outs have, by the very 
nature of this control, enhanced predictability con- [ 
cerning rest periods relative to the yoked-control 
subjects. 
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blood pressure among the experimental sub- 
jects in Study I was anticipatory vascular re- 
ductions which occurred prior to the actual 
rest period.” 

The present study utilized an elaboration of 
the procedures employed in Study I. Stress 
induction was again via the 43-hour Sidman 
procedure, but in this study all subjects could 
take rest periods at their own option. How- 
ever, in order to take a rest, the subject had 
to hold down a convenient switch (with his 
free hand) for 30 seconds, while he continued 
to work on the Sidman task. At the end of this 
30-second period, the rest period reliably oc- 
curred. In effect, then, a brief anticipatory 
period took place prior to each time-out, dur- 
ing which the subject was still under the rigors 
of the shock-avoidance task; and the question 
being asked is whether or not anticipatory 
blood pressure reductions occur during this 
prerest, conditioned stimulus (CS) interval. 

Several additional variables were included 
in this design. Interspersed randomly among 
the optional, subject-controlled CS intervals 
were an equal number of 30-second experi- 
menter-controlled cues (a 6,000-cycle tone) 
which were also preliminary to a rest period. 
Thus, half of the time-outs were preceded by 
a cue presented by the subject to himself 
(pressing a switch), and half were preceded 
by a stimulus administered by the experi- 
menter (tone) over which the subject had no 
control. 

Last, the subjects were assigned to one of 
three groups which were given rest periods of 
different lengths: 40, 70, or 100 seconds. Of 
interest here are any effects of length of rest 
on the anticipatory blood pressure reductions 
which may occur during the prerest, CS in- 
terval. These values were selected on the basis 
of pilot testing, so as to provide clearly dis- 
criminable time intervals, 


Method 


Subjects. Thirty male, college student volunteers 
between the ages of 18 and 24 served as subjects. 
Each underwent the initial consent, medical screen- 
ing, psychophysical pain threshold, and adaptation 
procedures as described earlier. Subjects were as- 
signed to one of the three length-of-rest-period 


6 Unfortunately, the discontinuous nature of sys- 
tolic measurement prevented a direct assessment of 
this possible phenomenon among the experimental 
groups in Study I. 
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groups (40, 70, or 100 seconds) on the basis of their 
systolic blood pressure level during the initial adap- 
tation period, so as to have approximately matched 
groups. 

Apparatus. The same basic apparatus (symbol 
matching—shock avoidance task, shock source, 
systolic recording) as described in Study were 
used. In addition, a lightly sprung switch was 
located near the subject’s left hand rest, enabling 
the subject to depress this switch (turning on a 
light in the adjoining experimenter’s room) with 
a minimum of effort. Also, an audiospeaker con- 
nected to a standard sound generator was 
mounted at the back of the subject’s response 
panel. (As indicated earlier, the depression of the 
switch by the subject served as the subject-con- 
trolled CS preliminary to a rest period, while an 


audible 6,000-cycle tone from the sound generator 
controlled by the experimenter served as the experi- 
menter-controlled CS.) Finally, as in Study l, a 
white work light was illuminated in the subject's 
booth while he was to be performing on the Sidman 
matching task, and a red rest light was on during 
the time-out periods. 

Procedure. Following the preliminary screening 
and other procedures described earlier, the sub- 
jects were given instructions His xe match- 
ing task. They were also given i ctions re- 
garding the optional taking of rest periods which 
parallel those given to the experimental group in 
Study I, with three important exceptions: (a) 
In order to take a rest, the subject was in- 
structed to depress the button located near his 
left hand for 30 seconds, following which the red 
rest light would come on. (b) He was also told that 
there would be times when the experimenter wanted 
to give him a rest period, and these rests would be 
preceded by a 30-second tone. Finally, (c) the sub- 
ject was cautioned that during these 30-second pre- 
rest periods, it was necessary for him to work as 
quickly and diligently as during any other part of 
the work periods, 

During the work periods, systolic blood pressure 
was recorded, as in Study I, at 30-second intervals. 
However, during the CS intervals (either subject- or 
experimenter-controlled) a systolic reading was taken 
20 seconds after the onset of the CS, thus insuring 
that the point of measurement was comparable for 
all subjects. Similarly, the recorder obtained a sys- 
tolic reading 30 seconds after the rest period began, 
regardless of the length-of-rest group, again in an 
effort to record at a constant point in time for all 
groups. 


Results 


Of primary concern in this study were the 
systolic changes which occurred during the CS 
intervals and rest periods. Figure 2 summa- 
rizes these data, as a function of the length of 
rest period and subject versus experimenter 
control over the CS. The systolic changes rep- 
resent the means of all trials (subjects took, 


128 


(mm. of Hg.) 


SYSTOLIC LEVEL 


- on the average, four rest periods, with the 
experimenter imposing four more randomly 
interspersed among the subject-controlled 
: rests). In addition, the work level of systolic 
— pressure (128 millimeters in Figure 2) was 
| arbitrarily selected as representative of the 
erage pressure during the Sidman perform- 
ance, and is merely utilized as a convenient 
- reference against which to gauge the CS and 
est-period changes. 

A CS interval. For each subject, a differ- 
ence score was calculated for each trial, 
: using the mean of the three previous work 
n Systolic readings as a base measure versus 
- the single reading obtained during the CS 
- interval, Of immediate interest is the 
E discernible systolic reduction which occurs 
- during this anticipatory interval and, 
- Moreover, the apparent effects of the ex- 
perimental variables on these reductions. A 
> X 2 repeated-measures analysis of variance 
(three rest periods and subject versus experi- 
menter control) revealed a significant main 
effect for rest periods (F = 6.62, p< .01) 
and a near-significant main effect (F = 1.81, 
P< .10) with respect to the control variable. 
_ Table 1 summarizes these effects. 


WORK cs REST 
ONSE: T SNSEL 
o 30 


s E 

o CONTROL CONTROL 
€ 40) LENGTH O 40 LENGTH 
= 70 OF o 70 OF 
& 100) REST A 100 REST 


IS! POST-REST READINGS 


I 
60 


! 1 ' 
90 120 150 


TIME (SECS) 


Fic. 2. Systolic decrements, relative to previous work levels, during CS, 
rest, and postrest intervals. (Work level of 128 millimeters was arbitrarily 
selected as representative of average level during the Sidman performance.) 


In interpreting the systolic changes during 
the CS interval (the subject was still working 
on the shock-avoidance task), the subjects’ 
response rate on the matching problem is an 
important consideration. Should the subject 
slow down his behavioral rate in anticipation 
of the rest, the systolic decrements reported 
above would be most parsimoniously ac- 
counted for by this reduced motoric output. 
In view of this possibility, the response rates 
in the 90-second period just prior to the CS 
interval were compared with the rate during 
the CS presentation. The following behavioral 


TABLE 1 


AVERAGE SYSTOLIC CHANGE (IN MILLIMETERS) DURING 
CS INTERVALS RELATIVE TO THE MEAN OF THE 
THREE PRECEDING SvsrOLIC READINGS 
DURING WORK 


Rest period 
40 sec. 70 sec. 100 sec. 
Control ^ — Control Control 
S E s E S E 
—1.63 | —1.05 | —2.73 | —1.53 | —4.15 | —3.80 
— 1.34 —243 —3.08 
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TABLE 2 


AVERAGE SYSTOLIC CHANGE (IN MILLIMETERS) DURING 
THE Rest PERIOD RELATIVE TO THE MEAN OF 
THE Last THREE SvsrOLIC READINGS 
PRIOR TO CS PRESENTATION 
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TABLE 3 


AVERAGE SYSTOLIC CHANGE (IN MILLIMETERS) AT 
First PosrREST MEASUREMENT RELATIVE 
THE MEAN OF THE Last THREE Svsro: 
READINGS PRIOR TO CS PRESENTATION 


Rest period Rest period 
40 sec. | 70 sec. | 100 sec. 40 sec. 70 sec. | 100 sec 
Control | Control Control Control Control Control 
s | P ANGE: E s E E E s i 
—4.82 | —435 | —6.78 | —5.77 | —7.85 | —6.75 —2.23 | —1.31 | —3.68 | —295 | —4.15 | —3.60 
—4.59 | — 6.28 —1.30 —147 —3.32 —3,88 


rates (per 30-second interval) were obtained: 
12.19, 12.07, 12.24, during the three 30-sec- 
ond periods prior to the CS; and 12.18 
matches during the CS period. These data 
suggest that no reliable change in behavioral 
rate occurred during the CS interval. 


Rest period. As above, the systolic 
level obtained for each subject during 
the rest period was compared to the 


mean of the last three readings before 
CS presentation and averaged across trials. 
These difference scores were evaluated via 
a 3X2 repeated-measures analysis of 
variance, with the only significant finding be- 
ing a main effect for length of rest period 
(F = 8.11, p < 01). This effect is portrayed 
in the middle set of data points in Figure 2 
and also in tabular form in Table 2. 

First postrest measure. In an attempt 
to evaluate the effects of the CS and 
rest periods on subsequent systolic re- 
sponding, the first systolic reading after the 
termination of the rest interval was tabu- 
lated. As earlier, a difference score was calcu- 
lated using the mean of three systolic read- 
ings prior to the CS onset as the reference 
point. These results are portrayed as the 
right-hand data points in Figure 27 and are 
also presented in Table 3. 

A 3 X 2 repeated-measures analysis of vari- 
ance revealed a near-significant rest-periods 
effect (F = 2.83, p < 10) which is not sur- 
prising, considering that subjects with the 
longer rest periods had more resting time in 

"Note that these measures occur at different 
points in time across rest-period groups, due, of 
course, to the different lengths of rest given to each 
group. 


which to allow blood pressure to subside prior 
to beginning work again. The subject- versus 
experimenter-control variable was not siguifi- 
cant, although the direction of these data sug- 
gests that when subjects control the onset of 
the CS (and, thereby, the rest period), their 
postrest arousal levels may be relatively at- 
tenuated. 


g 
I 
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Discussion 


Of the hypotheses posed in the introduction 
to this paper, two appear to be reasonably 
supported, namely (a) that the availability of 
an avoidance response while under aversive 
stimulation reduces autonomic signs of 
arousal and (5) that cues, preliminary to a 
respite from aversiveness, elicit a degree of 
arousal reduction. The third hypothesis, that 
subject control over these preliminary signals 
would produce greater anticipatory decre- 
ments in arousal, received only minimal sup- 
port.* These results bear comment on several 
levels of discourse: conditioning theory; the 
concept of control and its relationship to per- 
sonality theory; and last, extrapolations to 
the area of psychosomatics. 

Within conditioning theory, the “safety 
signal" hypothesis (Seligman et al, 1970) 
suggests that when a stimulus reliably predicts 
an aversive event, the converse is also true, 
that is, that "safety," the absence of aversive- 
ness, is also reliably predicted by the absence 
of the signal for aversiveness. Thus, applying 

5'The potency of the control variable throughout 
these studies may be attenuated because of the sub- 
jects’ knowledge of the ethical constraints on the 


experimenter concerning subject safety and the use of 
relatively mild stressors. 
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H Paylovian framework, the reliable predictor 
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CS--) of aversiveness (an elicitor of fear) 
_ implies the existence of a CS—, a reliable pre- 


b: dictor of safety, which hypothetically reduces 
— er inhibits fear. Evidence in support of this 


fear-reduction process occurs in the animal 
literature (e.g., LoLordo, 1967; Rescorla & 
LoLordo, 1965; Seligman, 1967); and in the 
present paper (Study II) the anticipatory 
systolic reductions which occurred during the 
CS interval appear to reflect the same process 
with human subjects. 

This same line of thinking may be extended 
to the results of Study I in this paper. Here, 
subjects in the experimental group had op- 
tional control over rest periods—in effect, 
having an avoidance response available with 
respect to the stressful Sidman schedule. Pre- 
sumably, the taking of a rest period is pre- 
ceded by a decision to do so; and it seems 
reasonable to argue that such a cognitive 
process could serve as a fear-reducing CS. The 
accumulation of a series of such anticipatory 
systolic reductions over rest periods could be 
one process contributing to the reduced blood 
pressure among the experimental subjects. The 
possibility that such an interoceptive condi- 
tioning process could take place in an avoid- 
ance learning paradigm has been directly dem- 
onstrated by Soltysik (1963) with animal 
subjects. Soltysik theorized and experimen- 
tally verified the thesis that one component of 
an escape or avoidance response is the pairing 
Of interoceptive or proprioceptive cues pro- 
duced by the instrumental response itself with 
Subsequent reduction of aversiveness (or aver- 
Sive-related cues); thus, these internal cues 
become conditioned stimuli for fear reduction. 
Hokanson, Willers, and Koropsak (1968) 
demonstrated much the same effect with hu- 
man subjects in aggressive, dyadic relation- 
Ships. It was found that the use of a social 
response which effectively reduced the op- 
ponent’s aggression was accompanied by sig- 
nificant reductions in vascular arousal. 

Within the realm of personality theory, 
the concept of control has been dealt with by 
both Lazarus (1966) and Rotter (1966) 
from somewhat different vantage points. 
Lazarus has hypothesized that the availabil- 
ity of coping responses while under stressful 
Conditions (potential control over aversive 
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events) should reduce stress-related respond- 
ing. The results of Study I appear to clearly 
support this notion; and the systolic data on 
subject-controlled CS presentations in Study 
II also point in the same direction. It will be 
recalled that experimental subjects in Study I 
underwent virtually the same objective stress 
conditions (and rest periods) as the yoked- 
control subjects, and yet displayed systolic 
levels which were substantially reduced. This 
systolic effect, then, can hardly be attributa- 
ble to differential motor output or exposure to 
shocks, In simplest terms, it appears as if the 
knowledge that one could take a rest as de- 
sired was associated with reduced vascular 
levels. 

This phenomenon may be reducible to the 
type of interoceptive conditioning of fear re- 
duction described earlier; or, as suggested by 
Lazarus (1966), there may be more direct 
cognitive effects on stress-related responses. 
In connection with this latter viewpoint, 
Melzack and Wall (1965) developed the case 
for cognitive mediation in the attenuation 
of neural impulses in ascending pain path- 
ways. This implies the operation of a corti- 
cally initiated effector system which can mod- 
ulate peripheral-sensory and central nervous 
system processes subserving the experience of 
pain. The physiological literature on pain con- 
tains some evidence in support of this conten- 
tion (e.g., Hagbarth & Kerr, 1954). 

The work of Rotter (1966) bears mention, 
in that the concept of control is treated by 
this theorist largely as a dimension of person- 
ality, rather than a set of environmental cir- 
cumstances. Rotter emphasizes the individu- 
al’s perception of his own degree of control 
over environmental contingencies and the 
behavioral effects of these self-appraisals. Of 
relevance here is the relationship of Rotter’s 
“external” type of personality (one who feels 
an absence of control over the environment) 
with the concept of “learned helplessness” 
(Maier, Seligman, & Solomon, 1969), which 
has evolved from the animal literature. In this 
latter work, it is generally found that organ- 
isms that undergo aversive stimulation over 
which they have no control display, relative 
to appropriate control groups, (a) more in- 
tense stress responses (Weiss, 1968) and (5) 
inability (or lack of incentive) to learn avoid- 
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ance responses in subsequent aversive situa- 
tions (e.g., Seligman & Maier, 1967). It seems 
quite conceivable that Rotter’s external type 
of personality is one which has undergone a 
predominance of learning experiences, compa- 
rable to those described in the learned-help- 
lessness literature. With respect to the present 
data, the yoked-control subjects in Study I 
were in an aversive situation over which they 
had relatively little control (time-outs are im- 
posed), and as such were undergoing condi- 
tions which presumably develop the learned- 
helplessness syndrome. From this viewpoint, 
the differential systolic responses obtained in 
Study I may reflect enhanced stress respond- 
ing among the yoked controls, rather than 
attenuated systolic levels in the experimental 
group. 

The concepts discussed thus far appear rele- 
vant to the field of psychosomatics, particu- 
larly with respect to cardiovascular disease. 
The possible systolic enhancement effect de- 
scribed above in relation to lack of control is 
at least reminiscent of the field research of 
Jenkins, Rosenman, and Friedman (1967), in 
which coronary-prone individuals were identi- 
fied on the basis of personality variables. 
Among other factors, these investigators found 
that feelings of time urgency, a sense of un- 
relenting external demands, and behavioral 
patterns of compulsive activity to ward off 
impending harm were characteristic of those 
who developed coronary problems. In an ob- 
viously more limited fashion, the yoked-con- 
trol subjects in Study I may well have expe- 
rienced similar subjective feelings. 
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PERCEIVED SOCIAL PRESSURE AS AN EXPLANATION OF SHIFTS 
> IN THE RELATIVE INFLUENCE OF RACE AND BELIEF ON 
PREJUDICE ACROSS SOCIAL INTERACTIONS' 


LOUIS MEZEI? À 


University oj Detroit 


Degree of acceptance and rejection of four stimulus persons, varying in race 
and be was measured for 10 social interactions. Perceived social pressure 
was measured by asking the subjects to state the degree of approval by their 
parents and friends if they engaged in each social interaction with each stimulus 
person. The race of the stimulus persons was Negro or white, and the beliefs 
used were being for or against communism, or being for or against strong 
civil rights legislation. For the two beliefs combined, race prejudice was sig- 
niñcantly higher than belief prejudice for marriage, dating, and accepting as 
kin. For 6 interactions, belief prejudice was higher than race prejudice. Per- 
ceived social pressure to race prejudice was significantly higher than to belief 
prejudice for the interactions for which race prejudice was higher than belief 
prejudice. Adjustment of both race and belief prejudice scores for the opera- 


The following study was designed to recon- 
cile contradictory explanations of the data in 
the literature about the relative influence of 
race and belief congruence on behavioral in- 
tentions in various social situations. The study 
of prejudice was strongly influenced by the 
creation of the social distance scale (Bo- 
gardus, 1928), which forces subjects to re- 
spond to undifferentiated members of minor- 
ity groups. Rokeach, Smith, and Evans 
(1960) have proposed that prejudice is influ- 
enced more by belief than by race compari- 
sons, provided that subjects are given an op- 
portunity to perceive belief differences and 
similarities. Data presented by Triandis and 
Davis (1965) showed that even if belief simi- 
larity is perceived, race is a stronger determi- 
nant of behavioral intentions than belief in a 
number of intimate interactions. The present 
study was designed to illustrate that the op- 
eration of perceived social pressure can ex- 
plain why race prejudice is greater than belief 
prejudice in some instances, and that there is 


1This report is based on a doctoral dissertation 
Submitted to the Department of Psychology, Michi- 
gan State University. The author gratefully ac- 
knowledges the assistance provided by Milton Rok- 
each and Alice H. Eagly. 

2 Requests for reprints should be sent to the author, 
Department of Psychology, University of Detroit, 
4001 West McNichols Road, Detroit, Michigan 48221. 


no need to postulate different processes of 
prejudice operating in the various interactions 
studied. 


Race Prejudice as a Special Case of Belief 
Prejudice 


Rokeach et al. (1960) have proposed that 
race prejudice operates through the mecha- 
nism of ascribing unacceptable beliefs to un- 
differentiated members of discriminated racial 
groups, and rejecting them on the basis of be- 
lief differences, but that there is no prejudice 
on the basis of race per se. The belief-differ- 
ence explanation is consistent with the classi- 
cal findings (Gilbert, 1951; Katz & Braly, 
1933) on the positive relationship between 
ethnic prejudice and negative stereotyping. 

Rokeach et al. (1960) have reported two 
studies in which race and belief, and ethnic 
membership and belief, were compared as de- 
terminants of friendship acceptance. When 
subjects were given an opportunity to express 
their degree of acceptance of stimulus per- 
sons, they chose more on the basis of belief 
than on the basis of race or ethnic member- 
ship. Contrary to commonsense expectations, 
conventionally measured prejudice scores 
against Negroes did not relate to the observed 
differences in acceptance and rejection of peo- 
ple. Anderson and Cote (1966), using the 
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tion of perceived social pressure, using regression calculations, resulted in the ] 
reversal of the observed relationship between race and belief prejudice for the 
3 intimate interactions. 
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same stimulus-person technique, have con- 
firmed that friendship rejection between Eng- 
lish and French Canadians operates on a be- 
lief, rather than an ethnic, basis. 

Rokeach and Mezei (1966) have extended 
the comparison of race and belief to accep- 
tance of coffee partners in a university-set- 
ting experiment and to acceptance as poten- 
tial working partners in a real-life employ- 
ment office situation. Instead of the technique 
of describing stimulus persons on paper, which 
was used in all studies reviewed in this study, 
two white and two black confederates pro- 
vided for both race and belief variation. Both 
white and black applicants for employment in 
a mental hospital tended to choose the two 
confederates, one white and one black, who 
agreed with them on work-related issues. Simi- 
lar results were obtained in the university- 
setting study. The experiment strongly indi- 
cated that the studies using stimulus persons, 
described on paper only, provide information 
relevant to real-life situations. 

Smith, Williams, and Willis (1967) have 
extended the Rokeach et al. (1960) study to 
identifying stimulus persons by sex, race, and 
belief, and to utilization of both black and 
white subjects from the North and the South. 
Their findings show that belief is more impor- 
tant than race is, which, in turn, is more im- 
portant than sex in friendship choices. A 
notable exception was a white Louisiana sam- 
ple, where race was more important than be- 
lief. 


Stereotyping as Belief Rejection 


A series of studies carried out by Byrne and 
his co-workers (Byrne, 1961, 1965; Byrne & 
McGraw, 1964; Byrne & Wong, 1962) show 
that even in Texas, attitude similarities are 
more important for friendship choices than 
race similarities. Highly prejudiced white 
subjects, according to Byrne and Wong, as- 
sumed a dissimilarity of attitudes of Negro 
strangers to their own. In spite of the initial 
differences, both high- and low-prejudiced 
whites have found Negro stimulus persons 
with similar attitudes to their own attractive, 
and Negroes with dissimilar attitudes un- 
attractive. 

Byrne and McGraw (1964) have continu- 
ously manipulated the degree of attitude simi- 
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larity of the stimulus persons. They fo 
differences in acceptance of Negroes by high- 
and low-prejudiced subjects. Highly prej 

diced persons accepted a Negro in terms of 
belief similarities, provided there was com- 
plete similarity of attitudes with the stranger 
about a relatively large number of objects. It 
appears that to overcome the assumed dis- 
similarity of attitudes resulting from preju- 
dice, more similarity to the Negro stranger is. 
needed. The results are consistent with the 
statement that race rejection operates through 
the mechanism of assumed belief dissimilarity. 


Type of Social Interaction and Race and 
Belief Effects 


Triandis and Davis (1965) reported that 
the effectiveness of belief and race congruence 
in determining behavioral intentions depends 
on the particular interaction contemplated. 
The behavioral intentions were obtained for 
items in the behavioral-difference scales re- 
ported by Triandis (1964, 1967). Departing 
from the original conceptualization of the so- 
cial distance scale as unidimensional (Bo- 
gardus, 1928), Triandis and Davis have iden- 
tified and used the five following factors: ` 
formal social acceptance versus formal social 
rejection (I would admire the ideas of); 
friendship acceptance (I would accept as an 
intimate friend, I would eat with, I would go 
fishing with); marital acceptance (I would 
marry, I would date); social distance (I. 
would exclude from my neighborhood, I 
would accept as a close kin by marriage, I 
would treat as an equal); and subordination 
(I would be commanded by). 

Triandis and Davis reported that the social 
distance and the marriage factors are re- 
sponded to in terms of race prejudice; the 
evaluative and formal social acceptance fac- 
tors are responded to in terms of belief preju- 
dice, while two other factors, subordination 
and friendship, are responded to both in terms 
of race and belief. 

Stein, Hardyck, and Smith (1965) and 
Stein (1965, 1966) have demonstrated that 
when belief is manipulated in terms of a 
number of important values of teen-agers, be- 
lief is a stronger determinant of friendship 
and a number of other social distance choices 
than race. In addition, significant racial preju- 
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dice was also observed in a number of inter- 
actions. Stein (1966, p. 27) reported that 


white, tile teen-agers were willing to inter- 
act with like-valued Negroes in situations 
such t next to in class,” “eat lunch 
with, k on a committee with," and 
“have close personal friend." In con- 
trast, such interaction items as “invite 
home for dinner,” “live in the same apart- 
ment ise," “date brother,” “have a close 


relative marry,” and “have as a neighbor on 


the street," white gentile subjects tended to 
react in racial terms. Stein suggested the 
variables of social pressure, the presence 
of others in the situation, and intimacy of the 
social interaction as possible explanations for 
the differences found. 


Willis and Bulatao (1967) have partially 


confirmed that marriage acceptance depends 


to a larger extent on race than on belief, as 
compared to friendship acceptance. Using 
Filipino subjects and descriptions of Filipino, 
Chinese, Japanese, American, and Spanish 
stimulus persons, they found that on the 
average, belief accounts for 30 times as much 
variance as ethnicity on the friendship rat- 


ings, while in the marriage ratings, belief is 20 
times more powerful in predicting interper- 
sonal choice than race. The observations are 
contrary to the usual American findings of 
race rejection for marital interactions. 

Insko and Robinson (1967) and Robinson 
and Insko (1969) have presented data which 
tends to refute the intimacy explanation of 
racial prejudice. In one study they manipu- 
lated beliefs of Negro stimulus persons which 
were judged to be strong beliefs by North 
Carolina white teen-agers, and in the other 
study they used beliefs which were attributed 
to Negroes by the subjects. In both studies 
they found higher race than belief rejection 
for the social interaction items of Triandis’s 
(1964) friendship and social distance factors, 
contrary to the Triandis and Davis (1965) 
findings. The data did not support the hy- 
pothesis that the more intimate the contem- 
plated social interaction is, the more impor- 
tant the race consideration will be. Both stud- 
ies agreed with the findings of Triandis and 
Davis, that belief is more important than race 
in influencing semantic differential evaluation 
of stimulus persons. 
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Byrne and Ervin (1969) have concluded 
from a review of the studies in the area that 
the relative importance of race and belief on 
interpersonal attraction depends on the spe- 
cifics of the experiment. They have provided 
a multiple-regression equation which takes 
into account such belief variables as attitudes 
of the stimulus person and the evaluation of 
the subject by the stimulus person, prejudice 
as a subject variable, and the degree of reality 
of the stimulus person. Presumably, a differ- 
ent regression equation would be necessary for 
predicting behavioral intentions for each in- 
teraction. 


Social Pressure Explanation of Race Rejection 


As a possible explanation of their data, 
Triandis and Davis (1965) proposed that 
intimacy of behavior determines the relative 
influence of race and belief factors in inter- 
personal choice. One major problem with the 
intimacy explanation arises from the fact that 
the five behavioral-differential scales are not 
ordered according to degree of intimacy. For 
example, “I would accept as kin by mar- 
riage” and “I would exclude from my neigh- 
borhood” are representing very different de- 
grees of intimacy, but both appear in the 
social distance factor. In addition, there is no 
compelling reason for translating the social 
distance scale into a psychological intimacy 
scale. 

Tt has been clearly demonstrated that cer- 
tain social interactions, especially marital in- 
tentions, are determined more by race than 
by belief in the United States. Interestingly, 
as discussed above (Willis & Bulatao, 1967), 
in the Philippines, belief is more important 
than race in determining marital intentions. 
Bogardus (1967), in a review of the findings 
of social distance between ethnic groups in 
the United States over a 40-year period, indi- 
cated that not only persistent racial and eth- 
nic preferences exist for the country as à 
whole, but in addition to the national pattern, 
persistent regional differences in ethnic preju- 
dice are also present. The fact of regional 
variations in the relative influence of race and 
belief in interpersonal choice behavior for 
the same social interaction has also been 
pointed out above. The national and regional 
variations cannot be explained by assuming à 
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simple race effect, or by using the intimacy 
hypothesis. 

An explanation of the empirical facts is 
possible by the extension of the belief-congru- 
ence principle. Behavioral intentions depend 
not only on the congruence of beliefs with 
the person to be chosen for interaction, but 
also on beliefs about the consequences of the 
intended interracial or interethnic interaction 
in terms of reference group relationships. Not 
only congruence of beliefs with the person to 
be interacted with but also congruence of 
beliefs with those of significant reference per- 
sons about the interaction need to be consid- 
ered. One study designed to examine promo- 
tion practices in industry (University of 
Michigan Survey Research Center, 1964) 
demonstrated that ethnic discrimination by 
supervisors is related to a perceived bias in 
higher management. Racial or ethnic discrimi- 
nation is not necessarily an individual person- 
ality, or evaluative, variable, but is partially 
determined by perceived social pressure. 

Beliefs about social pressure operate only if 
they are perceived by the subjects as originat- 
ing from significant people. When subjects do 
not perceive social pressure to take race into 
account, they would choose partners for vari- 
ous social interactions on the basis of belief 
congruence, provided sufficient information is 
available for overcoming stereotyped belief 
perceptions. 

The present study was undertaken because 
no study in the literature demonstrated and 
explored the possible dynamic interplay of 
perceived social pressure with perceived be- 
lief and race congruence in interpersonal 
choice behavior. The design of the study pro- 
vides for obtaining correlations between per- 
ceived social pressure and prejudice scores and 
for adjusting the prejudice scores for the op- 
eration of perceived social pressure by means 
of regression analysis. Perceived social pres- 
sure, as measured in this study, is not neces- 
sarily an independent variable, since to an 
undetermined extent, it may have been con- 
taminated by rationalization of prejudice. Ar- 
guments, based on the internal analysis of the 
data, are presented in the Discussion section 
for employing perceived social pressure as an 
independent variable in the interpretation of 
the results. 


Louis MEZEI 


In designing the study, a new set of mea- 
sures to estimate perceived social pres 

was devised. All the other variables us 
been previously employed in the studie: 


re- 
viewed above. Two belief variables, com- 


munism and beliefs about civil rights legis 
tion, were employed, using separate gro: 
subjects, in order to examine the possibility of 
interdependence between race and the belief 
characterization of the stimulus persons used. | 
The present study followed the experimental 
procedures of Triandis and Davis (1965), us- 
ing 10 of the 15 social interactions, and add- 
ing another belief variable to the civil rights 
legislation belief. However, because of an 
expected statistical interaction between race, 
belief, and type of social interaction, each 


social interaction was analyzed separately, as —— 
contrasted with the use of five factors em- - 
ployed by Triandis and Davis as the unit of 
analysis. = 


Hypotheses 


1. There is a statistical interaction between 
the relative influence of race and belief con- 
gruence and the type of social interaction in 
the acceptance and rejection of people. 

2. Belief congruence is a stronger determi- 
nant of interpersonal behavioral intentions 
than race congruence, provided that both race 
and belief prejudice scores are statistically 
adjusted to remove the effect of perceived so- 
cial pressure. (a) For race prejudice to be 
greater, when belief congruence is perceived 
on a relevant issue, perceived social pressure 
to discriminate on the basis of race has to be 
stronger than perceived social pressure to dis- 

j 


criminate on the basis of belief. 


METHOD 
Subjects 


Students in several sections of a large introductory 
psychology class and two smaller summer classes 
have served as subjects during regular class time. 
Every second subject was chosen to participate in 
Experiment I. In Experiment I, the communist 
belief experiment, 31 white male, 28 white female, 11 
black male, and 10 black female subjects participated. 
All 80 subjects were against communism. In Experi- 
ment II, the civil rights belief experiment, 23 white 
male, 27 white female, 4 black male, and 3 black 
female subjects were for strong civil rights legisla- 
tion; and 11 white male and 11 white female sub 
jects were against strong civil rights legislation. As 


there were only 4 significant race and 3 significant 
sex differences at the .05 level, out of the possible 40 
each, sex 2 race differences were not considered 
in the analy jf the data. 

Three of the four race differences observed ap- 
peared in the "accept as kin by marriage" interaction. 
Black subjects accepted significantly more same race- 
different belief, different race-different belief, and 


differer ame belief stimulus persons than did 
white ects. The net outcome was the reduction 
of both nd belief prejudice scores of the black 
subjects alf of that for white subjects for 
the accept <in interaction. Combination of the 
data by ce did mot change the comparative 
strength of race and belief prejudice. The only other 
race difference was found in the date interaction, for 
which black subjects accepted more white stimulus 
persons of the same belief than did white subjects, 


who accepted black stimulus persons of the same 
belief. The race comparisons were made in Experi- 
ment I only, as there were too few black subjects in 


Experimen j 
Two of the significant sex differences were in the 
“comman " interaction, for which females ac- 


cepted the 


subordinate role more than males for the 
two stimulus persons of both races with the same 
belief as the subjects. The net result was a some- 
what hi; race and a somewhat lower belief 


prejudice. The only other significant difference be- 
tween males and females was found in the marry 
interaction for the same race-different belief stimulus 
person, who was strongly rejected by both males and 
females, but the rejection was higher by females 
(6.3, as compared to 5.6 for males). 


Instruments 


Interpersonal choice scale. Ten interpersonal behav- 
iors (outlined above) were used to represent the five 
factors described by Triandis (1967). Behavioral in- 
tentions were measured on a 7-point scale, presented 
as 7 discrete points on a continuum from “I would 
very strongly accept this person” to “I would very 
strongly not accept this person” for the interaction. 
The complete scale consisted of 10 behavior items, 
used to determine to what extent the subject would 
interact with a stimulus person, presented at the top 
of the page. 

Stimulus persons, Four stimulus persons were pre- 
sented to each subject: a white person who is a 
communist, a Negro who is a communist, a Negro 
who is anticommunist, and a white person who is 
anticommunist, for indicating behavioral intentions 
in Experiment I. In Experiment TI the belief of the 
stimulus persons was being for or against strong 
civil rights legislation. 

Perceived social pressure. After the subjects indi- 
cated their behavioral intentions, they were asked to 
indicate the degree of approval or disapproval ex- 
Pected from their parents and friends if they engaged 
in cach of the social interactions with each stimulus 
Person. Separate judgments were made for parental 
reactions and the reactions of friends using the 7- 
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point scale, employed previously for behavioral in- 
tentions, 

After completing the behavioral intention and social 
pressure scales, the subjects were asked to give infor- 
mation about their race, sex, and belief positions on 
the belief variable used. 


Treatment of the Data 


The raw data obtained consist of rejection scores 
by individuals for a specified stimulus person in a 
particular social interaction. Since there were four 
stimulus persons and 10 interactions, a total of 40 
rejection scores were obtained for each subject. Since 
7 signified maximum rejection and 1 denoted maxi- 
mum acceptance on the scale, the descriptive term, 
rejection score, is used with the understanding that 
we are dealing with an acceptance-rejection con- 
tinuum. 

Corresponding to each of the 40 behavioral-inten- 
tion judgments made by each subject are perceived 
social pressure scores obtained for parents and 
friends, a total of 80 perceived social pressure scores. 
Seven on the social pressure scale signified the 
strongest social pressure toward rejection of the 
stimulus person, 1 signified approval of interacting 
with the stimulus person, while 4 was the neutral 
point. 

The rejection scale and the perceived social pres- 
sure scale were constructed in a similar way in order 
to make it easy for the subjects to express their 
rejection judgments and perceptions of social pres- 
sure, by using only one frame of reference. The simi- 
larities in the scale structure also facilitated the 
calculations of correlation coefficients between the 
rejection and the social pressure scores. 


Grouping of Rejection Scores 


In the absence of major race and sex differences in 
the rejection scores, combination of the data into 
two groups was judged to be appropriate, with one 
group using communist and the other employing 
civil rights belief stimulus persons. Before the data 
could be combined, transformations were necessary 
to allow for the race and belief differences of the 
subjects, as compared to the race and belief of the 
stimulus person used. 

In the communist belief experiment, there were no 
belief differences between the subjects, since all of 
the subjects declared themselves as anticommunists. 
Therefore, for combining the rejection scores, only 
the race differences between the subjects needed to 
be considered. To allow for race and possibly belief 
differences, the raw data were tabulated as follows: 
different belief, same race (B—R--); different belief, 
different race (B—R—); same belief, different race 
(B--FR—); same belief, same race (B+R-+). For cx- 
ample, to arrive at the rejection score for a stimulus 
person with the same belief and of the same race as 
the subject (B+R+), the rejection scores of the 59 
white subjects for the anticommunist, white stimulus 
person were combined with the rejection scores of 
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the 21 black subjects for the anticommunist, Negro 
stimulus person. 


Prejudice Scores 


The rejection scores obtained measured the degree 
of acceptance and rejection of each stimulus person 
for a given behavioral intention. Even when there is 
acceptance of a stimulus person, differential accep- 
tance as a result of race or belief prejudice is possi- 
ble. Since the present major interest was to compare 
the differential influence of race and belief on 
prejudice, a more useful set of derived scores can be 
computed following Rokeach et al. (1960). A race- 
difference score was computed by obtaining the 
arithmetic sum of the two possible rejection score 
differences due to race: 


(B+R— — B+R+) + (B—R- — B-R4). [1] 


The numerical value, the difference between the re- 
jection scores for members of the other race and for 
members of the subjects’ race, is a prejudice score due 
to race differences. 

Because the study employed a 7-point scale, each 
of the maximum possible rejection score differen- 
tials was 6, and therefore the race-difference scores 
could range from —12 to +12. A positive value 
which is significantly different from zero indicates a 
rejection due to race, while a negative value indi- 
cates acceptance due to the race of the stimulus 
person, 

A similar belief prejudice score can be computed 
using Expression 2: 


(B-R+ — B+R+) + (B-R— — B+R-), [2] 


The computed belief-difference, or belief prejudice, 
score has the same properties as the race prejudice 
score discussed above. 

Using the social pressure scores, social pressure 
race-difference and social pressure belief-difference 
Scores can be computed by the substitution of the 
appropriate perceived social pressure scores into Ex- 
pressions 1 and 2, respectively. The meaning of the 
social pressure race-difference score is analogous to 
the meaning of the Tace-rejection score, except that 
the social pressure race-difference Score measures the 
amount of perceived social pressure for acting in 
a prejudiced manner as the result of race. Similar 
meaning is attributed to the social pressure belief- 
difference scores. 

Correlation coefficients can be computed between 
the computed prejudice scores and the social pres- 
sure difference scores for each interaction. Compari- 
son of the coefficients of correlation for race and 
belief prejudice can reveal which perceived social 
pressure is more effective in each interaction, the one 
toward race rejection or the one toward belief re- 
jection. Computations from the prejudice scores and 
the social pressure toward prejudice scores can pro- 
vide a direct test of the hypothesis that for higher 
race rejection than belief rejection to take place, 
higher social pressure toward race prejudice than 
belief prejudice is needed. 


We can compute adjusted race and belief prejudice 


scores from which the effects of the operation of 
perceived social pressure have been eliminated by 
regression calculations, The statistical operation: in- 
volved are outlined in the Results section, The ad- 
justed prejudice scores are the mean race lief 
prejudice scores expected under the condition of no 
perceived social pressure. Direct comparison oí the 
adjusted race and belief prejudice scores for each 


interaction can provide information about t 
tive influence of race and belief on interpe: 
acceptance under the condition of no perceived 
pressure toward prejudice. 


RESULTS 


Tables 1, 2, and 3 present the prejuc 
scores for the communist experiment, the 
rights experiment, and the two experiment: 
combined, respectively, for the 10 social inter- 
actions. Race prejudice scores, social pressuir 
to race prejudice scores, belief prejuc 
scores, and social pressure to belief preju: 
scores are presented in columns 1 and 3 of 
each of the three tables. In addition, Tables 1, 
2, and 3 present the correlation coefficients 
between the race prejudice and social pre 
to race prejudice scores, and the coeffic 
of correlation between the belief prejudice : 
the social pressure to belief prejudice scor 
Tables 1, 2, and 3 also present the summa: 
of £ tests between race and belief prejudice 
scores, between social pressure to race and to 
belief prejudice scores, and between the cor- 
relation coefficients computed for measuring 
the relationship between race prejudice and 
social pressure and belief prejudice and social 
pressure. 

A positive prejudice score which is signifi- 
cantly larger than zero indicates differential 
rejection due to race or belief on the scales 
used. Of the 20 race prejudice scores presented 
in Tables 1 and 2, only 1 score had a negative 
sign, indicating that race differences result in 
prejudiced behavioral intentions. Similarly, all 
20 belief prejudice scores were positive, indi- 
cating relative rejection of people on the basis 
of belief differences. 


Interactions for which Race Prejudice Is 
Stronger than Belief Prejudice 


For the two experiments combined, as in- 
spection of Table 3 shows, only for three 
interactions, marry, date, and accept as kin, 
were the race prejudice scores significantly 
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TABLE 1 t UTE 
Race- AND BrLrEr-DrrreRENCE Scores (RD, BD), RACE AND BELIEF SOCIAL PRESSURE 


Scores (RSP, BSP), AND THE CORRELATION COEFFICIENTS BETWEEN RD) AND 
RSP AND BD AND BSP FOR THE Communist BELIEF STUDY 


` d 
* 


Race difference Belief difference 
Person 
x Tub.RSP x Yap_usP 
S 37 | 3.8 
Parent 44 .64** 43 35% 
Friend 3.7 Daum 3.7 29** 
S 44 2.7 
Parent 4.6 .66** 40 2 
Friend 43 .64** 32 Ape 
S 3.3 2.3 
Parent 3.7 .64** 3.5 wre 
Friend 3.0 .68** 2.5 gare 
Friend S. 10 41 
: Parent 23 .36** 43 Eie 
Friend 1.8 Ag** 3/3 :30** 
= Fish with S 9 2.5 
4 Parent 17 -09 3.5 m2 
m. Friend 1. A7 2.6 Ag 
Eat with S d 14 
* Parent 19 .05 23 .39** 
i Friend 1.0 .22 19 E ud 
h Commanded by S 4 41 
M. En Parent 10 —.07 46 .66** 
] Friend 8 12 3.9 .65** 
. Admire S .2 5.4 
r Parent Ei —.08 6.4 3r 
E Friend ST .20* 5.0 195** 
.— Equal S 8 24 
Parent 1.6 44 28 ares 
Friend 12 18 2.2 OTe 
Exclude 5 6 
1 Ch Parent 2.5 AS 32 326* 
" 2 Friend 19 25% 2.4 .44** 
a 
: *p «c0 
ua "ain Si 


_ higher than the belief prejudice scores. The 
social pressure to race prejudice scores were 
. also substantially and significantly higher 
than the social pressure to belief prejudice for 
the same three interactions. In addition, for 

_ the interactions marry, date, and accept as kin, 
3 the correlations between the race prejudice 
. and the social pressure to race prejudice scores 
. were also significantly higher than the corre- 
. Sponding coefficients of correlation for the 
- belief prejudice data. The data in Table 3 
.. Confirm the hypothesis that for interactions, 
. for which race prejudice is higher than belief 


prejudice, there is a larger amount of social 
pressure toward race prejudice than toward 
belief prejudice. 

For the communist belief experiment sum- 
marized in Table 1, the same trend is ap- 
parent, as described above for the combined 
groups. There were no significant differences 
between race and belief rejection for marry, 
probably due to the fact that both race and 
belief appear to be strong determinants of 
prejudice; the respective difference scores 
were 3.7 and 3.8. Although the social pressure 
scores were numerically higher toward race 
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TABLE 2 ; 
RACE- AND BELIEF-DIFFERENCE Scores (RD, BD), Rack AND BELIEF SOCIAL Pressuke 
Scores (RSP, BSP), AND THE CORRELATION COEFFICIENTS BETWEEN RD AND 
RSP anp BD anv BSP ror THE Civil Ricuts BELIEF Stupy 
Race difference Belief difference | 
Interaction Person 'np-BD i ad ere 
X TRD-RSP X T BD-B8P 
M S 44 24 4.17** 
pst Parent 5.8 Ag** 5 .28* 10.67** 2.20* 
Friend 48 .60** 1.9 oor 5.99** .90 
Dat S 4.8 1.9 5.60** 
"uy Parent 6.2 A5" 3 .23* 12.54** 1.57 
Friend 4.9 .61** 1.6 56" 9.94** 50 
Kin ; S 3.8 15 4.65** 
Parent 4.3 ott —.1 23* 8.87** 2.51* 
Friend 3.6 .62** 12 .50** 542** 1.13 
friend — . S 1.0 2.3 2.97** 
sf Parent 2.6 158* A 34 1.98 
Friend 2.8 48** 13 :62** 1.32 
Fish with S 1.3 15 .05 
Parent 2.0 rd a AT Ñ 2.58* 
Friend 1.9 .50** 9 30** 1.51 
Eat with S 8 12 
Parent yd J2* Ad 1.89: 
Friend 1.0 3t 9 vaasa 
Commanded by S 19 2.7 
Parent 1.5 .38** kd 1.00 
Friend 1.0 -09 18 2.64* 
Admire S =l 
Parent 1.0 12 25 « 
Friend 8 Br 2.26* 
Equal S 8 
` Parent 1.6 24" 1.51 
Friend E ast 195 
Exclude S 8 47 
Parent 2.9 EU! 63 
Friend 1.9 122* 94 
by 405 
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than toward belief prejudice for all three 
interactions, the differences were significant 
only for the date interaction. However, for 
the interactions marry, date, and accept as kin, 
the correlations between the race prejudice 
scores and the social pressure scores attributed 
to parents and friends were significantly 
higher than the corresponding correlations for 
the belief prejudice variable, indicating a 
higher effective social pressure to race dis- 
crimination. 

In Table 2, which presents information 
about the civil rights belief experiment, the 


differences between race and belief prejudice 
for the three interactions are high. For marry; 
the race prejudice score was 2 units higher 
than the belief prejudice score; for date, the 
race prejudice score was 2.9 units higher; and 
for accept as kin, the race prejudice score 
was 2.3 units higher than the belief prejudice 
Score. All six social pressure scores to race 
prejudice were also significantly higher than 
the social pressure to belief prejudice scores. 
The six correlations between race prejudice 
and the social pressure to race prejudice scores 
were higher than the corresponding belief 


He Ne STS yt O EET T 
RACE AND BELIEF ON PREJUDICE ACROSS SOCIAL INTERACTIONS 


TABLE 3 


RACE- AND BELIEF-DIFFERENCE SCORES (RD, BD), RACE AND BELIEF SOCIAL PRESSURE 
Scores (RSP, BSP), AND THE CORRELATION COEFFICIENTS BETWEEN RD AND 
RSP AND BD ann BSP FoR tHE Two EXPERIMENTS COMBINED 


Race difference Belief difference 
Interaction Person trp_ap t for rgp- asp 
E x TRD-RSP x TBD-BSP 2 sad ene 
Marry S 4.0 3.1 4.10** 
Parent 5.1 BY fata 24 RP 6.00** 2.84**- 
Friend 43 Soya 28 43** 3.19** 1.69 
Date s 4.6 2.3 5.78** 
Parent 54 des 2.2 Dy Ay Wao 
Friend 4.6 .62** 24 Se 4.77%" 
Kin S 3.0 1.9 5:219% 
Parent 40 .60** 1.7 25° 4.86** 
Friend 3.3 Get 23 ait 2:23* 
Friend S 10 3.2 , 4.70** 
Parent 2.5 .48** 2.4 Rr 
ý ~ Friend 2.3 E s 2.3 Age 
Fish with US 11 2.0 4,43** 
Parent 1.9 att 18 .36** 
3 Friend 1.5 ek e! 1.8 A5ee 
Eat with af 8 1.3 4.20** 
Parent 18 .16* L3 A6** 
Friend 1.0 noone 1.4 LN 
Commanded by S A 3.4 7.45** 
Parent 13 .24** 2.5 A8** 2.49* 
Friend 9 .10 2.9 E ata 4.809** 
Admire S zi 54 8.48** 
Parent 9 .02 3.8 30%" 2.58* 
Friend f p^ bd 44 Ry ed 33195 
Equal S 8 1.8 4.95** 
Parent 1.6 I9 1.7 AZ 2.31* 
Friend 14 Sy bad 1.8 .50** 2.40* 
Exclude S 6 hi 1.26 
Parent 27 Al 1.7 ‘ii .00 
Friend 19 .24** L5 PU ied .68 
"p St 


prejudice correlations. However, the differ- 
ences were significant for only two of the six 
pairs of comparisons. 

In both experiments, all race-difference 
social pressure scores were numerically 
higher than the  belief-difference social 
pressure scores for the interactions marry, 
date, and accept as kin. If we consider the 
joint operation of the size of the social pres- 
sure scores and their effectiveness, as mea- 
sured by the correlations between the preju- 
dice and social pressure scores, we have an 


indication that the higher race than belief 
rejections observed are socially controlled 
through perceived social pressure. 

Marry, date, and accept as kin form a 
natural grouping of the 10 interactions studied 
which we can call interactions controlled by 
high social pressure to race prejudice. The 
three interactions exhibit a range of race 
prejudice scores of 3.3-4.8; in contrast, the 7 
other interactions show relatively low race- 
difference scores in the range of —.1 to 1.3. 
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Interactions jor which Belief Prejudice Is 
Higher than Race Prejudice 


Inspection of the data in Table 3 shows 
that for 6 of the 10 interactions, belief preju- 
dice was found to be significantly higher than 
race prejudice. The 6 interactions in descend- 
ing order of belief prejudice were admire the 
ideas of, be commanded by, accept as a friend, 
fish with, treat as equal, and eat with. 

Commanded by was found to be a unique 
interaction, in the sense that there was a high 
degree of rejection of all stimulus persons. 
Triandis (1967) probably listed commanded 
by as a separate factor due to the unique 
response pattern. For the commanded by in- 
teraction, belief rejection was significantly 
greater than race rejection; the race prejudice 
scores were .4 and .5, while the belief preju- 
dice scores were found to be 4.1 in both ex- 
perimental groups. 

Accept as an intimate friend, go fishing 
with, and eat with were classified by Triandis 

(1967) as belonging to the friendship factor. 
Belief rejection was found to be much stronger 
for the friend interaction than for the other 
two interactions. For all three of the inter- 
actions, belief prejudice scores were found to 
be significantly higher than race prejudice 
scores. 


Interaction with No Difference between Race 
and Belief Prejudice 


The least discriminating interaction varia- 
ble was found to be exclude from my neigh- 
borhood. As Table 1 shows, both race and 
belief prejudice scores were small in the com- 
munist belief experiment, .5 and .6, respec- 
tively, and the differences between the race 
and belief prejudice scores were not signifi- 
cant. A similar pattern was obtained in the 
civil rights experiment. 


Comparison of Race and Belief Prejudice 
Scores Adjusted for Social Pressure 


A direct test of the hypothesis that belief 
prejudice is a greater determinant of rejection 
than race prejudice, provided no strong social 
pressure operates in the opposite direction, 
can be performed by comparing the race and 
belief prejudice scores after they have been 


adjusted to eliminate the influence of social 
pressure. By applying a regression analysis, 
we can compute the prejudice score expected 
if there was no perceived social pre: 
erating. 

The regression equation, for use with raw 
scores, was given by Hays (1963, p. 504): 


ire op- 


X'2 bx.y(Y — My) + Mx 


where X' is the expected prejudice score; by. y 
is the slope of the regression line of X : 
= rxy Sx/sy; Y is the perceived soci: 
sure score; My is the mean perceived social 
pressure score; and M x is the mean prejudice 
score. 

Because present interest was in calculating 
the expected prejudice score, X', under the 
condition of zero perceived social pressure, the 
neutral point on the social pressure scale, the 
value of O for Y was substituted in Equation 
du . 


X bx. y(0 = My) + Mx or 

X’ = Mx —bx.yMy. [4] 
Equation 4 is the formula used to calculate 
the adjusted rejection scores, in order to ob- 
tain the value of the mean rejection score ex- 
pected under the condition of no perceived 
social pressure. This score is the same score 
which would be obtained if the scores for each 
individual were adjusted for the operation of 
perceived social pressure, and then the mean 
adjusted score for each interaction were com- 
puted. 

Before carrying out tests of significance be- 
tween the race and belief prejudice scores, 
the standard deviations of each were adjusted 
for the reduction in variability accounted by 
the linear relationship of predicting prejudice 
scores from the perceived social pressure 
scores. Since the amount of variance explained 
by the relationship between the two sets of 
scores is 7°, the adjusted variance is 


[5] 


Using the adjusted prejudice scores and the 
adjusted standard deviations, £ tests were car- 
ried out between the mean race and mean be- 
lief prejudice scores for each interaction. The 
degrees of freedom used were reduced by one, 


$8? = $2(1 — 7°), 


- prejudice scores, the adjusted mean belief 


ae 


TABLE 4 


AND BELIEF-DIFFERENCE Scores (RD, 
ADJUSTMENT FOR PERCEIVED SOCIAL 
AND t TESTS BETWEEN RD AND 
THE COMMUNIST AND CIVIL 
HTS BELIEF EXPERIMENTS 


lI 
ymunist experiment Civil rights experiment 


1RD-hD lu RD|M BD tnp-an 


! RD|M BD 
- | 


3 | 19| 4,87 | 12| r9 | 1.738 
o | rz| X83 | 12) 14 | 42 
8 15 | 1.69% 8} 12 92 
3| 25 | 933 | -2| 18 | 5.00% 
8 | 12] 194 3h42. dv 
Eat with 6 | sf 18 5 9 1.80* 
Commanded | 
E iby 3| r1 3.68** ATA 8.27** 
Admire E | 23 | 1472 | —2 | 4.0 | 17.11" 
ual 5 1.3 | 1.96* zs | AN 2T 
_ Exclude 2 | =.7 | 126 Cia es 05 
*ġ <.05 


*&$ 2.01, one-tailed tests. 


in order to allow for the additional restraint 
" imposed by the linear regression. 
Table 4 presents the adjusted mean race 


prejudice scores, and the results of £ tests for 
testing the significance of the differences be- 
tween the adjusted mean race and belief 


_ prejudice scores for each of the 10 interac- 


tions in Experiments I and II. Conforming to 
the statement of Hypothesis 2, that the belief 
prejudice scores are expected to be higher 
than the race prejudice scores, the / tests in 
Table 4 were one-tailed tests. 

In both experiments, 9 out of the 10 inter- 


js actions resulted in numerically higher ad- 
- justed belief rejection, as compared to the 


adjusted race-rejection scores. The 1 interac- 
tion for which race prejudice was numerically 
higher than belief prejudice was exclude from 
my neighborhood, for which all prejudice 
scores were small (.2 and .6 for race, and —.7 
and .6 for belief prejudice). 

The most dramatic result of adjusting for 
the operation of social pressure took place in 
the interactions marry, date, and accept as 
kin, for which all unadjusted race prejudice 
scores were numerically higher than the cor- 
responding belief prejudice scores, as out- 
lined in Tables 1 and 2. Elimination of the 
effect of perceived social pressure resulted in 
a reversal of the relationship; all belief preju- 
dice scores for the three interactions were 
numerically higher than the race prejudice 
scores, as reference to Table 4 indicates. 


TW XEM CDU 


RACE AND BELIEF ON PREJUDICE ACROSS SOCIAL INTERACTIONS 


Se a V iow 
Im 9^ "wy 


Without adjustment for social pressure, the t 
marry, date, and accept as kin interactions — 
resulted in significantly higher race than be- 
lief rejection in the civil rights experiment, as 
outlined in Table 2. Inspection of Table 4 
shows that after adjusting for the operation 
of perceived social pressure, there was signifi- 
cantly higher rejection on the basis of belief 
than race for the marry interaction. In the 
communist belief experiment, the date and 
accept as kin interactions exhibited signifi- 
cantly higher race than belief rejection before 
adjustment, while after adjustment for per- 
ceived social pressure, belief prejudice became 
significantly higher than race prejudice. 

To sum up, we find that elimination of per- 
ceived social pressure influences from the 
prejudice scores for the interactions marry, 
date, and accept as kin, which showed a higher 
apparent race than belief prejudice, results in 
a reversal of the relative influence of race and 
belief prejudice. Before adjustment, all six 
race-rejection scores calculated in the two ex- 
periments were numerically higher than the 
belief prejudice scores; however, they all be- 
came lower after elimination of the effect of 
the operation of social pressure on race and 
on belief prejudice. Five of the race prejudice 
scores were significantly higher than the belief 
prejudice scores before, but after correcting 
for social pressure none of the race-difference 
scores were higher than the belief-difference 
scores, and three of the belief prejudice scores 
were significantly higher than the race preju- 
dice scores. 


DISCUSSION 


The results show that when the effect of 
perceived social pressure on both race and 
belief prejudice is statistically eliminated, be- 
lief prejudice is stronger than race prejudice, 
even for intimate social interactions. The ob- 
servation of Triandis and Davis (1965), that 
friendship acceptance operates according to 
the belief-congruence principle, but behavioral 
intentions for small social distance interac- 
tions, such as marriage or dating, are gov- 
erned more by racial considerations, was con- 
firmed. However, their proposal, that the de- 
gree of intimacy of the interaction will deter- 
mine whether race or belief is the more im- 
portant determinant of behavioral intentions, 
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can be replaced by the more general explana- 
tion that differential perceived social pressure 
toward race and belief prejudice accounts for 
the data in the literature. The operation of 
perceived social pressure is consistent with 
the findings of regional and national varia- 
tions in prejudice, as we can expect regional 
and national variations in effective social pres- 
sure against interracial interactions. 

It is of interest to note that the perceived 
social pressure variable operates differently 
for race and belief prejudice. The present 
study provides the first empirical demonstra- 
tion of the fact that both perceived social 
pressure and effectiveness of the perceived 
social pressure, as measured by the correla- 
tion coefficient between the prejudice and so- 
cial pressure scores, are significant determi- 
nants of race rejection. As examination of the 
correlation coefficients 755 ssp between belief 
rejection and perceived social pressure to be- 
lief rejection indicates, social pressure to be- 
lief rejection is a significant factor, but the 
effectiveness is level across the 10 social 
interactions. In contrast for race rejection, as 
examination of the 755 rsp values in Table 3 
indicates, the perceived social pressure effec- 
tiveness is the highest for the three marital 
types of interactions, for which the race re- 
jection was also the highest. 

The finding that for high race rejection to 
take place there is a high effective social pres- 
sure, while for high belief prejudice there is 
no highly effective social pressure, indicates 
that over and above the perception of parental 
approval and disapproval for engaging in in- 
terracial social interactions, another variable 
operates to make the social pressure more 
effective for race rejection in certain interac- 
tions. One possible explanation is that in ad- 
dition to the degree of approval and disap- 
proval measured in the current study, expecta- 
tions about more severe sanctions against some 
of the interracial behavioral intentions make 
the observed social pressure score more effec- 
tive for the situations in which there is higher 
race than belief rejection. Further research 
can answer the question, Why does social pres- 
sure against dating a Negro by a white person 
result in a higher degree of conformity than 
social pressure against dating a communist? 


Prejudice, Social Pressure to Prejudice, and 
Balance Theory 

The statement that acceptance oí people is 
based on belief and not on race c uence 
is compatible with Heider's (1958) discussion 
of stable triads between a person, P, snother 
person, O, and an object, X. The POX bal- 
ance theory model states that if there is per- 
ceived congruence of beliefs of O toward X 
by P, O will be liked or accepted by P under 
balanced conditions. The race of O in rela- 
tion to P does not need to be considered; it 
could enter the picture only if it results in 


misperception of the relationship of O to X, 
as in racial stereotyping or in attributing un- 
acceptable beliefs to an undifferentiated mem- 
ber of another race. 

Because of the operation of perceived social 
pressure to race rejection, the POX model 
needs to be modified to account for the ad- 
ditional relationship perceived by P, that a 
liked significant other, S, rejects O on the 
basis of race and expects P to do the same. 
The actual relationship, in terms ac- 
ceptance or rejection of O by P, will be the 
resultant of two sets of forces, the POX bal- 
ance as outlined by Heider, and the PSO 
balance. The two sets of forces can oppose 
or reinforce each other. Whenever conformity 
to social pressure is a significant determinant 
of behavioral intentions, the balance mode! for 
determining the acceptance or rejection of 
another person can be, and needs to be, ex- 
panded to take into account both the belief 
congruence and the social pressure variables. 


Is Perceived Social Pressure an Independent 
Variable? 


The possibility exists that the social pres- 
sure judgments obtained in the study have 
been contaminated by rationalization of the 
rejection score judgments. The risk of con- 
tamination was reduced by construction of 
the questionnaires in such a way that the 
social pressure judgments were made inde- 
pendently of the rejection judgments. One 
set of internal comparisons of the data sup- 
ports the belief that the judgments were made 
independently. Comparisons between the rank- 
order correlations between the mean prejudice 
and mean social pressure scores across the 10 


‘social interactions show that there was a 
“higher correlation between social pressure by 
T friends and prejudice (7 = .85) than parents 
~ (r=.41) for belief prejudice in the civil 
2 rights experiment. If we assume that the 
social pressure scores are the result of ra- 
- tionalization, we would expect uniform cor- 
relations between the prejudice and social 
pressure scores for parents and friends. 

Using a different design, degree of approval 
and disapproval by parents and friends for 
the interactions with the stimulus persons 
could be measured independently; however, 
- even if there is divergence between the actual 
and perceived social pressure scores, the latter 
"are of interest as a measure of the effective 
psychological variable. 2h27 
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INCENTIVE TO PERFORM COUNTERATTITUDINAL ACTS: 


BRIBE OR GOLD STAR? * 


DUANE R. KAUFFMANN # 


University of Illinois 


The study of counterattitudinal behavior has focused on the relationship oí 
a monetary incentive to postperformance attitude change. However, little 
concern has been directed to the perceptions and interpretations of those who 
participate in such research. This study examined the effects of four variables 
on the subject's interpretations of the incentive, task, and experimenter. The 
variables were type of act (counterattitudinal versus proattitudinal speech), 
freedom clause (present or absent from the experimental script), amount of 
the incentive ($1 versus $4), and the time at which the incentive was offered 
to the subject (before versus after commitment to the task). Videotapes were 
shown to uncommitted viewers, who were then asked to indicate how they felt 
the subject they had witnessed interpreted the events he had been subjected to. 
The findings support the hypothesis that a large monetary incentive offered 
before commitment is perceived as a bribe, as an incentive to do something 
many other persons might refuse to do. Contrary to expectation, the presence 
of a freedom clause did not result in a greater perception of bribe. The fnd- 
ings are consistent with the contention that subjects are active problem solvers 
in a specialized social setting and with the notion that events in the setting 
may evoke widely divergent views of what the experimenter expects of the 


subject. 


People who are induced to participate in 
counterattitudinal acts sometimes adjust their 
attitudes to harmonize with the behavior 
which they have produced. The magnitude of 
such attitude changes appears to depend upon 
the strength of the incentive that is offered to 
induce counterattitudinal behavior. However, 
experimental evidence concerning the incen- 
tive-attitude-change relationship is somewhat 
contradictory. According to dissonance theory, 
small inducements should generate greater 
dissonance, and therefore larger attitude 
shifts, than large rewards which provide suf- 
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ficient justification for the counterattitudinal 
behavior. The findings of Festinger and Carl- 
smith (1959), Cohen (Brehm & Cohen, 1962, 
pp. 73-78), and Carlsmith, Collins, and 
Helmreich (1966) support this view. On the 
other hand, incentive theory argues that indi- 
viduals offered large rewards should be more 
inclined than those for whom incentives are 
small to engage in "biased scanning" of argu- 
ments pertaining to the propriety of their 
actions. Thus, large rewards should favor self- 
persuasion and large attitude shifts. Evidence 
supporting this contention has been reported 
by Elms and Janis (1965) and Janis and 
Gilmore (1965), and subsequently reviewed 
by Elms (1967). Incentive theorists suggest 
that the contrary findings of dissonance re- 
searchers may reflect the use of inordinately 
large rewards which provoke "negative af- 
fect”; doubts and suspicions may impede 
biased scanning when the proffered payoff 
seems incommensurate with the demands of 
the task. 

In a recent study, Linder, Cooper, and 
Jones (1967) found that the time at which 
rewards were offered determined whether the 
relationship between the magnitude of the 
incentive and the degree of attitude change 


positive or negative. When offered before 
subjects agreed to perform the counter- 
itudinal act, large rewards evoked less 
attitude change than did small rewards. But 
when monetary inducements were announced 
miter the subject indicated his intention to 
comply, incentive theory was supported. These 
— findings suggest that the subject's interpreta- 
tion of the meaning of the reward, and the 
experimenter's motive for offering it, may be 

crucial determinant of his response. As 
Rosenberg (1965) has noted, a large reward 
“may be interpreted as a bribe and may lead 
the subject to conclude that his integrity or 
personality is being tested. In response to 
“evaluation apprehension,” the subject may 
steadfastly maintain his initial attitude and 
thus demonstrate that he cannot be bribed or 
bought off. Although Rosenberg's theory high- 


lights the subject’s interpretations of the re- 
| ward, it is not explicitly concerned with the 


_ time at which the reward is offered. 

Steiner (in press) has presented an analysis 
of counterattitudinal behavior that parallels 
Rosenberg's (1965) study but places even 
"greater stress on the manner in which sub- 
jects interpret the situation. Steiner noted 
that dissonance researchers inform their sub- 
jects that they are free to decline participa- 

1 tion if they wish, and offer a payoff, in some 
"cases as large as $20, for performing the 
 counterattitudinal act. Such precommitment 

bargaining may be interpreted to mean that 
the experimenter believes that the subjects do 

- mot wish to cooperate ( thus, the statement 

. about freedom to decline), and may even im- 

| ply that the experimenter himself has reserva- 
tions about the propriety of the behavior he 
is requesting (thus, the large reward). Under 
these circumstances, subjects may perceive the 
payoff as a bribe for performing a reprehensi- 
- ble act about which even the experimenter has 
doubts. 
On the other hand, incentive researchers 
employ manipulations that appear more likely 
to generate the impression that the proffered 
payoff is a “gold star” for performing a cred- 
- itable act. They generally omit statements 

concerning the subject’s freedom to decline, 
and avoid creating the impression that the 
. . fequested behavior is in some sense repre- 
hensible. Thus, when a large reward is of- 
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fered, the investigator may assert that the 
study is sponsored by a highly respected or- 
ganization (Elms & Janis, 1965), or the in- 
centive may be offered after the subject has 
indicated his intention to comply, practices 
which probably make the payoff seem to be a 
reward for commendable behavior, rather than 
a bribe to induce questionable behavior. Other 
procedures, such as, for example, Finkelstein 
and Rosenberg’s (cited in Rosenberg, 1968, 
pp. 107-109) playing down the duplicity in- 
volved in their task, may also serve to allevi- 
ate the occurrence of negative affect and its 
by-products. 

Noting these differences between the typical 
procedures of dissonance and incentive re- 
searchers, Steiner suggested that the relation- 
ship between the magnitude of the incentive 
and the degree of attitude change depends 
upon whether the reward is perceived as a 
bribe or gold star. When the incentive is of- 
fered before commitment, and especially when 
it is accompanied by an assertion concerning 
freedom to decline, it will be seen as a bribe. 
Under these circumstances, the larger the in- 
centive, the more reprehensible the counter- 
attitudinal act will appear to be, and the less 
likely the subject is to manifest postperform- 
ance attitude change. On the other hand, when 
a reward is offered after commitment, and 
particularly when it is not associated with a 
freedom clause, it will be seen as a gold star. 
The larger the incentive, the more com- 
mendable the behavior will appear to be, and 
the more likely the subject is to bring his 
attitude into harmony with the act. 

A test of Steiner’s contentions would ap- 
pear to require asking subjects how they in- 
terpret the experimental situation to which 
they are exposed. But, it is obvious that direct 
questioning during the course of an experi- 
ment may have reactive effects on subjects’ 
behaviors, and subjects who are interviewed 
at the conclusion of a session are likely to 
describe their perceptions in a manner that 
justifies their overt acts. In order to reduce 
these dangers, the present study examined the 
interpretations of passive observers, rather 
than those of actual subjects. Individuals 
watched videotaped segments of a forced 
compliance experiment and were then asked 
to respond to a series of questions concerning 
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events they had witnessed. Because the ob- 
servers did not commit themselves by overt 
participation in the experiment, it was be- 
lieved that their subjective reports were not 
strongly affected by defensive needs. 

The variables manipulated in the taped epi- 
sodes parallel those employed in forced com- 
pliance research. 


Magnitude of Incentive 


In conjunction with other variables, the 
magnitude of the monetary incentive is ex- 
pected to determine whether observers (and, 
presumably, experimental subjects) interpret 
the proffered reward as a bribe or as a gold 
star. 


Freedom Clause 


The presence of a “freedom to decline” 
clause in the experimental instructions is ex- 
pected to affect the observers’ interpretations 
of how others, including the experimenter, 
view the act the subject is being asked to 
perform. Holmes and Strickland (1970) found 
that the inclusion of such a clause had a 
marked effect on the relationship between the 
magnitude of an incentive and the amount of 
postperformance attitude change. 


Time of Incentive Offer 


The temporal relationship between the offer 
of a monetary incentive and the subject’s com- 
mitment to perform the counterattitudinal 
act was expected to interact with the magni- 
tude of the incentive in producing bribe or 
gold star interpretations. 


Type of Act 


Because it seems probable that payment for 
performing counterattitudinal acts will be 
more likely to generate the bribe perception 
than will payment for producing proattitudi- 
nal behaviors, both types of situations were 
examined. 

It was anticipated that observers will be 
most likely to impose a bribe interpretation 
on taped sequences in which a large monetary 
incentive is offered before the subject commits 
himself to perform the requested act, and 
when the experimenter’s instructions include a 
freedom to decline clause. The gold star in- 
terpretation was anticipated when the mone- 
tary incentive was offered following the sub- 


ject’s commitment, and when nothing was 
said about freedom to decline. ] 

The bribe perception was expected to occur 
more frequently when the requested action 
was counterattitudinal than when it was pro- 


attitudinal. 
METHOD 

Viewers 

Viewers were members of 32 quiz secti for 
Introduction to Social Psychology at the University 
of Illinois. Each section consisted of 10-20 persons- 
A total of 532 viewers participated in t! tudy. 
Nine were discarded for failure to answer the ques- 
tions pertaining to personal background, anc were 


discarded for giving obscene and improbab 
or for failure to complete the major q 
Three hundred and forty males and 178 fer 
comprised the usable sample of 518 viewers. 


Each section viewed a videotape showing seg- 
ments from an experiment which was said t havet 
been conducted and filmed earlier. Each of the 16 
tapes, representing one cell of a 2 (levels of incem 
tive) X 2 (freedom clause versus no freedom clause 


X2 (incentive before versus incentive after commit- 
ment) X 2 (counterattitudinal versus proatiitudinal 
act) design, was shown to 2 separate sections. The | 
first 16 sections were randomly assigned to 1 of the J 
16 treatments. The second quiz section for each tape — 
was selected to have a different quiz instructor and a 
different meeting time than the first section. 


Videotape 


After a brief verbal introduction, the viewers were 
shown the videotape. In an introductory clip, they | 
saw an experimenter sitting alone in his office. They 
heard him assert that experimenters are often guilty 
of not asking their subjects how they interpret 
experimental settings, and that if investigators cor- 
rected this omission, they might be in a better posi- 
tion to evaluate their empirical findings. The experi- 
menter then explained that the viewers would be 
seeing taped episodes in which a student was par- 
ticipating as a subject in an experiment conducted 
"about a year ago." Viewers were told that the 
experimenter knew that this particular subject 
strongly favored draft deferments for college stu- 
dents, but that the subject did not know such infor- 
mation was available to the experimenter. They were 
asked to observe the taped sequences carefully and 
to assist the investigator by reporting their judg- 
ments of the way the subject perceived the situation, 

Following the introductory clip, the scene shifted, 
and the observers saw the same experimenter with a 
male subject (actually an accomplice recruited for 
the role). They heard the experimenter tell the sub- 
ject that he was collecting speeches favoring or 
opposing draft deferments for college students. These 
speeches, the experimenter asserted, would later be 
played to speech clas on the campus, and the 
class members and their instructors would evaluate 
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TABLE 1 
NUMBER OF VIEWERS PER TREATMENT 


Name M 


supo 


Type of act 
Counterattitudinal | Proattitudinal 
Inc v Viewer ——— d 2 
| Freedom clause | No freedom clause Freedom clause | No freedom clause 
| 
| s | s | $1 | $4 TW TN $1 | $ 
Before commitment | ^ Male 30 21 19 21 | 23. | 25 20 22 
| Female 8 9 10 6 14 | 12 17 12 
After com: t Male 21 17 23 TP T2 WTA 17 
| Female 12 13 7 13 alto | Ln ett 15 
| 
the speake i “persuasiveness, sincerity, logical told that he would be allowed 10 minutes to organ- 


soundness of arguments, and things of that kind.” 


Manipuiations 

Type of speech. The experimenter told the subject 
that he needed “one more person to speak against 
draft defe s" (counterattitudinal speech) or 


“one more n to speak in favor of draft defer- 
ments” ( itudinal speech). The experimenter ex- 
plained he subject’s speech would be video- 
taped, that he would introduce the subject by name 
and homet i, and would assert that the subject 
was going to express his own personal views on 
draft deferments. The subject was instructed that 
he would | ven time to prepare a 5-minute speech 
on the ened topic. A proattitudinal tape was 


shown to half the quiz sections, and a counteratti- 
tudinal tape was shown to the other half. 
_ Freedom. Immediately following the task descrip- 
tion, one-half of the quiz sections heard the experi- 
menter tell the subject that he was entirely free to 
decide whether or not to make the speech, and that 
he could “decline and still get credit in Psych 100.” 
For the remaining half of the sections, this state- 
ment was omitted from the script entirely. 
Incentive. The subject was informed that the ex- 
perimenter had a research grant and would pay the 
subject $1 (low-incentive treatment) or $4 (high- 
incentive treatment) for making the speech. Half of 
the sections were exposed to each of these two state- 
ments. 
J Time of incentive offer. For one-half of the sec- 
tions receiving each of the incentive conditions, the 
inducement was offered immediately after the sub- 
ject had been told the topic of the speech, and thus 
before he had indicated a commitment to comply. 
For the remaining one-half of the sections, the in- 
ducement was mentioned after the subject had com- 
Mitted himself to make the speech. 


End of Tape and Questionnaire 


The tape ended with the experimenter reiterating 
that he would introduce the subject by name and 
allege that he was expressing his own personal views. 
The subject was then handed paper and a pencil and 


ize his speech. 

Immediately after the conclusion of the videotape, 
the viewers were given a copy of a questionnaire 
which requested them to indicate how the observed 
subject felt about critical elements of the experi- 
mental situation. The questions elicited interpreta- 
tions of the experimenter's motives and expecta- 
tions, feelings generated by the monetary reward, 
and attitudes toward the experiment and experi- 
menter. 


End of Session and Debriefing 


After the completion of the second questionnaire, 
viewers were thanked for their cooperation and asked 
not to discuss the experiment with other persons in 
the course. Later in the semester, the quiz sections 
were revisited for an explanation of the study and its 
procedures. Time was also given to discussion of 
questions which students raised concerning the re- 
search and its findings. 

A complete outline of the script, procedures, and 
questionnaire may be found in Kauffmann (1970). 


RESULTS 
Analysis 


The questionnaire responses were subjected 
to a fixed-effects analysis of variance, whose 
independent variables, each having two levels, 
were type of act, magnitude of incentive, time 
of incentive offer, freedom clause, and sex of 
the viewer. Since the cell sizes were not equal 
(see Table 1), the analysis used was an un- 
weighted-means formula for unequal ms 
(Winer, 1962, pp. 241-244). All F tests re- 
ported in this study involve 1 and 486 degrees 
of freedom, The means on which these F tests 
are based are found in Table 2. 

Although the sex of observer was included 
as a variable in all of the analyses, it proved 
to have little effect. Except where explicitly 
noted below, neither its main nor interaction 
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TABLE 2 
MEAN Responses TO FIVE QUESTIONS 


Manipulation 
Item | 2 Befo fte 
Sene | ln mm a | Oe ose | io 
nal act | nal act clause | ment ment 
Interpretation of the monetary offer 6.37 6.81 6.14 7.04 6.47 6.71 6.17 7.00 
Freedom to decline participation 7.86 7.40 8.07 7.18 7.95 7.30 | 7.92 7.34 
Effect on how hard the subject will work| 4.94 4.54 4.93 4.55 387 561 485 1.63 
Feelings after completion of the speech | 5.78 7.50 6.52 6.77 6.66 6.63 6.55 6.74 
Resentment at being in the study 547 5.89 5.49 5.58 5.40 5.66 | 529 5.78 
Note.—Underlined pairs of means are significantly different, p < .05. 
effects were significant, and data for the two hear a freedom-to-decline clause indicated a 
sexes were combined for ease of exposition. greater perception of bribe (F = 17.91, p< 
.0001). 
Findings Bearing on the Bribe-Bonus Theory Responses to another questionnaire item 


Bribe versus bonus. Responses to the criti- 
cal question, “How do you think the subject 
interpreted the offer of money?" with end 
points “as a bribe” (scored 1) and “as a 
bonus (scored 11), indicate considerable sup- 
port for the hypotheses advanced earlier. Sig- 
nificant main effects revealed a greater percep- 
tion of bribe for those who observed the offer 
of money before commitment (F = 15.33, p 
< .0001) and for those who received the 
counterattitudinal speech treatments (F = 
4.15, p < 043). As expected, the magnitude 
of the incentive interacted with the time at 
which it was offered (F = 8.83, p< .004). 
The greatest perception of bribe occurred in 
the before-commitment, $4 setting; the after- 
commitment, $4 condition stimulated the 
greatest perception of bonus. As indicated in 
Table 3, the means for the $1 conditions fall 
at intermediate levels of “bribeness” and are 
not greatly affected by the time of offer. Con- 
trary to the hypothesis, those who did not 

TABLE 3 
MEANS FOR THE SIGNIFICANT INTERACTION BETWEEN 


MAGNITUDE OF INCENTIVE AND TIME 
or Irs OFFER 


Magnitude of incentive 
Time of offer 


F $1? $4 
Before commitment 6.37 5.98 
After commitment 6.57 7.45 


also suggest that the time and magnitude of a 
monetary offer affect the manner in which it 
is interpreted. When asked to rank íour in- 
terpretations of the incentive, viewers in the 
$4, before-commitment group were notably 
less likely than viewers in other categories to 
subscribe to the notion that it was a “routine 
payment made to all subjects," and more in- 
clined to believe that it was “an incentive to 
do something many other subjects wouldn’t 
do.” Table 4 reports the mean ranks given to 
these two interpretations by the eight cate- 
gories of viewers. 

When viewers were dichotomized on the 
basis of the interpretation that they ranked 
higher, the data reported in Table 5 were gen- 
erated. Nonparametric tests performed on 
these frequencies indicated that before-com- 
mitment viewers were more likely to favor the 
alternative “incentive to induce” than were 
their after-commitment counterparts (x^ = 
10.18, df = 1, p < .005, for counterattitudinal 
conditions; x? = 18.83, df = 1, p < .001, for 
proattitudinal conditions). Observers who 
heard a $4 offer manifested a greater tendency 
than those exposed to the $1 treatment to in- 
terpret the money as a bribe (y? = 15.27, df 
= 1, p<.001, for counterattitudinal condi- 
tions; x? = 11.17, df = 1, p < .001, for pro- 
attitudinal conditions). Among observers sub- 
jected to the $4 treatments, those witnessing 
the offer before commitment were more in- 
clined to accept the bribe interpretation than 
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TABLE 4 
MEAN RANKS FOR Two INTERPRETATIONS OF THE MONETARY INCENTIVE 


were those who heard the offer after commit- 
ment (x? = 11.04, df=1, p< .001, for 
counterattitudinal conditions; x? = 17.91, df 
= 1, p < .001, for proattitudinal conditions). 
When only $1 was offered, the time of the 
offer did not have a significant effect on the 
viewers’ interpretations (x? — 1.84, df — 1, 
ns, for counterattitudinal conditions; x^ = 
3.16, df = 1, p < .10, for proattitudinal con- 
ditions). The effect of proattitudinal versus 
counterattitudinal speech was not significant 
(x? = 142, df — 1). 

Attitude of other subjects. The impact of 
the magnitude and time of the monetary offer 


TABLE 5 
NUMBER or Viewers FAvoRING Eacu or Two INTER- 
PRETATIONS OF THE MONETARY INCENTIVE 


Treatment 
Item $1 $1 $4 
before before after after 
commit- | commit- | commit- | commit- 
ment ment ment ment 
Counterattitudinal speech treatment 
Routine payment 53 26 52 44 
Incentive to 
induce 14 M 7 t4 
Proattitudinal speech treatment 
Routine payment 55 32 57 48 
Incentive to | 
induce 19 39 9 1 


Treatment 
Item 1 ! | 
$ $4 $i 
| before | before after 
commit- | commit | commit- 
ment ment ment 
Counterattitudinal speech treatment 
le d Lr — — $ 
utinely made to subjects who participate | 
at is supported by a grant 1.37 2.10 €277 
e to induce the subjects to do something P 
students would not do | 2.40 1.81 2.74 2.45 1 
| | 
m = ; 1 
Proattitudinal speech treatment 3 
routinely made 1.45 1.92 1.22 1.40 
Incentive to induce 2.28 1.91 2.60 2.54 
t 


was also revealed by the viewers’ responses to 
an item asking them to estimate the percent- 
age of subjects who would have declined to 
make the speech if “no money had been of- 
fered in the experiment.” Table 6 reports the 
number of viewers who estimated the refusal — 
rate to be 10% or less, as opposed to 11% or — 
more. Chi-square tests of the reported fre- 
quencies indicate that before-commitment 
viewers were more likely to expect refusal 
rates of 11% or higher than were after-com- 
mitment viewers (x° = 15.88, df=1, p< 
.001, for counterattitudinal conditions; x^ = 
6.61, df = 1, p < .025, for proattitudinal con- 
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TABLE 6 


NUMBER or VIEWERS Estmatinc Two LEVELS 
or REFUsALS TO Participate Hap No 
MONEY BEEN OFFERED IN THE 


EXPERIMENT 
Treatment 
Estimated 
$1 4 $1 $4 
See before de after after 
commit- | commit- | commit- | commit- 
ment ment ment ment 
Counterattitudinal speech treatment 
10% or less 43 24 52 44 
11% or more 24 33 11 16 
Proattitudinal speech treatment 

10%, or less 50 42 56 42 
11% or more 24 29 11 17 
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ditions). Viewers who heard $4 offered to the 
subject predicted higher refusal rates than 
those hearing a $1 offer (x* = 6.14, df — 1, 
p < .025, for counterattitudinal conditions; 
xX = 3.60, df = 1, p < .10, for proattitudinal 
conditions). The $4, before-commitment view- 
ers expected more refusals than their aíter- 
commitment counterparts in the counteratti- 
tudinal treatment (x? = 11.72, df=1, p< 
001), but the trend was not as clear for the 
proattitudinal treatment (x? = 2.043, df = 1, 
p < .250). The effect of proattitudinal versus 
counterattitudinal speech was not significant 
(x? = 1.009, df = 1). 

The larger estimated refusal rates for $4 
viewers than for $1 viewers, and for “before” 
viewers than for “after” viewers, suggest 
that the assigned task is judged to be more 
onerous and/or reprehensible when large 
incentives are offered, and particularly when 
they are offered before commitment. 

These findings indicate that a large pay- 
ment offered before commitment is likely to 
be interpreted as a bribe (“inducement”) for 
performing an act of questionable desirability, 
an act which the subject believes that many 
of his colleagues might refuse to perform. This 
conclusion appears equally valid, regardless of 
whether the behavior is dissonant or conso- 
nant with the actor’s own views. 

Freedom to decline. The time and magni- 
tude of the monetary offer also influenced the 
viewers’ interpretations of the subject’s free- 
dom to decline to make the speech. In re- 
sponse to the question, “How free do you 
think the subject felt to decline to partici- 
pate?" viewers generally indicated that they 
felt the subject was more than a little co- 
erced. The mean rating for all viewers was 7.6 
on a scale ranging from 1 (“completely free”) 
to 11 (“not at all free"). Even for viewers 
who heard the experimenter explicitly inform 
the subject that he was free to decline, the 
mean was 7,18. Findings of this kind raise 
grave doubts about the amount of freedom 
subjects experience in laboratory studies. 

Analysis of variance performed on the free- 
dom ratings revealed that significantly greater 
freedom was attributed to the subject when 
the monetary incentive was offered after com- 
mitment than when it was offered before (F 
= 6.38, p < .02). However, contrary to ex- 


pectation, more freedom was ascribed in the 
$4 conditions than in the $1 conditions (F = 
7.84, p « 006). Viewers who heard the 
* freedom clause" attributed more freedom to 
the subject than those who did not (F— 
14.83, p < .001), and greater freedom was 
ascribed by those who received proattitudinal 
treatments than by those who were subjected 
to the counterattitudinal treatments (/ 
3.86, p < .05). 


Findings Bearing on Incentive and 
Dissonance Theories 


In forced compliance studies, the focal de- 
pendent variable has been the amount of atti- 
tude change generated by the manipulations. 
Viewers in the present investigation were 
asked to indicate how they felt “students i 
general reacted after making their speeches. 
The 11-step continuum on which responses 
were made was bounded by “much more fa- 
vorable to draft deferments" (scored 1) and 
“much less favorable to draft deferments” 
(scored 11). The sixth interval was labeled 
“didn’t change.” Analysis of responses to this 
inquiry revealed a significant difference for 
the type of act (F = 94.7, p < .00001). The 
means for this effect reveal that viewers ex- 
pected proattitudinal speakers to move in the 
role-played direction (X = 4.10), but those 
who viewed counterattitudinal treatments an- 
ticipated almost no change in the subjects’ 
attitudes. The mean response of the latter 
viewers was 5.87. 

Incentive theory implies a relationship be- 
tween the magnitude of a proffered reward 
and the effort that subjects will expend. 
Viewers in this study were asked, “Do you 
think the amount of money affected how hard 
the subject worked on the essay?” End points 
were labeled “had no effect on how he 
worked” (scored 1) and “had a very large 
effect on how hard he worked” (scored 11). A 
large, significant effect for magnitude or re- 
ward indicated that viewers who received the 
$4 treatments felt that the subject’s efforts 
were more strongly affected by the money 
than did viewers subjected to the $1 treat- 
ments (F = 43.83, p< .0001). A two-way 
interaction revealed that whether the viewer 
was exposed to the $1 or $4 conditions had à 
greater impact on his estimate of the incen- 


's effect when the speech was proattitudi- 
‘than when it was counterattitudinal (F = 
, P< 04). The means for this interaction 
Pare given in Table 7.* 
Dissonance theory contends that counter- 
attitudinal behavior stimulates negative moti- 
“vations. When asked about the feelings of the 
"subject after he had finished his speech, view- 
ers in the present study attributed more guilt 
to those who had given a counterattitudinal 
Speech (F = 88.37, p < .00001). The grand 
— mean on this 1 1-step scale which ranged from 
“guilty about doing something morally wrong" 
"(scored 1) to “happy about having performed 
a useful task" (scored 11) was 6.65, about 
midway between the end points. For viewers - 
in the proattitudinal conditions, the mean 
^ was 7.50. Apparently, many persons had seri- 
—— Qus reservations about the task, regardless of 
the speech topic. 
— Finally, the observers were asked to specify 
"whether they íelt that the subject resented 
— having to participate in the study. As might 
_ be anticipated, those who viewed counteratti- 
tudinal films indicated more resentment than 
those who viewed proattitudinal films (F = 
14.03, p < 0003). In addition, those who 
heard the money offered as part of the initial 
bargain showed more resentment than those 
S. who heard the incentive mentioned only after 
- commitment (F = 6.58, p < .02). 


Fe 


DISCUSSION 


If the impressions and attributions of pas- 
^ sive viewers are indicative of the reactions of 
experimental subjects, then this study pro- 
— vides a modicum of support for each of the 

three theories discussed above. As dissonance 
theory would predict, counterattitudinal 
. speeches appear to leave the speaker with a 
- greater feeling of resentment and guilt than 
do proattitudinal speeches. However, the im- 
pact of this finding is tempered by the fact 
that reported resentment and guilt are un- 
affected by the magnitude of the incentive of- 
3 There was also a significant interaction effect of 
-. SX of viewer and the freedom clause. Male viewers 
attributed a larger role to the monetary incentive 
= When exposed to the freedom clause, whereas female 
viewers manifested an opposite effect. However, 
since sex of viewers was involved in very few sig- 
nificant findings, its impact in the present instance 
- Should probably be interpreted with caution. 
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TABLE 7 


MEANS FOR THE INTERACTION OF MAGNITUDE 
or THe INCENTIVE AND Tyre or Sprecit 


Type of speech 


Proattitudinal 


Incentive f 
Counterattitudinal 


4.34 | 
5.53 


$1 
84 


fered for making the speech. Incentive theory 
is supported by the present viewers' report 
that $4 rewards evoke more strenuous efforts 
than do $1 rewards. But these findings con- 
stitute, at best, only meager evidence concern- 
ing the validity of dissonance or incentive 
theory. The preponderant theme in the data 
is a validation of the “bribery” arguments 
first advanced by Rosenberg (1965) and sub- 
sequently modified and elaborated by Steiner 
(in press). While acceptance of the bribery 
interpretation does not dictate a complete 
rejection of explanations based on dissonance 
and incentive theory, it does imply that both 
of these formulations ignore potentially critical 
aspects of the situation. Subtle changes in 
the experimenter’s behavior can have marked 
effects on the subjects’ conceptions of their 
role and the task that they are being asked to 
perform. 

A large monetary incentive for performing 
an act is likely to be interpreted as a bribe if 
it is offered before a commitment to comply is 
elicited, Such an incentive implies that the 
act itself is onerous or reprehensible, that 
many persons would refuse to perform it, and 
that the individual to whom the offer is made 
has little freedom to decline. Furthermore, 
large incentives appear to have these effects 
even when the act is not counterattitudinal. 
On the other hand, equally large monetary in- 
centives offered after the individual has com- 
mitted himself to the task are likely to be 
interpreted as routine payments for perform- 
ance of a useful service. The act itself is 
judged to be less reprehensible, and to be dis- 
approved by fewer of one’s colleagues; more- 
over, the individual is seen as enjoying a 
larger margin of freedom to decline. Clearly, 
the timing and magnitude of monetary offers 
affect the recipient’s perceptions of the be- 
havior he is being asked to perform. 


Pi 
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The present study was not designed to 
yield data bearing directly on the amount of 
attitude change elicited by counterattitudinal 
(or proattitudinal) advocacy. But the above 
findings suggest an explanation of results ob- 
tained by other investigators. Dissonance the- 
orists, who always offer their incentives before 
obtaining the subject’s commitment to com- 
ply, generally report negative relationships 
between the magnitude of the incentive and 
the amount of attitude change. This pattern 
of results is not difficult to understand if large 
incentives tend to make the act seem repre- 
hensible, unpopular, and coerced. Having been 
bribed, the individual may actually attempt 
to reestablish his independence and integrity 
by adhering more tenaciously than ever to his 
initial views. (“I performed this obnoxious act 
only because I was compelled to do 30; obvi- 
ously I do not subscribe to the position I have 
defended.") 

When incentives are offered after commit- 
ment, subjects are less likely to see them- 
selves coerced into performing an act which 
many of their associates would disapprove. 
Furthermore, the magnitude of a postcom- 
mitment offer has relatively little effect on 
the subjects perceptions of the task or situa- 
tion. Thus, the process envisioned by the in- 
centive theorist is free to operate unhampered 
by inhibitory cognitive factors; the more 
money the subject is to receive, the greater the 
effort he will expend to do a good job, and the 
more likely he will be to convince himself of 
the merits of the position he is taking. How- 
ever, it should be acknowledged that inordi- 
nately large incentives, even though offered 
after commitment, may arouse suspicion and 
doubt, and thus may invalidate incentive 
theory. As Elms and Janis (1965) suggested, 
positive correlations between incentives and 
attitude change should be anticipated only 
when “negative affect” is not aroused. l 

One aspect of Steiner’s (in press) version 
of the “bribe-gold star” theory was not sup- 
ported by the data of this study. Steiner sug- 
gested that a freedom clause should accentu- 
ate the tendency of subjects to interpret large 
before-commitment offers as bribes. Inclusion 
of such a clause in the experimental instruc- 
tions might imply that the experimenter him- 

self doubted the ethics or wisdom of the action 


he was requesting, and thus might accentuate 
the subject’s impression that he was being 
purchased for an unworthy end. Con , 
this contention, the data revealed a signific 
tendency for the inclusion of a freedom clause 
to generate a reduced impression of bribery. 

However, the present manipulations may 
not have provided a true test of Steiner’s 
contention. The freedom clause included in 
the experimental script asserted that the sub- 
ject could leave, at no penalty to himself, if 
he so chose. The subject stayed. Thus, an 
incentive offered after this explicit choice 
point may have seemed to be an “after-com- 
mitment" bonus. Any later manipulation of 
commitment might be expected to have little 
effect after this earlier decision. 

This reasoning suggests that a freedom 
clause not only provides information about 
the desirability of the task but also forces the 
subject to commit himself. Additional research 
is needed to untangle these interrelated ef- 
fects, but present data imply that the intro- 
duction of a freedom clause into the experi- 
mental setting may be a complex and reactive 
procedure. 

On a closely related issue, Steiner argued 
that a freedom clause is probably not taken 
very seriously by experimental subjects, and 
that contrary to the claims made by numerous 
researchers, it does not really induce a percep- 
tion of freedom. In the present study, view- 
ers who heard a freedom clause attributed 
significantly more freedom to the subject than 
did viewers who did not. However, even those 
who heard the clause gave the subject an 
average rating of 7.18 on freedom. Since rat- 
ings could vary from 1 (“completely free") to 
11 (*not at all free"), it is clear that inclusion 
of the clause did not create the kind of per- 
ception that it is widely alleged to produce. 
One may reasonably question the adequacy of 
theories which rest on the assumption that 
telling an experimental subject he is free 
makes him feel free, Although assurances of 
freedom are ordinarily given very early in the 
experimental session, as they were in the pres- 
ent study, the subject is already somewhat 
committed to the task that is imposed upon 
him. He has invested effort in appearing at 
the appointed time and place, and has tacitly 
or explicitly consented to serve as a subject. 


can scarcely withdraw without seeming to 
ir the experimenter’s scientific research. 
short, a freedom clause may not make an 
mental subject feel free, but, as sug- 
d above, it may tell the subject some- 
hing about what is to be expected in the 
—study, and may suggest interpretations of the 
experimental situation not intended by the 
experimenter. 
The results of this investigation underscore 
the contentions of Orne (1962), Rosenthal 
- (1964), Rosenberg (1965), and others. While 
serving in an experimental setting, human 
- subjects are prone to look for cues concerning 
the meaning of the situation. They are likely 
— to react to the aims which they attribute to 
the experimenter, and to behave in ways that 
reflect their own appraisals of the manipula- 
tions to which they are subjected. Subtle vari- 
_ ations in experimental procedures are capable 
- of producing substantial changes in the in- 
terpretations, and, hence, the behavior, of 
Such subjects. In the present instance, the 
magnitude of the monetary incentive and the 
: time (in relation to commitment to comply) 
at which it was offered appear to have had a 
large and potentially decisive impact on the 
subjects’ evaluations of the experimental task 
and their role in performing it. 
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CONTEXT EFFECTS IN IMPRESSION FORMATION: 


THE WEIGHTED AVERAGE VERSUS THE 
MEANING-CHANGE FORMULATION * 


MARTIN F. KAPLAN ? 


Northern Illinois University 


In two experiments, subjects formed impressions of persons described by sets 
of personality-trait adjectives, and then rated the likableness of one test trait 
in the set. Test-trait ratings were shifted toward the values of the other traits 
in the set, a context effect. This context effect was the same for traits high and 
low in connotative variability, as predicted by the weighted average model, 
but contrary to a change-in-meaning hypothesis. Trait credibility and extremity 
of person-impression ratings directly influenced the magnitude of the context 
effect, also in agreement with the weighted average model. 


Context effects have been noted in the study 
of personality impressions since the early work 
of Asch (1946). The rating of a given trait 
(test trait) within the context of other traits 
differs from its *context-free" rating. For ex- 
ample, the trait *clever" would receive a more 
favorable rating in a person also described as 
"intelligent" and “industrious” and a less 
favorable rating when paired with “selfish” 
and “dishonest,” compared to its rating out 
of context. This study considers the two 
major interpretations of this context effect. 

Asch interpreted this context effect as a 
change in phenomenological meaning of the 
test trait. That is, a given trait possesses a 
latitude of meaning, and the meaning which 
best fits the context in which the trait is em- 
bedded is selected. The  meaning-change 
formulation thus implies a change toward 
greater congruity with the context. 

Supportive evidence was offered by Asch 
(1946), who found that synonyms for the 
same trait word varied with changes in con- 
text. Anderson and Norman (1964) have 
pointed out, however, that this does not 
necessarily demonstrate change in meaning; 
the synonyms could be responses to the per- 
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son or to the context, rather than the test 
trait alone. ‘ 2 

Parenthetically, Asch (1946) su; jester 
that order effects, which may be considered : 
special case of context effects, are due 
earlier traits creating a “directed impres sion, 
by which the meanings of later presented 
traits are modified. In a direct test, Tesser 
(1968) found no support for this meaning- 
change interpretation. Ratings of individual 
traits were independent of ordinal position, 
whereas order affected the contribution oi a 
given trait to the person impression, indicating 
that the meaning of a trait is unaffected by 
serial position. 

An alternative interpretation of context 
effects has been provided by Anderson (An- 
derson, 1966; Anderson & Lampel, 1965). In 
this formulation, it is not the meaning that 
changes with variations in context, but the 
affective rating assigned to the test trait, the 
affective rating being a weighted average of 
its context-free value and the impression of 
the person. The distinction thus must be made 
between the meaning of a trait, which is rela- 
tively free of context, and the evaluative rat- 
ing assigned to it. The model proposed by 
Anderson (1966) to account for changes in 
rating is 


s'—ws-4 (1— w), [1] 


where s’ is the rating in context, s is the con- 
text-free value of the trait, Z is the overall im- 
pression, and w is a weighting factor, A num- 
ber of studies have supported the weighted 
average model, though not disconfirming the 


we-in-meaning formulation. The predicted 
relationship has been demonstrated be- 


when an overall person impression is 
nderson, 1966; Anderson & Lampel, 


ation with any one stimulus person (Anderson 
®& Lampel, 1965). 
The present experiments attempted to pro- 


context effects by varying polarity and weight 
s of the overall impression and variability of 

connotative meaning of test traits. In Experi- 
— ment I, the latitude of meaning of a test 
"trait and the set size of the context were 
"varied. In Experiment IT, the weight assigned 
to the person impression was manipulated by 
counting instructions. In both experiments, 
sableness of test and context traits was also 
varied. The experiments provided several pre- 
dictions, based on the two alternative inter- 
- pretations. The first prediction, relevant to 
4 Experiment I, was derived from the meaning- 
change formulation. Three further predictions 
were derived from the weighted average 
. model. 
f Prediction I 

j Ti context effects are attributable to changes 
Lin meaning, they should be greater in test 
- traits of high connotative variability. This 
© Would appear to be a straightforward conse- 
- quence of Asch’s meaning-change formulation. 
Qualified support for this prediction has been 
_ reported by Wyer and Watson (1969), but 

' Several shortcomings render interpretation 
questionable; these will be discussed later. 


- Prediction II 


It has been amply demonstrated that in- 
_ reases in set size lead to increased polarity of 
. impression (cf. Anderson, 1967). Therefore, 
_ Increases in context effects should be directly 
. elated to increases in set size; that is, the 
Context effect should be greatest at the largest 
(Set size, This follows directly from Equation 
mf ratings of traits in context are an average 
_ €f context-free ratings and overall impression, 
_ then a linear relation between trait rating and 
polarity of overall impression should be found. 
— That is, the magnitude of the context effect 
K should vary directly with discrepancy between 
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the overall impression (7) and the context- 
free test-trait ratings (s). Et 


Prediction 111 


Context effects should not interact with the 
evaluative level of test traits. Again, this fol- 
lows directly from Equation 1, where the 
test-trait rating is a linear function of the 
person impression and the normative test- 
trait rating. 


Prediction IV 


Reducing the weight of the overall impres- 
sion should reduce the context effect. The less 
weight placed on the overall impression, the 
more the test-trait rating should approach its 
normative value, since the weights of the two 
components are inversely related. 


METHOD 


Experiment I 
Procedure 


Subjects rated 42 stimulus persons described by 
sets of personality-trait adjectives. Sets were shown 
with a projector, and the subjects rated each stimu- 
lus person on a separate page of a booklet. First 
they rated their overall impression of the person on 
a 7-point scale anchored at “dislike very much” (0) 
and “like very much” (6). Then, after a pause, they 
rated the test trait on a 21-point scale according to 
how likable or dislikable they considered that par- 
ticular aspect of the person. “Like very much” (20) 
and “dislike very much” (0) constituted the anchors. 

Trait sets were constructed according to a 2 X 2 X 
2X3 matrix, consisting of 2 levels of likability of 
test traits, 2 levels of variability of test traits, 2 
levels of likability of context traits, and 3 set sizes. 
Likableness of test and context traits was determined 
from published norms (Anderson, 19684). Highly 
likable and highly unlikable context traits, 36 of 
each, were randomly chosen without replacement ` 
from those in the range of 5.00-5.73, and .26-1.00, 
respectively; moderately likable and moderately un- 
likable test traits, from subsets ranging from 3.5 to 
3.74 and 2.22-2.54, respectively. Test traits were 
chosen so that half at each level were highly variable 
in the normative ratings (s? > 1.66) and half low in 
variability (s? < .90). 

This variability manipulation reflects between- 
subject variability, and may not be indicative of 
within-subject variability in meaning. To check this, 
an independent sample of 18 subjects was asked to 
rate the selected test traits, along with 12 highly 
likable and 12 highly unlikable filler traits, on two 
successive occasions, with a 2-day interval between 
ratings. The magnitude of the change in rating bè- 
tween the two sessions is an index of within-subject 
variability in meaning. Table 1 presents the mean 5 
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TABLE 1 


Mean Sutets iN Ratne OF Test Traits IN. Two 
SuccEssivE RATING SESSIONS 


Intersubject variability 


Trait likableness | 


High variable Low variable 
Moderately likable 63 32 
Moderately unlikable 45 36 


magnitude of shift in rating between sessions of the 
four types of test traits employed in the experiment. 
As can be seen, the traits that were high in between- 
subject variability were also high in within-subject 
variability (ts=4.17 and 3.35, respectively, df =17, 
p<.005). It is concluded, therefore, that test-trait 
variability represents a real within-subject phenome- 
non. 

The 2X2 matrix of test traits was randomly 
paired with either 1, 3, or 5 context traits at each 
likableness level to complete the 2X2X2X3 
matrix of test and context. The only restriction on 
randomness was that the three set sizes be matched 
in average likableness within each level. To control 
for incidental pairing effects, half of the subjects 
were given a first replication of the sets and half a 
second replication, constructed by randomly recom- 
bining context traits within levels and randomly re- 
pairing context with test stimuli within levels of 
variability and likableness. In addition to the 24 
experimental sets, 6 practice sets with test and con- 
text covering the range of likableness, and 12 filler 
sets composed of uniformly highly likable, moderately 
likable, moderately unlikable, or highly unlikable 
traits, in each set size, were included. Words within 
sets were randomized for simultaneous presentation, 
and the sets were presented in random order. 


Subjects 


Subjects were 76 volunteer introductory psychol- 
ogy students, half of each sex. They were randomly 
assigned to two groups of 38 each, with each group 
receiving a different replication of the stimulus ma- 
terials. 


Experiment II 
Purpose 


This experiment specifically tested Prediction IV. 
The independent variable of prime interest here was 
the weight assigned to the impression. Stimulus ma- 
terials were similar to those employed in Experiment 
I, with the deletion of context set size and test-trait 
variability as variables. 


Procedure 


The experiment was based on a 2 X 4, Test X Con- 
text, design. The four contexts, highly likable, mod- 
erately likable, moderately unlikable, and highly un- 
likable, were randomly chosen from the same subsets 
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as Experiment I and randomly paired with mod- 
erately likable or moderately unlikable test traits. 
Thus, there were 8 sets, each containing two context 
and one test adjective. Three replications of this 2 X 
4, Test X Context, matrix were constructed, with 
different traits being selected for each replication. In 
addition to the 24 experimental sets, 6 practice and 
12 filler sets were constructed, the latter consisting 
of uniformly highly likable, moderately likable, mod- - 
erately unlikable, and highly unlikable stimuli. As - 
in Experiment I, traits were randomized within sets - 
and typed onto slides for simultaneous presen 
and the order of presentation of sets was ra 
Rating instructions were identical to those i 
periment I. 

The between-groups variable consisted oí the 
weight assigned to the overall impression. One group 
of subjects was simply told that each trait was sup- 
plied by someone who knew the stimulus person 
well. The second group was given discounting in- 
structions; they were told that acquaintances had 
been asked to supply many traits of the stimulus 
person, rating their certainty that the person ‘lid, in 
fact, possess a given trait. The traits shown were 
represented as being chosen from the lower «nd € 
the certainty ratings. To facilitate credibility, th 
subjects were told that impressions are often forme 
on the basis of unreliable information and that thi 
was an investigation of the process of forming such 
judgments. 


Subjects 


Sixty undergraduate volunteers were randomly 
divided into two experimental groups, with 15 males 
and 15 females in each. 


RESULTS 
Experiment 1 


Mean ratings of the two dependent varia- 
bles, test-trait impressions and person im- 
pressions, are found in Figures 1 and 2, re- 
spectively. For ease of presentation, data 
were combined across the two variability 
conditions. Within-subjects analyses of vari- 
ance were performed on both dependent vari- 
ables. Since the replication factor was not 
significant in either analysis, data and analy- 
ses were collapsed over replications. 


Test-Trait Ratings 


Three theoretical predictions were consid- 
ered in Experiment I, 

Prediction I. The meaning-change hypothe- 
sis required that context effects be greater 
for test traits of high variability. A context ^ 
effect is clearly evident in Figure 1, as indi- 
cated by the higher ratings assigned test traits 
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in highly lit2ble contexts compared to highly 
unlikable contexts. Contrary to the meaning- 
change hypothesis, however, test-trait varia- 
bility did gnificantly affect the magni- 
tude of the context effect. Statistically, this 
prediction ; tested by the Variability x 
Context interaction, which was not significant 
(F = 2.1 1/75). 

Prediction II. The averaging model pre- 
dicted that ‘he magnitude of the context effect 
would vary directly with the polarity of 
person impressions. Since the latter increases 
with contex! set size (see Figure 2), the data 
in Figur clearly support this prediction: 
differences in ratings of both moderately lika- 
ble and moderately unlikable traits as a 
function of context become larger with in- 
creased context set size. This prediction was 
further tested by the Context X Set Size in- 
teraction, which was statistically significant 


if = 2/150, p < .05). 
: JII. The averaging model pre- 


dicts that context effects should not depend on 
evaluative level of test traits. Theoretically, 
this assumption would lead to an expectation 


of zero interaction between test and context. 
That this interaction was negligible (F = .81, 


df = 1/75) lends support to the averaging 
model. 
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CONTEXT SET SIZE 


Fic. 1. Mean test-trait rating as a function of value 
and set size of context. (H = highly likable, L = 
highly unlikable, M+ = moderately likable, M— = 
moderately unlikable.) 
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MEAN PERSON IMPRESSION 


CONTEXT SET SIZE 


Fic. 2. Mean person impression as a function of 
test trait, context, and set size of context. (H= 
highly likable, L= highly unlikable, M+ = mod- 
erately likable, M— = moderately unlikable.) 


One last effect was noted: high-variability 
traits generally received more positive ratings 
than did low-variability traits (F = 25.36, df 
= 1/75, p < .001). Since this effect was inde- 
pendent of context and set size, however, it 
does not contradict either the meaning-change 
or averaging hypothesis. 

Related work. Two of the findings require 
comment in light of recent contradictory evi- 
dence. First, no support was found for a 
meaning-change formulation, even when con- 
ditions were conducive to such change; that is, 
the context effect was similar for high- and 
low-variable test traits. A contrary interpre- 
tation was recently advanced by Wyer and 
Watson (1969) in a similar test. Defining 
trait variability by the mean discrepancy be- 
tween typical trait rating (x) and rating un- 
der instructions to judge “most” (m) or 
“least” (1) likableness, they found evidence 
for the Variability X Context interaction pre- 
dicted by the meaning-change formulation. 

A number of considerations, however, limit 
interpretability of their results. The primary 
problem is that the Variability X Context in- 
teraction was in the predicted direction only 
when the x — 1 discrepancy was the criterion. 
That is, compared to low-variability traits, 
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15 dition, the subjects were asked to form an 
mą impression of the person described by a trait 
14 set prior to rating of the test trait; in the 
w. second, they simply rated the traits as a col. 
eA lective, without being told that a single person 
E 1 SOUEITIORR was being described. Since an analysi of vari- 
1 à à HoH weicHt Ance Was not reported for the form condi- 
SEn Low were OR alone, it is unclear whether the i erac- 
E tion would obtain given the same rating in- 

40 structions as the present experiment. 
m j i The second finding requiring comment is 
Z 9 m. the lack of a significant Test X Contex! inter- 
y action. This would be predicted by anv linear 
8 model (Anderson, 1966). However, Wyer and 
Dermer (1968), contrary to the p: t re- 
7 sults, found such an interaction in a series of 
A l studies. It should be noted that they con- 
L M- Ms H structed their trait sets from a limited pool of 
CONTEXT adjectives, so that each trait appeared eventu- 


Fic. 3. Mean test-trait rating as a function of 
context and weighting conditions. (H — highly 
likable, L = highly unlikable, M+ = moderately lik- 
able, M— = moderately unlikable.) 


context effects were greater for variable traits 
when variability was defined by magnitude of 
the x —1 discrepancy but less when m — x 
was defined. When both ranges of variability 
(x — l and m — x) were combined, the inter- 
action was negligible. This agrees with the 
present results. 

A further problem in interpreting Wyer 
and Watson's obtained interaction is that it 
largely occurs in ratings within highly positive 
contexts. Wyer and Watson concluded that 
the magnitude of the context effect is related 
to test-trait variability when the context is 
positive and variability is limited to nuances 
below typical ratings, providing limited sup- 
port for the meaning-change hypothesis. The 
facts that a very limited sample of context 
traits were employed (two traits at each con- 
text-likability level), that results were specific 
to one particular context, and that the direc- 
tion of results was contingent on the type of 
variability examined suggests the alternative 
possibility that the interactions were random 
effects. 

A final consideration is that their observed 
interaction was based on pooled data írom 
two instructional conditions. In the first con- 


ally as both test and context, and eac? 
trait was rated several times, In contrss 
subject rated the same test trait more than 
once in the present studies. It is possible that 
earlier person impressions formed in conjunc- 
tion with a given test trait acted as contexts 
in future ratings, making the test rating con- 
tingent on past ratings, as well as the present 
context. Further, the limited sample of traits 
employed by Wyer and Dermer conceivably 
allows for effects due to peculiarities associ- 
ated with a given trait-context pairing 


Person Impressions 


The support for the averaging model in 
Prediction IT rests on the assumption that 
increased set size leads to greater polarity of 
person impressions. Figure 2 confirms this as- 
sumption: impressions of persons described by 
highly likable context traits become more posi- 
tive, and those described by highly unlikable 
context traits become more negative, with in- 
creases in set size. This polarizing effect of 
set size is reflected in a significant Context X 
Set Size interaction in the overall person im- 
pression (F = 93.49, df = 2/150, p < .001). 

One further significant effect needs mention, 
this being the three-way Context x Set Size 
X Test-Trait Variability interaction (F= 
4.20, df = 2/150, p < .05). This interaction 
reflected a less pronounced set-size effect for 
sets containing high-variability test traits. 


7 TE 


1 (1967) found variability within sets to 
"be inversely related to confidence in judg- 
ments. If veriance of test traits contributed to 
"total set variance, it is possible that attenu- 
ated set-size effects were due to lower confi- 
dence or weight associated with sets contain- 
ing a variable trait. 


Experiment II 
Test-Trai 

Predicti 

predicts t! 
person impr 
effect. T! 
data, as si 
erately lil 


: IV. The averaging formulation 
a reduction in weight of the 
;sion would decrease the context 
rediction is supported by the 

"n in Figure 3. For both mod- 

‘ble and moderately unlikable 

traits, the effect of the context was more pro- 

nounced under high weighting conditions, as 

- evidenced by the crossing of the high-weight 

and low-weight curves. Statistically, this pre- 

diction was tested by the Weighting Instruc- 
tions X Context interaction, which was sig- 

nificant | 4.24, df = 3/87, p € 01). 

Once . consistent with the weighted 
average model, the Test X Context interac- 

- tion was nonsignificant (F = 1.94, df = 3/ 
87). That is, the displacement of test-trait 
ratings toward the context was similar for 
both moderately likable and moderately un- 
likable traits. A three-way interaction, how- 
ever, was found between test, context, and 
weighting instructions (F = 3.00, df = 3/87, 
p< .05). While Test X Context interactions 
were nonsignificant in both weighting condi- 
tions when analyzed separately, the devia- 
tions were in opposite directions, accounting 
for the observed triple interaction. 


Person Impressions 


Person impressions were analyzed as a 
function of weighting instructions and value 
of the context and test-trait components. In- 
crements in value of trait components, 
Whether designated as test or context, led to 
increments in person impression ratings (see 
Table 2). Person impressions were linearly 
related to component values, with the Test X 
Context interaction being nonsignificant (F 
= L87, df = 3/87). 

Anderson (1968b) has proposed a model in 
Which person impressions are à weighted 
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TABLE 2 
MEAN PERSON IMPRESSION As A FUNCTION oF 
WEIGHTING CONDITIONS AND TRAIT 


LIKABLENESS 
= = ——————— 
‘Weighting cond. 
Trait 
High Low 

Context 

Highly unlikable 1.80 241 

Moderately unlikable 3.10 3.04 

Moderately likable 4.70 4.66 

Highly likable 5.76 5.31 
Test 

Moderately unlikable 3.46 3.53 

Moderately likable 44 4.03 


average of trait values and an initial impres- 
sion, In this model, weight assigned to a pre- 
existing initial impression (presumably neu- 
tral) varies inversely with weight assigned to 
the traits. Therefore, it is expected that with 
less weight placed on traits, responses should 
tend more toward the initial impression, that 
is, neutrality. Impressions obtained under dis- 
counting instructions were less polarized, con- 
sistent with this prediction. Table 2 shows 
that given unlikable traits, high-weight in- 
structions led to more negative ratings than 
did low-weight instructions. Similarly, given 
likable traits, high-weight instructions led to 
more positive ratings. These observations were 
supported by obtained Instructions X Context 
Traits (F = 7.13, df = 3/87, p < 001) and 
Instructions X Test Traits (F = 5.84, df= 
1/87, p < .05) interactions. 
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DISCUSSION 
Weighted Average Model | 


Support was obtained in both experiments 
for the averaging model. That all predictions 
were verified, both for test-trait ratings and 
person impressions, has already been noted. 
Several results, however, invite further dis- 
cussion. 

Increased set size led to more extreme per- 
son impressions (Figure 2) as well as greater 
context effects (Figure 1). This agrees with 
the prediction that context effects are a func- 
tion of the discrepancy between person im- 
pression and test-trait value. As a further test 
of this association between context effects and 
person impressions, an analysis of covariance 
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was performed on test-trait ratings, with 
person-impression ratings treated as a co- 
variate. The Context X Set Size interaction 
became nonsignificant when test-trait ratings 
were adjusted for person impressions (F = 
2.27, dj — 2/149), supporting the predicted 
relationship between the two variables. 

Experiment II indicated that when the 
overall person impression is discounted, the 
rating of the test trait will be less influenced 
by context (Figure 3). This again suggests the 
importance of an overall impression in con- 
text effects. That the context effect was di- 
minished by discounting instructions is un- 
derscored by the lack of a significant Test 
Trait X Instructions interaction. This sug- 
gests that attenuation of the context effect 
cannot be attributed to a general tendency to 
assign more neutral ratings to test traits under 
discounting conditions. 

An alternative explanation for the observed 
diminishment of context effects due to dis- 
counting is based on the fact that person im- 
pressions were also less extreme in this condi- 
tion. If the test-trait rating in context is an 
average of its rating in isolation and the over- 
all impression, then the trait rating would be 
expected to be less extreme, that is, less af- 
fected by context, if the overall impression is 
less extreme. Both interpretations of Experi- 
ment II, that is, that either the diminished 
weight of the overall impression or that the 
more neutral person ratings served to diminish 
context effects, are consistent with the 
weighted average model. 

On the basis of the preceding discussion, it 
appears that an overall impression with some 
degree of associated weight is necessary to 
obtain context effects. The greater the weight 
assigned to the impression, the greater the 
observed context effect (Experiment II); and 
with weight held constant, the greater the dis- 
crepancy between the impression and the 
trait rating in isolation, the greater the con- 
text effect (Experiment I). This overall im- 
pression need not be a person impression. 
Wyer and Dermer (1968) have demonstrated 
that context effects may occur when subjects 
are simply instructed to rate test and context 
collectively, without ascribing them to a per- 
son. 


Change-in-Meaning Formulation 


The lack of a Variability x Context inter- 
action argues against a change-in-meaning 
formulation. Context effects were not vreater 
for those trait adjectives permitting greater 
shifts in meaning. 

A meaning-change formulation could con- 
ceivably account for the major finding in Ex- 
periment I, that context effects increase with 
increments in  person-impression larity. 
That context effects were reduced by ount- 
ing instructions in Experiment II is also con- 
sistent with the meaning-change hypothesis if 
shifts in meaning are considered to be pro- 
portional to discrepancy between test and 
context. 

It is important to note, however, that the 
major findings in both experiments were not 
clearly predictable from the meaning-csange 


hypothesis. For example, this formulation does 
not require a greater shift in response with | 
increased set size in Experiment I. In ‘act, it 
could be easily argued that mere increase in 
the number of similarly valued traits si : 
not appreciably affect the meaning 
to a trait. Furthermore, meaning cha 
gests that changes in meaning occur as addi- 
tional context traits clarify the meaning of 
test traits. There is no clear requirement that 
trait ratings should change in the direction of 
evaluatively discrepant contexts; the direc- 
tion of change is unspecified. Similarly, the | 
effect of instructions in Experiment H was 
not clearly predictable from the meaning- 
change hypothesis; there is no reason to ex- 
pect test-trait meanings to be clarified more 
by reliable, as compared to unreliable, con- | 
text traits. 

In contrast, the obtained influence of set - 
size and discounting instructions upon the — 
context effect was required by the averaging 
formulation. The effect of increased discrep- 
ancy between test trait and person impres- 
sion is a critical test of the averaging model. 
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Ninety-eight psychology graduate students were asked to forecast first-year- 
graduate grade point averages based on 110 profiles containing 10 cues derived 
from their fellow students' records. For each subject, the cue values were re- 


gressed onto the subjects’ judgments 
that the models (predicted judgments 
own judgments when correlated with 


(the model of man). Results indicated 
) were more accurate than the subjects’ 
the actual criterion. The average judg- 


ments for each profile were considerably more valid than the average of the 


judges' individual validities. Even the 


average judgments were improved upon 


by modeling. It was recommended that the model of man substitute for man 
in forecasting graduate grade point averages. 


The relative superiority of the actuary over 
the human judge in forecasting a criterion 
has been well-documented (Goldberg, 1965; 
Gough, 1962; Meehl, 1954; Sawyer, 1966). 
Actuarial methods, however, presuppose 
knowledge of the empirical relationships exist- 
ing between the predictors (cues) and the 
criterion. Unfortunately, for many, if not 
most, clinical judgment situations, informa- 
tion regarding criterion outcome is sadly lack- 
ing. Consequently, although the human judge 
is a notoriously unreliable predictor (Gold- 
berg, 1968; Little & Schneidman, 1959), his 
judgments are often the only data on which to 
base assessment decisions. Indeed, many cli- 
nicians would prefer the “human” element to 
enter selection decisions, even for those situa- 
tions in which actuarial methods can be used 
(Holt, 1970), 

Goldberg (1970) recently raised the pro- 
vocative question of whether the clinician’s 
predictions might be improved even in those 
circumstances for which criterion information 
was lacking. Goldberg considered the situa- 
tion in which clinical judges are asked to make 
a diagnostic judgment for each of a series of 
profiles (representing, e.g., patients, appli- 
cants, etc.), where each profile consists of a 
set of previously quantified cues. The cue- 
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Institute of Mental Health Grant MH 13892 to the 
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weighting scheme of each judge is captured 
(Hoffman, 1960; Naylor & Wherry, 1955) by 
treating the judgments as the dependent 


variables and the cue values as the indepen- 
dent variables in a multiple- regression equa- 
tion. This multiple-regression equation relat- 
ing cues and judgments can be thought of as 
the linear “model” of the man (Goldberg, 
1970). The corresponding multiple « rrela- 
tion indicates the linear predictability of the 
man, Similarly, the multiple-regression equa- 
tion relating cues to criterion will be termed 
the “actuarial” formula. The corresponding 
multiple correlation between cues and criterion 
represents the linear predictability of the cri- 
terion. The correlation between the clinician’s 
own judgments and the criterion values indi- 
cates the accuracy of the judge. 

Goldberg has argued that one of the major 
reasons that the actuarial formula outperforms 
man is the inherent unreliability of human 
judgments, as compared to the reliability of 
the predicted criterion values generated by the 
actuarial formula. Extending this reasoning, 
Goldberg noted that the model of man also 
should be more reliable than the man himself. 
That is, for a given judge, the predicted judg- 
ments based on a regression equation relating 
cues to judgments would be more reliable than 
the actual judgments themselves. It would 
follow, then, that these predicted judgments 
should be more valid than man’s own judg- 
ments when they are correlated with an eX- 
ternal criterion. Goldberg (1970) noted: 
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tent that the model fails to capture valid 
jance in the judge's decision processes, 
perform less creditably than the judge. To 
that it eliminates the random error com- 
gments, it should perform more 
Ire [p. 424]. 


's (1964) reformulation of the 
juation (Hammond, Hursch, 
Todd, 1964), Goldberg (1970) derived the 
mathematical conditions favoring man’s model 
‘over man. Specifically, the model will be more 
"accurate when 


GR(1—R)-Cy/I —R&SyVI— R5, 
where G = the linear component of the judg- 
mental accuracy, the correlation between the 


-Jinear mode! oí the judge and the linear model 
of the criterion; R, = the linear predictability 
‘of the criterion, the multiple correlation be- 
tween cues and criterion values; R, = the 
"linear predictability of the judge, the multiple 
correlation between cues and judgments; and 
C= the nonlinear component of the judg- 
“mental accuracy, the correlation between the 
residual values of both the criterion and the 
. judge's predictions after removal of the linear 
i components. 

- As Goldberg (1970) noted, the theoretical 
conditions that favor the model over man 
occur when man has some validity and the 
criterion is perfectly predictable from a linear 
- model, As man becomes more linearly predic- 
table, the model’s superiority diminishes. Since 
no criterion and no judge in the real world is 
perfectly linearly predictable, Goldberg sub- 


- jected his formulation to empirical test. His 
- data, originally collected by Meehl (1959), 
; involved the judgments of psychosis versus 
neurosis on an 11-step, forced normal distri- 
bution for 861 MMPI profiles made by 29 
clinical psychologists. The profiles, collected 
from seven hospitals and installations around 
the country, consisted of the eight MMPI 
Clinical scales (excluding Mf) and the three 
validity scales. These data have been described 
extensively elsewhere (Goldberg, 1965, 1970; 
- Meehl, 1959; Wiggins & Hoffman, 1968). 
— For each of the 29 judges, Goldberg (1970) 
computed the model of man by regressing the 
Cues onto the judges’ predictions, thereby gen- 
erating a set of “predicted” judgments. In ad- 
dition, a composite judge was created by 
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averaging the 29 clinicians’ judgments for each | 
of the MMPI profiles, Since actual criterion : 
information was available (hospital staff di- 


agnosis of psychosis or neurosis), Goldberg | 


was able to compare the validity of each judge 
with the validity of each judge's model (i.e. 
the validity of the judgments predicted by the 
model of man) for the seven samples of 
MMPI profiles. Goldberg's results can be 
summarized as follows: 


1. Despite some evidence of slight non- 
linearity or low linear predictability for some 
of the clinicians, the incremental validity of 
man’s model over man was positive in the 
majority of the samples. 

2. The superiority of the model to the man 
held up even when the model was based on a 
small subset of the MMPI profiles and cross- 
validated to the remaining samples. 

3. Although a judge’s model, on the average, 
outperformed the judge himself, the composite 
judge (average judgments) performed as well 
and sometimes better than the model for this 
composite judge. 

4. When criterion information is unavail- 
able, thus vitiating a priori identification of 
the most valid judge or the most valid model, 
Goldberg’s evidence supported the use of the 
composite judge as a relatively accurate pre- 
dictor, Although it is expected that the com- 
posite judge will outperform the mean of the 
individual judge’s validities ë (Goldberg, 
1970; Winkler, 1967), Goldberg’s composite 
judge was even more accurate than the model 
of the composite judge. 

5. When, however, clinical time is too valua- 
ble to be spent in a prediction situation, Gold- 
berg (1970) noted that 
the similarity of the clinician to the composite bore 
4 curvilinear relationship to the modes validity: 
Models of judges who were highly similar to the 


composite performed approximately as well as it did, 
while models of judges who were less similar to the 


composite achieved either quite superior or quite 
inferior validities [p. 4311. 


A conservative strategy thus suggests modeling 
the judge with the highest correlation. with 

3Robyn M. Dawes (personal communication, 
July 1970) has demonstrated mathematically that 
the correlation of an average with a criterion will be 
greater than or equal to the average of the correla- 
tions. 
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the composite judge. Goldberg’s general con- 
clusions were to use either the composite judge 
or the model of a given individual judge. 


The purpose of the present study was to 
test Goldberg’s (1970) hypotheses favoring 
model over man in a clinical judgment situa- 
tion other than clinical diagnosis of the 
MMPI. A task was selected in which clinical 
judges regularly engage, and for which the 
criterion was relatively reliable and moder- 
ately predictable. In particular, graduate stu- 
dents in psychology (the clinical judges) 
were asked to forecast first-year-graduate 
grade point averages based on the profiles 
derived from their fellow students’ records. 
Following Goldberg, it was hypothesized that 
man’s model would outperform man himself. 
Further, it was expected that the composite 
judge should yield relatively accurate predic- 
tions which should surpass the predictions of 
a “typical” judge, that is, surpass the average 
of the individual validities. 


METHOD 
Subjects 


Ninety-eight out of 200 psychology graduate stu- 
dents at the University of Illinois answered a request 
to participate in a study of graduate prediction. The 
98 volunteers represented all 4 years of graduate 
status and were well-paid volunteers. The task was 
carefully explained to all subjects, and they were 
instructed to fill out their booklets on their own time 
and without any help. They were given 2 weeks in 
which to perform the task and return the booklets. 


Task 


Each subject was asked to predict the actual first- 
year-graduate grade point averages of 90 profiles 
representing real psychology graduate students at 
the University of Illinois who had entered between 
the years of 1965-1968. Each profile consisted of 
the following 10 cues: (a) Graduate Record Exami- 
nation (GRE)—Verbal; (b) GRE—Quantitative; 
(c) GRE—Advanced; (d) cumulative undergraduate 
grade point average for the last 2 years of college; 
(e) Astin (1965) ratings of selectivity of the under- 
graduate school; (f) mean peer ratings received on 
a S-point scale for need Achievement, (g) Extra- 
version, and (k) Anxiety; (7) self-rating on con- 
scientiousness; and (j) sex of student. The peer rat- 
ings and self-ratings were obtained at the end of 
the first semester of graduate work for all first-year 
psychology students at the University of Illinois. It 
should be noted that the first 5 cues, a-e, were used, 
in part, as a basis for selecting applicants to gradu- 
ate school at the University of Illinois (Wiggins, 
Blackburn, & Hackman, 1969). 
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Each of the cues was carefully describe? to the 
subject. Examples of norms and averages for various 
cue values were given. The subjects judged 110 pro- 
files altogether: 90 original profiles plus 20 :vpeated 
profiles. The judges’ predictions were made on an 
11-place scale, ranging from a 3.0 grade poin: :verage 
(“C”) to a 5.0 grade point average ("A" steps 


of .2. An average of 3.0 was considered a 
zero point. 


Method of Analysis 


For each of the 98 judges, the followin 
were computed following Goldberg's (197 


nable 
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analy- 
sis: (a) The judges validity (r4): the correlation 
between a judge's predictions and the actual frst- 
year grade point averages. (b) The linear predicta- 
bility of the judge (Rs): the multiple correlation 
between the 10 cue values and the judge's predic- 
tions. (c) The validity of the judge’s mode! (rm): 
the correlation between the actual grade poini aver- 
ages and the predicted judgments from s multiple 


regression of the 10 cues upon the judge’ 
tions. (d) The incremental validity of m over 
man (Am): the arithmetic difference be ; the 
validity coefficient for the model and the judge, that 
is, c— a. (e) The correlation with the com posite 
judge: the correlation of each judge’s predictions 
with the composite predictions across all proñles. (f) 
Test-retest reliability (r..): the correlation between 
the 20 repeated profiles for each judge. =<) The 
linear predictability of the criterion (Re): this rep- 
resents the actuarial prediction, the multiple corre- 
lation, between the cue values and the actual grade 
point average criterion. (h) The accuracy of the 
composite judge: the correlation between the mean 
judgments and the grade point average criterion. (i) 
The accuracy of the model of the composite judge: 
the correlation between the grade point average cri- 
terion and the predicted composite judgments from 
a multiple regression of cue values onto the com- 
posite judgments. (j) Typical judges’ validity: the 
average of the individual judges’ validity coefficients. 
(k) Typical models’ validity: the average of the 
validities of the models of the individual judges. 

The interrelationships among these various indexes 
were computed, and the results were compared to 
those obtained by Goldberg (1970). 
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RESULTS 


When graduate students were asked to pre- 
dict the grade point averages of profiles rep- 
resenting other graduate students, they were 
fairly accurate: the mean validity coefficient 
for the 98 judges was .33, the median validity 
was .34, and the range was .07—.48. These 
validities were substantially raised, however, 
by modeling the judges. For every 1 of the 
98 judges in the present study, the model of 
the judge was more valid than the judgments 
themselves. A summary of the present results 


a comparison with those obtained by Gold- 
(1970) are presented in Table 1. Note 
all of the validities in the present study 
to be higher than Goldberg's, excepting 
least accu judge and the least accurate 
lel. The most accurate model (rn = .64) 
21 correlation points higher than Gold- 
curate model. The most accurate 
judge (r, = 48) was more accurate than 
accurate judge (ra = 39); 


and the highest incremental validity of .37 
far surpassed Goldberg's high value of .07. 
The higher incremental validities of the 
present data compared to Goldberg’s (1970) 
can be explained, in part, from examination of 
Goldberg’s mathematical inequality for de- 
termining w the model will be more valid 


than the man. The inequality suggests that 
high linear predictability of the actual cri- 
“terion from the cues is one factor underlying 
the model’s superiority. Although the linear 
‘predictability of the present criterion (R, = 
169) falls short of the optimal value of 1.00, 
“it clearly exceeds the value (R. = 46) in 
Goldberg's MMPI study. Goldberg’s inequal- 
ity also indicates that the difference between 
“man and model decreases as man becomes 
More linearly predictable (ie, as R, ap- 
proaches unity). In the present study the 
average linear predictability of man was .65, 
compared to the higher value of .77 in Gold- 
berg's study. 

Table 2 presents the intercorrelations 
among six judgmental indexes calculated for 
each judge: accuracy, accuracy of model, 
incremental validity, reliability, linear pre- 
dictability, and correlation with the composite 
judge. In support of Goldberg’s (1970) formu- 
lations, as man becomes more linearly pre- 
dictable, the difference between model and 
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‘TABLE 1 


VALIDITY COEFFICIENTS or JUDGMENTAL VARIABLES ^ 


IN Two CLINICAL PREDICTION as 


Grade 
3 — 
Judgmental variable MEN Ion ¢ 
study) 1970) 
Average incremental validity AT 03 
Composite judge’s incremental 
validity ll —.02 
High incremental validity value KH 7 
Low incremental validity value 08 —.01 
Most accurate judge’s validity -48 -39 
Composite judge’s validity AT 35 
Typical judge's validity (aver- 
age validity) 33 .28 
Least accurate judge’s validity 07 14 
Most accurate model's validity 64 43 
Composite model’s validity 58 33 
Average model’s validity 50 31 
Least accurate model's validity 10 16 
Linear predictability of criterion 69 46 
Mean linear predictability of 
judges .65 Eri 


man diminishes (r= —.39, p < .001). Un- 
fortunately, neither the test-retest reliability: 
nor the correlation with the composite judge 
was related significantly to any of the other 
judgmental variables. These two indexes 
would not provide a reasonable basis for a 
priori selection of the more valid judges or 
the more valid models. 

The relative ordering of validity coefficients 
for predicting first-year grade point averages 
are presented in Table 3 and again compared 
with Goldberg’s (1970) results. The most 
striking finding of the present study is that 
the model of the composite judge outper- 
formed the composite judge himself. The 
incremental validity of the model of the com- 


TABLE 2 
INTERCORRELATIONS AMONG JUDGMENTAL VARIABLES FOR GRADE POINT AVERAGE JUDGMENTS 
Variable 1 | 2 | 3 4 | 5 | 6 
1. Accuracy of man 3" 
2. Accuracy of model Ja eA 
- Incremental validity model—man —.39** 33 
4. Test retest reliability a E dh i t 
. Linea: iras? i d a ; 
on with eda EN 04 10 —.03 00 


- Correlation with composite 
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TABLE 3 
Rank OmpERING Or VALIDITY COEFFICIENTS FOR 
GRADE POINT AVERAGE PREDICTIONS AND 
Psycuoric-NEUROTIC PREDICTIONS 


| Grade point Psychotic 
average neurotic 
Validity coefficient judgment | judgment 
(present (Goldberg, 
study) 1970) 
Linear predictability of 
criterion 69 AG 
Most accurate model | 64 43 
Model of composite judge | 58 33 
Typical model .50 31 
Most accurate judge A8 39 
Composite judge AT 35 
Typical judge 33 28 
Least accurate model -10 16 
Least accurate judge .07 14 


posite judge over composite judge was .11. In 
contrast, Goldberg found a slight negative 
incremental validity between the model of his 
composite judge and the composite judg- 
ments. One possible explanation of this dif- 
ference in results may be due to the fact that 
Goldberg’s composite judge was more linearly 
predictable (R, = .89) than the present com- 
posite judge (R, = .79); an increase in linear 
predictability would be a situation favoring 
man over model. In further contrast to Gold- 
berg’s results, the model of the typical judge 
for the present data (i.e., the average of the 
individual judge's models) slightly outper- 
formed the composite judge. 

Despite these differences, the present re- 
sults generally tended to replicate those of 
Goldberg (1970). For example, the most ac- 
curate model performed almost as well as the 
actuarial formula. Unfortunately, without 
prior knowledge of the predictor—criterion re- 
lationships, it would be impossible to select 
the most valid judge or the most valid model. 
For this situation Goldberg has suggested us- 
ing the composite judge or, when clinical 
time is scarce, using the judge with the high- 
est correlation with the composite. In the 
present data, the accuracy of the single judge 
with the highest correlation with the com- 
posite was only .18, although his model’s va- 
lidity was .36. Goldberg found no relationship 
between linear predictability of the judge and 
his judgmental accuracy; however, the present 
study does indicate a fairly sizable correla- 
tion between these two variables (r = .59). In 
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fact, the accuracy of the judge with the high 


est linear predictability was .43, and his mod- 
el’s accuracy was .50, higher than that of the 
validity of the composite judge. Linear pre- 
dictability, then, might be a viable index on 
which to select a single judge, although as 
linear predictability increases, the difference 
between model and man decreases 
SUMMARY AND CONCLUSIONS 

The major results from the present study 
were: 

1, Each of the 98 judges who predicted 
first-year-graduate grade point averages was 
outperformed by his model. Although Gold- 


berg’s (1970) data revealed a similar phe- 
nomenon, the incremental validities of model 
over man were not as striking as those of the 
present study. 

2. The most accurate model (7 = .64) per- 
formed almost as well as the actuaria’ ‘ormula 
(r = .69) in predicting the criterio: 

3. Although the composite judge performed 
creditably in the present task (r = .47), the 
model of the composite judge was even more 
valid (r = .58). This result is in contrast to 
Goldberg's (1970) findings that his composite 
judge slightly outperformed its mode]. Because 
of the increased reliability of composite judg- 
ments (Cooke, 1967a, 1967b), it would be 
difficult, theoretically, to bootstrap the valid- 
ity of the composite judgments by means of à 
model. However, for the present data, the 
model of the composite judgments did increase 
predictive validity. The reasons for this in- 
crease are not immediately apparent, since 
the number of judges entering into the com- 
posite was large; the large sample of subjects 
should remove unwanted error variance, and, 
as such, the model would not be expected to 
improve accuracy. : 

lt is possible that the present composite 
judge was not particularly representative of 
the individual judges comprising the average 
(Tucker & Messick, 1963). If it were the case 
that the composite judge represented, say, two 
types of judges, both of whom were weighting 
valid but different cues, the linear predicta- 
bility of the composite would be decreased: 
the error variance of the independent varia- 
bles would be compounded rather than can- 


| 


siate of affairs would allow the 
to exceed that of the com- 
>, Evidence for this presumption is 
n’s (1971) individual difference 
cer & Messick, 1963) of the 

4 factor analysis of the 98 
two orthogonal subject factors, 

ere moderately correlated with 
contrast to this result, a factor 

29 Meehl judges used by Gold- 

yielded a single validity factor 
of judges (Wiggins, 1971). A condition favor- 
ing more than one type of valid judge would 
be one which involved a number of uncorre- 
lated but valid cues. The present set of pro- 
files satisfied this condition (Kohen, 1971). 
In contrast, the higher intercorrelations 
“among the MMPI scales (Goldberg, 1965) 
would tend to vitiate the possibility of dif- 
ferent types of valid judges emerging. Thus, 
a situation in which the validity of the com- 
posite judge might be bootstrapped is one in 
which the cues are uncorrelated, and judges 
would be likely to utilize valid, but different, 
sets of cues. 

4. The composite judge was also slightly 
less valid (y = .47) than the average of the 
- models of the individual judges, that is, the 
typical model (r = .50). Goldberg's (1970) 
data indicated the reverse, although the dif- 
ference between the composite and typical 
models was slight. 

5. Neither the test-retest reliability nor the 
Correlation of a judge with the composite was 
- related to man’s validity, his model's validity, 
Or his incremental validity. However, linear 
predictability was significantly, positively re- 
lated to man's validity and his model's valid- 
ity, and was negatively related to his incre- 
. mental validity. 

6. Goldberg (1970) suggested that in the 
absence of criterion information and when 
Clinical time is at a premium, the model of 
- the judge with the highest correlation with 
the composite might be a reasonable predictor. 

the present study this strategy was not 
Supported over alternative strategies; the 
- Validity of the model of the judge with the 
highest correlation with the composite was 
only .36. This is contrasted with the accuracy 
of the model of the judge with the highest 
linear predictability (x = .50). 


- validity. In 
analysis of th 
berg (1970) 
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Based on the above results, a single major 
recommendation can be extrapolated: always 
use the model. This advice, of course, assumes 
that the actuarial formula is not accessible, 
due to a lack of criterion information. When 
clinical time is not at a premium and all 
judges can assess all profiles, the composite 
judge is a fairly valid predictor. But even in 
this situation, the model of the composite may 
be even more valid, particularly if the validi- 
ties of different judges were due to utilizing 
valid, but different, sets of relatively uncor- 
related cues. When clinical time is at a 
premium, it is best to model the judge or 
judges who are able to perform the clinical 
task. In fact, the average of the judges’ mod- 
els slightly outperformed the single most ac- 
curate judge, as well as the composite judge. 

Although the present conclusions are lim- 
ited to a single situation which employed 
fewer profiles than is usually desirable, it 
should be noted that Goldberg (1970) found 
that models of man, which were based on a 
small subset of the sample of profiles, still 
outperformed man himself upon cross-valida- 
tion. Thus, members of admissions commit- 
tees for graduate selection who spend large 
portions of their time making clinical judg- 
ments might well use models of their own 
judgments. Assuming that the committee 
knew the variables on which they based their 
decisions, their judgmental models could be 
determined on one set of cases and used for 
selection of the remaining set of cases. This 
would free valuable time which would be 
better employed in the search for new admis- 
sions indexes. 
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rhe relationship between figure-ground perception and introversion-extraver- 
ion was investigated using more than the usual type and number of figures, 
measures, and groups. The results for six ambiguous patterns, measured in 
ms of the initial percept, its latency and duration, and the number of re- 
rsals, failed to confirm either McDougall’s or Eysenck’s theories relating 
two phenomena. Only the reversal rate, which was lowest for introverts, 
high for both extraverts and ambiverts who did not differ from one an- 


r, indicated any link between the perceptual and personality dimensions. 
nature of figure-ground, especially with respect to personality, and of 


introversion-extraversion, with reference to perception, is discussed in light 
of the results. Although the data suggested some relationship between the two 


ables, which appeared perceptual rather than cognitive or behavioral, cer- 


tain cautions are considered. It is suggested that additional clarification appears 
necessary prior to further investigation of the existence of an effect and its 


determinants. 


Two theories have opposite positions re- 
garding the link between figure-ground percep- 
tion and introversion-extraversion, based on 
divergent views of their mediating neural ex- 
citation (the details of which are not relevant 
to this study). McDougall's (1926, Ch. 28; 
1929) position predicts greater figure-ground 
fluctuation for introverts than for extraverts 
and has received varying degrees of support 
(Eysenck, 1947; Franks & Lindahl, 1963; 
Ginsburg, 1969; Washburn, Keeler, New, & 
Parshall, 1929). Eysenck has argued for the 
contrary view, that introverts should show 
fewer reversals than extraverts (Eysenck, 
1955, 1967; see also, Costello, 1957). Com- 
pleting the possibilities regarding the relation- 
ship between the two phenomena, Guilford 
and Braly (1931) found none. In light of the 
importance of the larger issue to which this 
problem refers, concerning the tie between 
personality and perception (Vernon, 1966, 
Ch. 12), and in view of the fundamental po- 
sition which each topic occupies within its 
own field (Carrigan, 1960; Fisher, 1967), it 


1A briefer report of the research reported in this 
Paper was presented at the meeting of the Eastern 
Psychological Association, Atlantic City, April 1970. 

2 Requests for reprints should be sent to Martin S. 
Lindauer, Department of Psychology, State Univer- 
sity of New York, College at Brockport, Brockport, 
New York 14420. 


is disappointing to find such contradictory 
data. The difficulties may be due to several 
factors: first, a reliance upon a few figure- 
ground patterns, mainly of the reversible per- 
spective type (e.g., Necker cube, staircase, 
etc.), the use of which varies between studies. 
Second, an exclusive dependence upon fluctu- 
ation measures, which is understandable since 
the figural change of orientation character- 
istic of perspective patterns is difficult to de- 
scribe in other terms. Consequently, divergent 
results may be due to the idiosyncracies of 
the specific type of figures irregularly used in 
different studies. There is also the limitation 
of ignoring other characteristics of figure- 
ground perception (Lindauer & Lindauer, 
1970; Price, 1969), for example, which as- 
pect of the pattern is initially seen, a report 
more easily obtained when shape rather than 
spatial orientation is the primary ambiguity 
(e.g., vase-face). In addition, the majority of 
previous studies have omitted a control-group 
moderate on the introversion-extraversion di- 
mension (i.e., ambiverts), assuming that there 
is a similar polarization in perception as in 
personality. Further, many investigators have 
instructed subjects to “hold” or “reverse,” 
thereby possibly confounding the perceptual 
response with either expectancy or response 
bias. In order to remedy these apparent short- 
comings and possible ambiguities in the earlier 
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Fic. 1. Sample and test stimuli used. [Samples: 1 


“Nuns and nudes"; 2—“T’s and 


Test stimuli: 1—‘Maltese cross" (8-sector circle); 2 and 3—“Vase (chalice) and faces"; 4 rm 


and Yang"; 5—“Gog and Ma 


research, the present study sought to more 
reliably establish an effect for a broad sample 
of stimuli. Six figure-ground patterns of the 
ambiguous shape type were used, recording 
not only fluctuation, but also the initial per- 
cept along with its latency and duration. 
These additional features are not only char- 
acteristics of figure-ground perception but 
are presumably as much a function of under- 
lying neurological mechanisms as fluctuation 
is thought to be, In addition to the parametric 
aspects of the study, predictions derived from 
Eysenck (1955, pp. 99, 104—105), the more 
specific of the two theories under discussion. 
This view holds that the 
figural response of introverts, in contrast to 


were also tested 


extraverts, would not only show fewer re- 


zog"; 6—Rubin figures.] 


versals, but the initial response should conse- 
quently be slower in latency and longer D. 
duration; the opposite findings would fulfill 
McDougall's theory. Further, it was expected 


that both theories would maintain that any 
effect would be consistent for all figure-ground 
patterns. Finally, although it is difficult t0 
specify each theory's expectation, introverts 
and extraverts might also differ from one alr 
other on which aspect of the figure would be 
initially seen. 


METHOD 


Subjects and Procedure 


Three groups of 20 
in each 


each, with 1 


subject females 


group, were selected from a pool of 1! 


ne basis for their assignment was 
yderate, or low (introvert) scores 
introversion scale of the Eysenck 
ry (Eysenck & Eysenck, 1968); 


ý percentiles > 81-99, 23-31, and 0-3, respec- 
“tively, based o ms for American college students 
“In order to obtair equal number of males in all 
groups, of wh: e were unexpectedly few eligi- 
ble, several females with extreme scores were ex- 
cluded. The six questions on the test relating to 


response falsification were also used, and led to the 
" rejection of a few additional eligible subjects. Six 
ambiguous black and white figure-ground patterns, 
and two samples, most of which were taken from 
Fisher (1968), were used (see Figure 1). Two 
“vase-face” iuli (2 and 3), their filled areas re- 
versed, were used in order to note the role of bright- 
mess differences on figure-ground perception (Lin- 
dauer & Lindauer, 1970) within the context of per- 
sonality. This was also the only pattern in which 
the black and white aspects were not only familiar 
- objects, but different in meaning. Black-white pat- 
terns, differing primarily in figural shape, were used, 
rather than line drawings varying in their near-far 
perspective (c. Necker cube), which require sub- 
jects to indicate where the object is. The former type 
of ambiguous figure more readily enables subjects to 
report what they saw on the initial view, and to ob- 
tain latency and duration measures for a more easily 
located stimulus. Stimuli were presented binocularly, 
- in one of six completely counterbalanced sequences 
assigned to the subjects in the order in which they 
arrived at the lab; in addition, each figure was 
preceded and followed by different patterns in all 
Sequences. The stimuli on 35-millimeter slides were 
projected (300 watt) in a well-lit room (six 40-watt 
fluorescent bulbs produced at least 75 footcandles of 
light) for 30 seconds each, following a 30-second 
intertrial interval, in which the screen was filled 
with white light. The size of the projected image 
varied somewhat with each stimulus, but just about 
filled a 254 X 37.5 centimeter area, centered 304.8 
centimeters in front of the subject (visual angle, 5 

degrees/45 minutes). Subjects were instructed to hold 
an index finger of each hand just above two keys 
labeled either “black” or "white" (placed left or 
right in a regular sequence for each subject) and to 
Dress and hold down the appropriate key to indicate 
their perceptual experience for the pattern. Their 
responses automatically recorded whether the black 
Or white portion of the figure was initially seen, 
how long before the initial view occurred (latency) 
and how long it lasted (duration), and the total 
number oí reversals. Subjects were instructed to 
"look straight ahead," with no special instructions 
regarding whether they should force ("reverse") or 
inhibit (“hold”) their perceptual response (ie. neu- 
tral or “passive” viewing). They received several 
Practice trials with the two sample stimuli for 1-2 
minutes in order to familiarize themselves with the 
figure-ground phenomenon and procedures for re- 
Porting. Six subjects were replaced (five were in the 
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8.50 
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7.50 
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HIGH MODERATE Low 
DEGREE OF EXTRAVERSION 


Fic. 2. Mean number oí fluctuations for highs (ex- 
traverts), moderates, and lows (introverts). 


control group) at the end of the study, following a 
questioning period, for failure to follow procedures. 


RESULTS 


Personality differences between high, mod- 
erate, and low extraverts in figure-ground per- 
ception were not found among either latency 
[F <1; Ms (in seconds) = 1.97, 1.88, and 
1.96, respectively] or duration measures [F — 
1.55, df — 2/57, p > .05; Ms (in seconds) — 
3.12, 3.15, and 4.19, respectively], but were 
shown only by fluctuation (F = 3.24, df = 
2/57, p « 05). A Neuman-Keuls analysis 
(MS, = 90.98) between the individual means 
(Figure 2) indicated that the differences be- 
tween lows, or introverts (M = 7.78, and 
both moderates (M = 10.81), and highs, or 
extraverts (M = 9.98), were significant (p < 
.05), but that extraverts and moderates did 
not significantly differ (p> .05) from one 
another. Reversals did not vary as a function 
of the specific figure-ground pattern used (F 
< 1). This absence of an interaction between 
personality and pattern was also the case for 
both latency (F = 1.32, df= 10/285, p> 
05) and duration (F < 1). The initial portion 
of the figure which was seen was generally 
the black fill (79% of the responses), which 
was true for each of the figures, assuming a 
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Fic. 3. Latency of initial response to figure-ground patterns. 


50-50 distribution of black and white re- 
sponses (x*> 11.27, df=1, p< .01). This 
reaction was found within each personality 
category (x° > 26.13, df=1, p< 01) with- 
out varying as a function of the specific fig- 
ures used (x? < 5.80, df = 2, P > .05) or for 
all figures combined (x? = 1.19, df = 2, p> 
.05). Irrespective of personality considera- 
tions, the figure-ground patterns did differ 
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onse 


df= 


from one another on each of the three 
measures (Fs = 3.44, 2.52, and 11.7 
5/285, p < 05) for latency, duration, and 


fluctuation measures, respectively (Figures 
3-5). Individual analyses of the differences 
between the means based on MS, = 11, 7.51, 


and 6.30, for latency, duration, and fiuctua- 
tion, respectively, indicated that only figures 
with extreme values significantly differed (p 
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£.05) from one another on measures of 


(Stimui 2 and 4) and duration 

5 i 2 and 6), while most comparisons 
mong fluctuation responses (10 of 15) signifi- 
intly differed from one another, With respect 
the latter measure, no significant differ- 
‘ences (p> | found between Stimuli 
1, 4, and 5; ; 2 and 3; and 3 and 5. 
An examinat of the Pearson correlations 


ires on each of the three re- 
evealed a good deal of simi- 
responses of each of the three 
ities (Table 1). Among 135 


between the 

“sponse meas: 

- arity among t 
types of per 


comparisons were significantly positive 
(P< .05), with only 3 of the remainder neg- 
‘ative (M stude = .58). Although the 


"highest correlations tended to be among the 
(M magnitude — .65, of which 
ie correlations were significant), 
and introverts were not much 
Tower (M magnitudes = .56 and .53, with 
64% and 67% of the correlations significant, 
Fespectively ). se summary findings were 
Consistently id for each of the figure- 
ground patte whether examined in terms 
T Of each personality type or for all personali- 
ties combined. Observation of the correla- 
the three response measures, 


tions between 
involving 18 comparisons between six figures 
i and three personalities, indicated that fluctu- 
ation values were significantly (p< .05), 
Negatively correlated with duration in all 
Cases (M magnitude = —.57) and negatively 
Correlated with latency in all but 1 of the 
© Comparisons (M magnitude = —.29), of 
Which only 4 were significant. Latency and 
duration were positively correlated with each 
Mother in 15 cases (M magnitude = .30), but 
“only 3 were significant (p < .05). 


É 


DISCUSSION 


i The introversion-extraversion dimension of 
- Personality influenced only the number of 
figure-ground reversals, a measure whose dis- 
tinction from other indexes was also indicated 
by its negative correlation with most latency 
3nd duration responses. This personality ef- 
Vect occurred after the initial view, since dif- 
erences between types of personalities were 
hot found on what was seen first, its latency 
E duration. The finding of more similarities 
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Fic, 5. Mean number of fluctuations for 
figure-ground patterns, 


than differences between personality types 
was also reflected by the absence of individual 
differences, as revealed by the generally high 
positive intercorrelations between figures on 
each of the measures. The consistency of the 
figural response could be taken as support for 
a unitary process underlying figure-ground 
perception (Hochberg, 1950; Köhler & Wal- 
lach, 1944). 

Although the fluctuation data imply the 
necessity for obtaining an ongoing record of 
reversals in order to assess its progress over 
time, the reliance that previous investigators 
have placed on this measure is apparently 
justified. In addition, its usefulness has also 
been shown in expressing the figure-ground's 


TABLE 1 


MEAN MAGNITUDE OF PEARSON CORRELATIONS FOR 
EACH PERSONALITY AND MEASURE (AND PERCENTAGE 
OF CORRELATIONS SIGNIFICANT, $ <.05) 


Personality 
Measure Combined 
Extravert | Moderate | Introvert 
Latency .65(100) | .52(53) | .68(100) | .62(84) 
Duration .61(73) .47(60) | .22(20) | .43(51) 
Fluctuation | .69(100) | .69(80) | .68(80) | .69(87) 
.56(64) | .53(67) | -58(74) 


Combined} .65(91) | 
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relationship to other perceptual phenomenon, 
such as the discovery of the hidden figure 
(Newbigging, 1954). Also indicative of the 
reversal measure’s meaningfulness is that it 
has been tied to figural aftereffects and per- 
sonality, including field dependence (Haro- 
nian & Sugerman, 1966; Immergluck, 1966; 
Kidd & Cherymism, 1965; Rudin, 1968) and 
interpersonal perception (Newbigging, 1953). 
Although these studies have usually relied 
upon a few figures, generally of the perspec- 
tive reversal type, the absence of an interac- 
tion effect between personality and figure in 
the present study suggests a stability of figure- 
ground, irrespective of type. However, the 
present data were based on ambiguous figures 
only, and did indicate that several figures 
differed in fluctuation and, to a lesser degree, 
latency and duration. Thus, some caution in 
generalizing to other types of patterns may be 
in order. In this connection, while the face- 
vase Stimuli 2 and 3 led to proximal stimula- 
tion equivalent in shape but not in brightness, 
they were responded to differentially on sev- 
eral measures. This again indicates the role of 
brightness in figure-ground perception (Lin- 
dauer & Lindauer, 1970) and the phenomeno- 
logical independence of alternate figure- 
ground percepts from one another. 

"The specific effect of personality was found 
only in terms of a low fluctuation rate for 
introverts, with both extraverts and moderates 
equally showing a higher number of reversals. 
This difference means that it is only the intro- 
version trait which is critically related to 
figure-ground perception. Because the main 
purpose of the study was to establish the gen- 
erality and the direction of an effect, it is 
difficult to speculate on the reasons for this 
surprising finding. However, the results do 
suggest that it is the input or information- 
processing side of perception on which intro- 
verts differ. The irrelevance of output varia- 
bles, such as response bias or reaction time, 
is indicated by the equivalence between the 
three personality groups on their initial la- 
tency and duration responses, although this is 
not to deny that differences could have oc- 
curred subsequently. These similarities, as well 
as the absence of a difference between the 
groups as to which aspect of the figure was 
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seen first, also suggest that the results for 
introverts were not due to cognitive diTerences 
in expectancy or in understanding the instruc- 
tions or the requirements of the task. In rela- 
tion to the two opposing theories which 
these findings refer, neither w »orted. 
Although the data for introverts provide par- 
tial support for Eysenck, the simi'rity be- 
tween extraverts and moderates co radicts 
his other main prediction regarding ‘uctua- 
tion. This emphasizes the necessity i con- 
trol group in comparing extremes, cially 
in light of its frequent neglect in earlier work, 
which assumed that opposite pe iities 
would lead to opposite perceptus! effects. 
Furthermore, the assumption of a co. inuum 
on which moderates are in an inte mediate 
position does not seem to be upheld by the 
data. Yet the nature of ambiverts, whether 
truly distinct from both introverts and extra- 
verts or composed of a mixture of each type, 
remains unclear, and perhaps should )escon- 


sidered in evaluating the meaning oi ‘he con- 
trol findings. These questions indic: a that | 
additional clarification of the rel: $ 
among the three personality types is rai 
prior to further psych 7 theorizing, and 
that neurological spec tions at this point | 
are, of necessity, most likely premature. | 
The present desi resents some other am- 
biguities, Although tl 5 no reason to expect 
sex differences to. play a role (Carrig 
1960), the sample | was co femi 
sin ly. ‘indicate that even int 
les, perhaps because of oth 
needs (e.g, service), were more willing to 
volunteer than extraverted males. In addition, 
introverts had more extreme scores, with à 
narrower range of vate than the other two | 
types. As Shapiro and Alexander (1969) have 
pointed out, results in this area may differ, 
depending upon whether introverts and extra- 
verts are within the normal range or are net- 
rotic. No such data are available for this 
sample. In general, these ambiguities regard- 
ing the role of personality in figure-ground 
perception suggest further cautions in inter- 
preting those studies which have related intro- 
version-extraversion to other perceptual phe- 
nomena such as field dependence (Young: 
1959), constancy (Ardis & Fraser, 1957), a 


“(Howerth, 1969a, 1969b), sensory 
is, 1969; Rossi & Solomon, 
perception (Lynn, 1961). 
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A study was designed to investigate the conditions under which women are 
prejudiced against women. Paintings were shown to 120 college women for 
evaluation. Half of the subjects thought that the artist was female, half thought 
that the artist was male; half thought that the painting was an entry in a 
contest, half thought that it was a winner. Some questionnaire data showed 
that women judged the entry paintings by men to be significantly better than 
the identical paintings by women. Winning paintings were not evaluated dií- 
ferently depending on sex. Obstacles faced by winners or entrants had no effect. 
It was concluded that women who are attempting to accomplish are judged 
less favorably than men, but that women who have successfully accomplished 
work are evaluated as favorably, as are men. 


One explanation for the apparent failure 
of women to achieve as much success as men 
is prejudicial evaluations of their work by 
men (cf. Klein, 1950; Scheinfeld, 1944). If 
men undervalue the accomplishments of 
women, women also may do so. Women’s 
misjudgment of themselves should contribute 
to an actual lack of achievement. If women 
devalue their own and each other’s work, they 
should be less willing to try to achieve and 
less supportive of their fellow women’s ef- 
forts. The present study investigates the con- 
ditions under which women devalue female 
performance. 

Goldberg (1968) designed a study to in- 
vestigate prejudice among women toward 
women in the areas of intellectual and pro- 
fessional competence. College women were 
asked to evaluate supposedly published jour- 
nal articles on linguistics, law, art history, 
dietetics, education, and city planning; for 
each article, half of the subjects saw a male 
author's name and half saw a female author's 
name, The results confirmed the hypothesis 


1 This study was supported, in part, by funds 
Írom National Science Foundation Grant GS 27292 
to the second author. 
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that college women value the professional 
work of men more highly than the identical 
work of women. Women devalued ‘smale 
work for no other reason than the female name 
associated with the article. Sensitivity to the 
sex of the author served to distort judgment 
and thereby prejudice women ag: the 
work of other women. 

Using the identical procedure, Pheterson 
(1969) explored prejudice against women 
among middle-aged, uneducated women. The 
professional articles were on marriage, child 
discipline, and special education. The results 
did not support the findings of Goldberg. 
Women judged female work to be equal to 
male work; in fact, evaluations were almost 
significantly more favorable for female work 
than for male work. 

The differing results of Goldberg and Phet- 
erson were perhaps due to the different sub- 
jects used, to the different articles, or to some 
combination of the two. One plausible ex- 
planation might be that the printed articles 
had different significance for the two sets of 
subjects, College women see the printed word 
frequently, are taught to be critical, and may 
take the publication of a paper relatively 
lightly. They might have viewed the articles 
simply as vehicles for presenting ideas or pro- 
posals. Uneducated women, on the other hand, 


xe publication or, even, writing 
big accomplishment in itself, 
specific ideas presented, Per- 
; judge women less favorably 
; evaluating their proposals or 
yk because men are more likely 
That is, given a piece of work 
rtain status, the man's, rather 
vs, is more likely in our so- 
uly be successful. On the other 
may judge the recognized ac- 
or already successful work of 
qual to or even better than the 
men. Success is less common 
j . A contrast effect may cause people 
to overvaluc achievement when they expect 
me. Also, women may overvalue female 
mplish:: because they assume that 
e eater obstacles to success and 
efore must exert more energy, display 
compeience, or make more sacrifices 
n men. 
The present study was designed to in- 
estigate the divergent results of. Goldberg 
"heterson (1969) and, further, to 
iously presented arguments. 
ked to judge paintings created 
y men and women. Some paintings repre- 
Sented attempts to accomplish, that is, were 
entries in art competitions. Other paintings 
represented actual accomplishments—they had 
already won prizes. The first hypothesis was 
that women will evaluate male attempts to 
- accomplish more highly than female attempts. 
- The second hypothesis was that women will 
evaluate female accomplishments as equal to 
_ Or better than male accomplishments. 
. The above hypotheses suggest that people 
- judge successful persons more highly when 
they have more odds against them (as 
- Women presumably do.) Thus, a woman's 
accomplishment might be praised more than 
-.& man's accomplishment because women face 
greater obstacles. Our culture shows great 
admiration for the achievements of the handi- 
= Capped or underprivileged (Allport, 1958). 
A third hypothesis was formulated to explore 
is admiration and its influence on female 
Judgments. It stated that women will evaluate 
the accomplishments of people with personal 
- Odds against them more favorably than the 
- accomplishments of people without such odds. 
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Subjects 


The subjects were 120 freshmen and sophomore 
women students at Connecticut College. College — 
women were used to permit a replication of the —— 
Goldberg (1968) study within the experimental de- —— 
sign. They volunteered in student dormitories for 
immediate participation in the 15-minute task. UN 


Experimental Design 


There were three experimental manipulations con- 
stituting a 2X 2X2 design. Eight paintings were 
presented to small groups of subjects for evaluation, 
The sex of the artist, the status of the painting, and 
the personal odds faced by the artist were manipu- - 
lated, such that for each painting half of the subjects - 
thought that it had been created by a male artist, - 
and half thought that it had been created by a female 
artist; half thought it was a prize-winning painting, 
and half thought it was just an entry in a show; 
half thought the artist had faced unusually severe - 
obstacles, and half thought the artist had faced no 
unusual obstacles. Each subject participated in each - 
experimental condition, evaluating all eight paintings 
sequentially. The identity of each painting was 
counterbalanced among subjects, so that all condi- — — 
tions were represented for each painting. 


Procedure 


Subjects were seated in a room equipped with a 
slide projector and screen, Each subject was given a 
booklet and was told to read the directions: 


Slides of eight paintings will be shown in con- 
junction with brief biographical sketches of the 
artists, After viewing the slide, turn the page 
and answer five evaluative questions about the 
painting. No personal information about your 
identity, talents, or tastes is required. This is a 
study of the artistic judgments of college stu- 
dents. 


The subjects were then instructed to read the 
first artist sketch, inspect the projected painting, 
turn the page and answer the appropriate questions, 
and then proceed in the same manner for each of 
the eight slides. i 

Fight slides of unfamiliar modern art paintings 
were used. To accompany them, fictitious artist pro- 
files were composed to include the eight experimental 
conditions. These profiles appeared in the booklets in 
different orders for the different subjects. Half of the 
profiles described a female artist, and half described 
a male, Their age, residence, and occupations were 
briefly described (identical for male or female). For 
example, “Bob (Barbara) Soulman, born in 1941 in 
Cleveland, Ohio, teaches English in a progressive 
program of adult education. Painting is his (her) 
hobby and most creative pastime.” Cross-cutting the 
sex manipulation, half of the profiles described the 
painting as a contest entry (c.g, “She has entered 
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TABLE 1 
MEAN COMPETENCE RATINGS OF MALE AND FEMALE 
ARTISTS WITH WINNING OR ENTRY PAINTINGS 


Sex of artist 
Status of painting = 
Male Female 
Winner 3.483 3.483 
Entry 3.562 3.354 


this painting in a museum-sponsored young artists’ 
contest”), and half described it as a recognized 
winner (eg, “This painting is the winner of the 
Annual Cleveland Color Competition"). In a third 
manipulation, halí of the profiles described the 
painter as having had obstacles to success (e.g., “An 
arm amputee since 1967, he has been amazingly 
productive as as artist"), 

After each slide, the subjects turned a booklet page. 
Five questions asked the subjects to evaluate the 
paintings on a scale of 1-5, with higher ratings 
representing more favorable evaluations, After every 
slide, the following questions were posed: (a) Judg- 
ing from this painting, how technically competent 
would you judge Mr. (or Miss) to be? 
(b) How creative would you judge Mr. (Miss) 
——————— to be? (c) What rating would you 
give to Mr. (or Mis) .— . — . for the overall 
quality and content of his (her) painting? (d) What 
emotional impact has Mr. (or Miss). — in- 
stilled in his painting? (e) Judging from this paint- 
ing, what prediction would you make for the artistic 
future of Mr. (or Miss) :; 

After all eight slides were shown, the study 
was explained, and the subjects were asked not to 
discuss it. 


RESULTS 


The questionnaire data were analyzed using 
four-way analyses of variance, with three ex- 
perimental conditions and subjects as the 
fourth factor. Three questions asked the sub- 
jects to evaluate the artists; these were as- 
sumed to be directly relevant to the hy- 
potheses. 

The first question, technical competence, 
revealed an overall rating of the male artists 
as significantly superior to the female artists 
(F = 3.99, df = 1/119, p <.05). There was 
a significant Sex X Painting Status interaction 
(F = 5.42, df = 1/119, p < .05). Inspection 
of the mean ratings of males and females 
under winner and entry conditions indicates 
that the main effect of male superiority was 
attributable to the entry condition and showed 
no differences in the winner condition, Means 
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TABLE 2 
MEAN RATINGS OF ARTISTIC FUTU \LE AND 
FEMALE ARTISTS WiTH WINN 
ENTRY PAINTINGS 
Sex of a 
Status of painting |—— aes — 
Male | nale 
Winner 2.970 87 
Entry 3.062 | 312 
in the entry condition differed signi^-antly in 
favor of men (7 = 1.99, p < .05); means in 
the winner condition were identica! Table 
1). All other main effects and actions 
were not significant. 

The question concerning the artistic future 
of the artist produced results paraiiciing the 
competence data (see Table 2). T: was à 
significant Sex X Painting Status in ecraction 


(F = 4.52, df = 1/119, p < .05). Males were 
evaluated significantly more favors)! than 
females for their entry paintings (; = 1.92, 


p< .06). Evaluations did not differ signifi- 
cantly for the winning paintings, although 
evaluations tended to favor the female winners, 

A third question, asking about the artist’s 
creativity, yielded no significant differences. 
(Intuitively, these data are not s si 
given the ambiguity of the term “cr: 
Also, “creativity” has some feminine connota- 
tions which judges may not wish to attribute 
to men, even when they believe the men are 
better artists.) In addition, the subjects 
evaluated the paintings themselves, equally 
among conditions (their quality and their 
emotional impact). Bias apparently was di- 
rected toward the performer, rather than 
toward his or her work. 

The data presented above support our first 
and second hypothesis. Women yalue male 
work more highly than female work when it 
is only an attempt or entry; however, this 
bias dissipates when the work advances from 
entry to winner. The third hypothesis con- 
cerning the odds condition was not confirmed; 
there were no significant differences among 
the odds conditions. 


Discussion 


Some professional women have claimed that 
their work is evaluated by men less well than 


be if they were men (e.g., Klein, 
. The recent data of Goldberg (1968) 
id Phe ;1 (1969) have added a new 
dimensio: he attitudinal factors inhibiting 
— female suc Under certain conditions, even 


women are prejudiced against the perform- 
ance of í women. The present study in- 
vestigate: » aspect of this prejudice. Women 
evaluated ale entries in a contest less 
favorably ‘han identical male entries, but 
female equally to identical male 
winners. 

The in ations of this finding are far- 
reaching. fic work of women in competition 
is devalued by other women. Even work that 
is equivalent to the work of a man will be 
judged inferior until it receives special dis- 
tinction, end that distinction is difficult to 

hieve «hen judgment is biased against the 

work in competition. According to the present 


data and those of Goldberg, women cannot 
expect ur d evaluations until they prove 
themselves by award, trophy, or other obvious 
success. Obvious success is perceived differ- 
ently by some groups than by others. The 
present research was based on the speculation 
that uneducated, middle-aged women per- 
ceived published articles as signs of obvious 
Success, whereas college women perceived such 
work simply as a presentation of ideas. Women 
were prejudiced against female ideas but not 
against female success. The manipulation of 
entry and winner in this study permitted con- 
trolled examination and confirmation of that 
Speculation. 

A question might be raised regarding the 
Strength of the present findings. Of five ques- 
tionnaire items, only two supported our hy- 


__ pothesis. These were the first question (tech- 


nical competence of the artist) and the fifth 
(the artist's future). However, a priori rea- 
soning would suggest that these were the very 
questions where one would expect bias against 
Women to occur. As mentioned earlier, crea- 
tivity is ambiguous and may have feminine 
connotations. The paintings themselves were 
abstract, unknown, and also difficult to judge 
9n the dimensions covered (quality and emo- 
tional impact). If people are expecting men 
to perform better than women, they should 
have the strongest expectations about tasks 
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on which society has already labeled men as 
superior. In everyday life, many professional 
men are regarded as technically competent and 
are successful; we see fewer women in these 
positions (girls are not raised to be engi- 
neers or business executives). Thus, the sub- 
jects might simply be described as reflecting 
attitudes in society at large. They assumed the 
men to be more competent and predicted a 
more successful future for men unless there 
was evidence to the contrary, that is, that the 
women had, in fact, succeeded. The subjects 
probably did not have very strong convictions 
about whether men are more creative than 
women (husbands usually leave such crea- 
tive tasks as home decorating and sewing to 
their wives). The quality and emotional im- 
pact of an abstract painting is also unlikely 
to have aroused strong attitudes favoring men. 
We argue, then, that our questionnaire data 
reflected the differing expectations which 
women (or men) have about men and women. 
That is, a woman will probably be less com- 
petent and her accomplishments fewer than a 
man, although she may be as creative (but 
probably not in science or business) and 
certainly as “emotional.” Such an analysis 
implies that the subjects were not really 
judging the artists or paintings at all, but 
were simply expressing attitudes they held 
prior to the study. This, of course, was our 
purpose. 

The third hypothesis, which predicted 
evaluations of paintings by people with odds 
against them to be more favorable in the 
winner condition than evaluations of identical 
paintings by people without odds, was not 
supported by the data. It is possible that the 
odds manipulation was too obvious. Perhaps 
some subjects were immediately aware of their 
special admiration for achievers with odds and 
therefore controlled their responses or under- 
rated them, thus masking any positive bias 
the odds may have instilled. Informal sub- 
jects’ feedback after the task supports this 
explanation. No subject suspected the im- 
portance of artist sex differences; however, 
many subjects reported the suspicion that 
they were expected to overvalue paintings of 
the handicapped or underprivileged. This 
suspicion may have caused a reaction which 
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obscured prejudicial responses. Remaining to 
be demonstrated, then, is the hypothesis that 
obstacles make successes seem greater. 

Why do women devalue each others’ per- 
formance? If one accepts women as a group 
which has important similarities to minority 
groups in our society, the answer is obvious. 
The members of minority groups, and women, 
have less power and fewer opportunities than 
do the dominant group, white Anglo-Saxon 
males. Self-defeating as it is, groups feeling 
themselves to be the target of prejudice never- 
theless tend to accept the attitudes of the 
dominant majority. This process has also 
been called identification with the aggressor 
(Allport, 1958). Women, then, when con- 
fronted with another woman who is trying to 
succeed in some endeavor, will assume that 
she is less motivated, less expert, or simply 
less favored by others than a man would (all 
these assumptions may be perfectly true). 
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Our data suggest that women do not de- 
value another woman when she has attained 
success. Without evidence, we think men do 
not either. In fact, a woman who has suc- 
ceeded may be overevaluated. The present 
study apparently did not afford a proper test 
of this hypothesis. Perhaps if the artists had 
been identified as famous and really superior, 
women would have been rated: more highly 
than men. 
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is study investigated whether cognitive 


ent using the Thurstone method of 


style contributes to item displace- 
equal-appearing intervals. Two hundred 


id sixty-three judges rated 121 statements on each of three issues (commu- 


», liquor by the drink, and television), 


n highly intense to neutral. The 
exity and dogmatism. For cognitive 
nificant. Dogmatism was relatively 


nitive style, independent of any 


representing a gradient of affectivity 
predictor variables were cognitive com- 
complexity, the overall results were not 
independent of affectivity, and highly 


atic subjects consistently displaced more items into the extreme categories 
all three issues, including the neutral issue. These results suggest that 


affectivity dimension, may contribute to 


m displacement and to distortion of social judgment. 


Early Hes in attitude were focused 
largely on noting frequencies of various at- 
titudes, with little interest in the underlying 
psychological processes. Thurstone (1929) 
pioneered the systematic study of attitudes by 
providing scales for assessing attitudinal re- 
sponses, Fundamental to this approach is the 
assumption that the categorical judgments 
used in developing the scale are independent 
of the subjects’ own attitudes (Thurstone & 
Chave, 16 A test of this assumption was 
made by Hinckley (1932). His results com- 
pletely justiñed the assumption, since scale 
values, regardless of bias, were highly cor- 
related (7 = .98). 

Meanwhile, attitude and ego involvement 
were shown to produce systematic variation in 
judgments (e.g., Lambert, Solomon, & Wat- 
son, 1949; Proshansky & Murphy, 1942; Taj- 
fel, 1957). These various experiments indicate 
that social judgment is affected by attitude 
and ego involvement, creating an inconsis- 
tency in results. Specifically, how would it be 
possible to reconcile the findings of Hinckley 
(1932), Pintner and Forlano (1937), and 
Eysenck and Crown (1949), who found that 
ego involvement did not affect judgment in 
developing scale values for Thurstone scales, 


_* This study is based in part on a doctoral disserta- 
tion submitted to Brigham Young University, 1969. 
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Hardy and other members of his committee for as- 
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with the findings which showed that value, 
attitude, and ego involvement did cause dis- 
placement of judgment in several dimensions? 
(Displacement may be considered as a polar- 
ization of responses to social stimuli, which 
in Thurstone scaling procedures is reflected 
by the subject placing an inordinate number 
of attitudinal items in the extreme categories.) 
The aforementioned inconsistency led Hov- 
land and Sherif (1952) to replicate Hinck- 
ley’s study, this time taking care to insure the 
inclusion of highly ego-involved subjects. The 
results of this experiment supported the 
postulate that subjects holding extreme at- 
titudes concentrated the statements into fewer 
categories, with the bunching tending to occur 
in the extreme categories. 

While affectivity has been demonstrated to 
be a variable creating displacement in social 
judgment, the claim (Sherif & Hovland, 1953) 
for an exclusive role for affectivity as in- 
fluencing displacement may be unwarranted. 
The obtained differences in item placement 
(Hovland & Sherif, 1952) could partially be 
explained by cognitive factors, especially the 
ability to differentiate the items. Granneberg 
(1955) attributes displacement to a rather 
complex interaction between attitude and in- 
tellectual factors. One cognitive factor relevant 
to social judgment is cognitive complexity, 
which may be defined as “the capacity to 
construe social behavior in a multi-dimen- 
sional way [Bieri, 1966, p. 185].” The Thur- 
stone procedure for developing scales presents 
the judge with a multivaried task, permitting 
us to relate differences in cognitive structures 
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to differences in the judgmental task. That 
cognitively complex subjects can differentiate, 
anticipate, and react more appropriately to 
their environment has been supported in a 
number of studies (e.g., Bieri, 1955, 1961; 
Leventhal, 1957; Tripodi & Bieri, 1964). 
These studies as a whole indicate that more 
complex judges as a group are capable of 
better discrimination among incongruent 
stimuli than less complex judges. 

Rokeach (1960) made a major contribution 
to the study of authoritarianism and cognitive 
processes by his Dogmatism Scale, which 
measures a relative content-free variable and 
thus provides a general assessment of au- 
thoritarianism and cognitive rigidity, free from 
ideological bias. The relationship between au- 
thoritarianism and social judgment has been 
documented in a number of recent studies 
(e.g., Bookbinder, 1963; Kaplan & Singer, 
1963; Powell, 1962). These studies tend to 
support the view that authoritarian subjects 
are less well-informed, more sure of their in- 
formation, and less able to differentiate on 
both perceptual and sensory tasks. 

The aforementioned studies suggest that 
relative cognitive complexity and dogmatism 
may affect a person’s ability to discriminate 
attitudinal statements. Subjects who are low 
in cognitive complexity and subjects who are 
high in dogmatism may rely more on percep- 
tual anchors (e.g., the end points on the 
Thurstone scale). In other words, such sub- 
jects develop a cognitive style, characterized 
by dependency on perceptual anchors, which 
in turn may produce item displacement to- 
ward the extreme categories on the Thurstone 
scale. 

One important problem is the extent to 
which such cognitive style may or may not be 
dependent on the affectivity of the issue. A 
post hoc control would be to calculate the cor- 
relation between the affectivity of the issue 
and the operational definition of cognitive 
style (cognitive complexity and dogmatism 
measures). If the correlation is nonsignificant, 
the two variables are independent. Another 
approach would be to choose issues which a 
priori differed in affectivity, and one which 

was relatively neutral to most subjects. If 
cognitive style is an efficacious predictor, cog- 
nitive complexity and dogmatism should dif- 
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ferentiate subjects’ item displaceme 
an ordinal gradient in affectivity of iss 
cluding the neutral. An essential cond iti 
the test of the cognitive style hy; 
a sample which would respond with va 


affectivity to several diverse issues. Brigha 
Young University students met the »ocessary 
requirements for the issues chosen ‘or this. 
study. 
METHOD q 
The three issues were communism, liq by the 
drink, and television, The issue of comr m was 
thought to be central to the value sys! of the 
average student attending Brigham Young University. - 
This issue should be highly loaded on a(fectivity 
since communism is officially opposed by the Morm 


Church, to which 98% of Brigham Young U versity 


students belong, thus adding a religious factor to th 
already prevalent anticommunist norms oí the larger 
society. The liquor-by-the-drink issue wes a local” 


state issue on control of the sale of liquor and ¥ 
thought to be intermediate in affectivity. The Mor. 
mon Church has opposed liquor by the drink on & 
variety of grounds, but certainly not as nsely as 
communism. In addition, the larger socie! 
(at least outside Utah) fail to support thc ^ 
Church's position. This, along with the fc 
some respected elements in the state favor 
has reduced the emotional effect of the i 

what, leaving it in an intermediate cate; 

vision was thought to be least in affectivity e 
there was no obvious support or opposition from 
norm-creating groups of the larger society. | 


The Sample 


The sample consisted of 263 Brigham Young Uni-- 
versity undergraduate students. Undergraduate stU- ^ 
dents at Brigham Young University adhere as 4 
whole to the norms of the sponsoring institution of. 
the University, the Church of Jesus Christ of Latter” 
Day Saints, known colloquially as the Mormon 
Church. Larsen and Schwendiman (1968) indica! 
that the students generally came from an activi 
Church background, with 7296 characterizing th 
family as either “very active" or “active.” This nom 
background formed the basis for the choice of 
three issues as objects for social judgment. 


Experimental Variables 


Bieri (1966) proposed a modified group version OF 
the Role Construct Repertory Test (Kelly, 1955) 
which was used in the present study. Dogmatismi 
was measured by administering Form E (Short 
Form) of Rokeach's Dogmatism Scale (Rokeachy 
1960) . d 


Procedure 


un 


The subjects were requested to fill out the 
questionnaire containing the Cognitive Complexib 
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and the Dogmatism Scale (Ro- 
ese data were collected during the 
side for this purpose, Times were 
the completion of the second phase, 


Test (Bieri, 
keach, 196C } 


at least 1 w 


fter the original testing. 

The secor e consisted of the collection of 
data concerning each person's affectivity toward each 
of the thre . and the placement of 121 state- 
ments on itudinal object into 11 categories 
based on t thod of equal-appearing intervals 
(Thurstone ). This testing took place in groups 
from abou ( in an average-sized classroom 
with large jach subject picked up an envelope 
for each of three issues containing 121 state- 
ments, desig; represent the range from extremely 
favorable tremely unfavorable toward the atti- 
tudinal o! The experimenter then read the fol- 
lowing genc nstructions for the affectivity scales: 
“We would to know how you feel about three 
issues of ! concern. Please feel assured that the 
best answ each item is your answer." Having 
completed .Hectivity scales in haphazard order, 
the subject e then read the instructions for social 
judgment, ired by the Thurstone procedure. 
The experim« then asked the subjects to proceed 


task, judging the issues in any 
haphazard order. The experimenter 
hat he would be available for any 


personal clarification of the instructions. When the 
subjects hac completed the judging task, they were 
asked to tr er the number of statements placed 


into each category to a sheet provided for that 
purpose. T ge time spent by each subject on 
the judging was 2 hours. 

Displacem was calculated by subtracting the 
expected frequency (11) from the observed frequency 
of statem placed into each of the extreme three 
categories at both the favorable and unfavorable ends 


of the continuum. This score was then added across 
the six categories. 


Operational Hypotheses 


1. To test whether cognitive complexity is pre- 
dictive of displacement, the sample was divided into 
equal high, middle, and low thirds on cognitive com- 
plexity. A two-way analysis of variance comparing 
the three groups on displacement scores across the 
three issues was performed. The prediction was that 
low-cognitive-complex subjects would displace more 
statements on each issue. 

2. To test whether dogmatism is predictive of dis- 
placement, the sample was divided into equal high, 
middle, and low thirds on dogmatism. A two-way 
analysis of variance comparing the three groups on 
displacement scores across the three issues was per- 
formed, The prediction was that subjects high in doki 
matism would displace more statements on each issue. 


RESULTS AND DISCUSSION 


The sample mean scores on affectivity to- 
ward the issues showed a mean score of 16.35 
for communism, 13.01 for liquor by the drink, 


` 


and 10.57 for television. Since the possible 
mean values ranged from 5 to 20, with a score 
of 12 expressing a uniform “don’t know” 
response and lower scores indicating disagree- 
ment that the issue was important to the sub- 
jects, it seems apparent that the television 
issue was not an important one and bordered 
on neutrality for most subjects. 


Affectivity, Cognitive Complexity, and 
Dogmatism 


One possibility is that cognitive complexity 
and dogmatism may be confounded by affec- 
tivity, and therefore any displacement attrib- 
uted to the latter predictors may in part be 
due to an affectivity component in each. Small 
but significant correlations occurred only be- 
tween cognitive complexity and affectivity 
toward television (N = 263, r = .11, p € .05) 
and dogmatism and affectivity toward liquor 
by the drink (W = 263, r= 24, p < .05). 
Even the significant correlations were so low 
that it is evident that affectivity, cognitive 
complexity, and dogmatism were relatively 
independent factors. 


Cognitive Complexity and Displacement 


The analysis carried out to test the first 
hypothesis failed to lend any support to the 
effect of cognitive complexity on social judg- 
ment, The F value for cognitive complexity 
was 1.57 (p > .05). 


Dogmatism and Displacement 


Table 1 shows the results for a two-way 
analysis of variance between groups on dog- 
matism and issues, analyzing for differences 
in displacement. 

There were highly significant mean differ- 
ences for both groups and issues. Inspection 
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TABLE 1 
ANALYSIS OF VARIANCE IN DISPLACEMENT SCORES 
BETWEEN Groups VARYING ON DOGMATISM 
AND ISSUES 
Source df SS MS F 
Dogmatism (A) 2 36.25 | 18.12 | 4.25* 
Tssues (B) 2 25.33 | 12.66 | 5.61** 
AXB 4 9.03 | 2.26 | .53 
Error 777 | 289,536.92 | 4.27 
* p < 025. 
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Fic. 1. Mean number of items placed into 11 
Thurstone categories by top and bottom thirds on 
dogmatism on the communism issue. 


of the mean values shows a linear pattern, 
with greatest displacement found in the high- 
dogmatism-communism cell and least dis- 
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Fic. 2. Mean number of items placed into 11 
Thurstone categories by top and bottom thirds on 
dogmatism on the liquor-by-the-drink issue, 


ke os Ge? 1 y 
Fic. 3. Mean number of items placed into 11 
Thurstone categories by top and bottom thirds on 


dogmatism on the television issue. 


placement in the low-dogmatism~‘elevision 

cell. These results lend clear support to the 

second hypothesis. 

Figures 1, 2, and 3 show the mean number 
of items placed into the 11 categories by sub- 
jects high and low on dogmatism across the 
three issues. The results are quite clear. Highly 
dogmatic subjects placed significantly more 
items into the extreme categories across all 
three issues, These results show that dogma- 
tism may predict item displacement indepen- 
dently of affectivity in a Thurstone-type judg- 
ment task. 
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NEED FOR STIMULATION AS A FACTOR IN 
SOCIAL BEHAVIOR 


STEPHEN M. SALES ! 


University of Michigan 


Recent experiments suggested that individuals classified as "reducers" on the 
kinesthetic aftereffects task are particularly likely to seek out and respond 
favorably to complex, interesting, and intense stimulus situations. Conversely, 
these studies indicated that individuals classified as "augmenters" by this pro- 
cedure are particularly likely to seek out and respond favorably to quiet, dul! 
and simple stimulus situations. These differential tendencies appear to stem 
from differences among individuals in a putative need for stimulation. The 
present paper sketches a theory which would account for these data, 
and it describes four experiments which investigated various ramifications of 
this theory. The studies indicated that individuals high in measured need for 
stimulation (a) react favorably to interesting stimuli, (5) attend closely to 
complex verbal communications, (c) exhibit high levels of activity in a de 
prived stimulus situation, and (d) show high levels of talking in a group 


discussion. 


Informal speech clearly distinguishes varia- 
tions in individuals’ tendencies to seek out 
and enjoy, and presumably need, complex, 
novel, interesting, and exciting stimuli. Some 
people are known as “swingers”; others are 
considered *dullards" and bores. This in- 
formal distinction appears to provide a fruit- 
ful area for research. However, relatively few 
investigations have been directed toward this 
topic, possibly (a) because a meaningful 
measure of a situation's “stimulus value" has 
yet to be devised and (5) because no measure 
of individuals! needs for stimulation has been 
widely accepted. 

Nevertheless, several recent investigations 
do seem relevant to this issue. The research 
workers involved in these studies have em- 
ployed the kinesthetic aftereffects task as their 
measure of “stimulus-seeking tendencies" (and 
presumably of the needs for stimulation which 
underlie these tendencies), and they have re- 
ported substantial relationships between this 
measure and various forms of stimulus seek- 
ing. Ryan and Foster (1967) have demon- 


1 The author is indebted to M. Alvin, A. Caplan, 
G, Eaton, J. Ettinger, D. Hlady, R. Kaplan, B. 
Kelch, D. Mostov, L. Ruhf, L. Schlamowitz, A. 
Smolens, and L. Spetner for their help in designing 
and executing these studies. 

Requests for reprints should be sent to the author, 
who is now at the Department of Psychology, Car- 
negie-Mellon University, Pittsburgh, Pennsylvania 
15213. 


strated that contact athletes (who presumably 
enjoy the intense stimulation of contact 
sports) tend to be “reducers” on the kines- 
thetic aftereffects task, while individuals who 
do not participate in any sport tend to be 


“augmenters.” Petrie, McCulloch, and Kazdin 
(1962) reported that juvenile delinquents, 
who perhaps were led to their delinquency by 
a desire for “thrills,” were more likely than 
control subjects to be kinesthetic aftereffects 
reducers. Eysenck (1955) and Petrie (1967) 
have shown that extraverts are particularly 
likely to reduce on the kinesthetic aftereffects 
task; and Petrie has reported that double- 
blind ratings of campers’ “activity level” are 
related to kinesthetic aftereffects performance, 
with reducers being more active than aug- 
menters, Finally, Petrie has reported that 
kinestethic aftereffects reducers are more 
likely than kinesthetic aftereffects augmenters 
to smoke, drink, and bite their nails (be 
haviors which presumably are performed for 
the stimulation they provide). These investiga- 
tions suggest that the kinesthetic aftereffects 
task does measure individuals’ needs for stim- 
ulation and that kinesthetic aftereffects re 
ducers have higher levels of this motive than 
do kinesthetic aftereffects augmenters. 
Individual’s performance on the kinesthetic 
aftereffects measure may seem à rather pecu- 
liar operational definition for a putative need 
for stimulation. Indeed, the face validity of 
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s essentially zero. Further, the 
‘hod by which the kinesthetic 
¿ is applied in these studies 
ably from traditional kines- 
ts methodology; it is not the 
ich, but rather the tendency to 
interpolated stimulation, 
: the measure of stimulus-seek- 
However, the interpretation of 
as an indicator of need for 
-onsistent with the available 


ung 


ing tenden 
the proce 
stimulation 


~ data. Thi nsistency provides considerable 
construct lity for the operation in ques- 
tion (Cronbach & Meehl, 1955). In addition, 
use of this technique suggests a theory which 
helps to account for observed individual dif- 
ferences i» stimulus-seeking tendencies. This 
theory involves three relatively simple as- 

sumptions. 
1. Exposure to a specific stimulus does not 


affect ali 
some ind 
come sm 
tense ove 
lus appea 
and more i 
differentia! 
kinesthet: 

subjects’ original judgments of the size of a 
stimulus with their judgments of this same 
stimulus following continued stimulation of 
the modality involved). 

2. Individuals desire to maintain an “op- 
timum” level of internal, received stimulation. 
If their current level of objective stimulation 
evokes an internal response which is either 
"tog small” or “too large,” they will attempt 
to change their input and will react negatively 
to the current input. If their current level of 
objective stimulation evokes an internal re- 
sponse which is “just right,” they will attempt 
to maintain this input and will react posi- 
tively to it. This assumption has appeared in 
the theories of Dember and Earl (1957), 
Fiske and Maddi (1961), Walker (1964), 
and Berlyne (1960); the available data (e.g., 
Brennen, Ames, & Moore, 1966; Hershenson, 
Munsinger, & Kessen, 1964; Sackett, 1965; 
Sales, 1968; Vitz, 1964, 1966a, 1966b) ap- 
pear to support it. 


lividuals in the same way. For 
iduals, the stimulus appears to be- 
. lighter, quieter, and less in- 
ime. For others, this same stimu- 
to become larger, heavier, louder, 
:tense over time. Presumably, this 
tendency is measured by the 
ftereffects task (which compares 
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3. The optimum level of internal, evoked 
stimulation tends to be similar for all indi- 
viduals. This third assumption is admittedly 
unsubstantiated. Nevertheless, combined with. 
the two which precede it, this assumption 
seems to account for the available data. 

Consider, for instance, the situation of à 
kinesthetic aftereffects reducer placed in an 
objectively “average” stimulus environment. 
For him, this environment should evoke a 
perceptual response which is somehow duller 
than the response which it evokes in an aver- 
age person. However, the reducer is assumed 
to have the same optimum subjective stimula- 
tion level as the average person; thus, he 
should be motivated to seek out situations 
which are objectively “more exciting” than 
average. The reducer would then be said to 
have a high need for (objective) stimulation, 
since he requires a great deal of input to at- 
tain the desired subjective optimum. 

Similarly, consider the situation of a kines- 
thetic aftereffects augmenter placed in the 
same average stimulus environment. For him, 
this environment should evoke an internal re- 
sponse which is somehow more intense than 
the response which it evokes in an average 
person. Thus, the augmenter should be mo- 
tivated to seek out situations which are ob- 
jectively “less exciting” than average. The 
augmenter would then be said to have a low 
need for (objective) stimulation, since he re- 
quires relatively little input to attain the de- 
sired subjective optimum. 

In the present paper, four studies are re- 
ported, based on this general argument. These 
studies vary in their specific content. How- 
ever, they are all informed by two broad 
hypotheses: 

1. Individuals with a high need for stimula- 
tion, as indicated by reduction on the kines- 
thetic aftereffects task, are attracted to com- 
plex, novel, and interesting stimulus situa- 
tions; presumably, this tendency results from. 
the fact that “normal” stimulus environments 
evoke subjective responses which are “too 
dull” for them. Individuals with a low need 
for stimulation, as indicated by augmentation 
on the kinesthetic aftereffects task, are at- 
tracted to quiet, peaceful, and simple stimulus 
situations; presumably, this tendency results 
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from the fact that normal stimulus situations 
evoke subjective responses which are “too in- 
tense” for them. 

2. Individuals with a high need for stimula- 
tion react favorably to complex, novel, and 
interesting stimulus situations; presumably, 
this results from the fact that these situations 
provide them with the greater-than-average 
objective input which they require. Individuals 
with a low need for stimulation react favor- 
ably to quiet, tranquil, and simple stimulus 
situations; presumably, this results from the 
fact that these situations provide them with 
the less-than-average objective input which 
they require. 

Since all of the studies presented here are 
“little” experiments, none represents a truly 
convincing test of these hypotheses. Further- 
more, not all of the findings are impressive 
when considered alone. In some cases the sig- 
nificance tests reached only marginal levels of 
statistical significance ( < .10, one-tailed); 
in one instance the expected findings were 
noted for one subsample, but not for another. 
However, the body of data is both internally 
consistent and congruent with the theory 
sketched above. The investigations reported 
below should thus be considered not primarily 
as separate experiments, but rather as 
parallel explorations in a single area, 


Stupy I—PREFERENCES AMONG VISUAL 
STIMULI 


A straightforward derivation from the the- 
ory developed here is that individuals with a 
high need for stimulation prefer complex, 
visually interesting stimuli, while individuals 
with a low need for stimulation prefer simple, 
visually “quiet” stimuli. Study I provides a 
direct test of this prediction. 


Method 


Subjects. Subjects were 30 undergraduates at the 
University of Michigan. The sample included 15 
males and 15 females, all drawn from the Univer- 
sity's paid subject pool. 

Experimental. procedure, Subjects were run indi- 
vidually for this experiment. The subject was ad- 
ministered the kinesthetic aftereffects measure as soon 
as he reported to the laboratory; the methodology 
employed was a variant of the procedure described 
by Petrie (1967). The subject was blindfolded and 
then was presented with (a) a 30-inch-long wedge- 
shaped block which increased from 4-inch wide at 
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its narrowest end to 4 inches at its widest end and 
(b) a 1ł-inch-wide rectangular block, The subject 


was asked to grasp the rectangular bk vith one — 
hand and then to run his other hand up the wedge- 
shaped block (an ascending judgment) until he 
found the point where the width of the wedge 
seemed to be the same as the width of the rectangle, 
The subject was given two practice trials, and then 
he was asked to make four judgments of the ree- 


tangular block. The average of these 
served as the subject’s pretest score for 
thetic aftereffects task. 


gments 
e kines- 


Following these judgments, the subje asked 
to rub, handle, and feel a 24-inch rectar * block 
for 1 minute. Thereafter, the subject v sked to 
make four additional judgments of of the 
14-inch rectangular block. A subject e on the 
kinesthetic aftereffects task was his e signed 
change from the first four judgments last four 
judgments. This measure is continuous small or 
negative numbers implying relative kines after- 
effects reduction (and presumably hi levels of 
need for stimulation) and large positive numbers 
implying relative kinesthetic aftereffects augmenta- 
tion (and presumably low levels of need for stimula- 
tion). The sample was divided into “high-need-for- 
stimulation” and “low-need-for-stimulation” groups 
of 15 each. 

Following the administration of the kinesthetic 
aftereffects procedure, the subject was given twelve 
3X5 inch black and white photographs of Scottish 
tartans,? each laminated in a separate cesing. The 
tartan photographs exhibited approximately the same 


shade of gray; however, they differed co 
the number of separate rectangles whic J 
tained. The smallest number of elements in any of 
the photographs was 91, while the |. t number 
of elements was 2,109. These variations among the 
tartans in number of stimulus elements served as the 
indicator of stimulus complexity. 


The subjects were asked to indicate, for each of 
the 12 stimuli, (a) their liking for the tartan and 
(b) the complexity which they perceived in the 


tartan. One-hundred-point scales were employed for 
each of these ratings, with ratings of 100 indicating 
extreme liking (or extreme complexity) and ratings 
of 1 indicating extreme disliking (or extreme sim- 
plicity). The subjects were told that they could as- 
sign all high numbers to the 12 tartans, all low num- 
bers, or numbers from the entire range of the scale. 
"These ratings constituted the measures of preference 
and of subjective complexity for this study. 

When the experiment was concluded, the subjects 
were thoroughly debriefed and were given the Op- 
portunity to question the experimenter concerning all 
aspects of the study. 


Results 


The present theory demands that the high- 
need-for-stimulation subjects should evidence 


"The author is indebted to F. Joseph Mortenson 
for making these photographs available. 
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of the subjects! need for 


a relative preference for the more complex 
tartans, while the low-need-for-stimulation 
subjects should evidence a relative preference 
for the less complex tartans. Figure 1 presents 
- the data relevant to this prediction. 

— As can be seen, the data generally support 
- the hypothesis. For five of the six less com- 
plex tartans in the set, the average rating 
given by the low-need-for-stimulation sub- 
jects is higher than that given by high-need- 
for-stimulation subjects. For five of the more 
complex tartans in the set, the average rating 
given by low-need-for-stimulation subjects is 
lower than that given by high-need-for-stimu- 
lation subjects. This pattern of results is sig- 
nificant beyond the .05 level (Fisher's exact 
test). There was no pronounced difference be- 
tween male and female subjects in either the 
Strength or magnitude of this relationship. 

Other ways of analyzing the data support 
the same conclusion. The average rating given 
to the six least complex tartans by subjects 

With a low need for stimulation was larger 
than the average rating given to the six most 
. Complex tartans by these subjects, while this 
pattern was reversed for subjects with a high 
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STIMULUS COMPLEXITY 


der of increasing complexity) 


Frc. 1, Liking for 12 tartans which differ in stimulus complexity as a function 


stimulation (n Stim). 


need for stimulation. The data relevant to this 
point are presented in Table 1; the interaction 
effect barely missed significance at the .10 
level (F = 2.74, p< 15, two-tailed test). 
Furthermore, the intrasubject correlations be- 
tween objective complexity and liking over the 
12 stimuli were significantly more positive for 
high-need-for-stimulation subjects than they 
were for low-need-for-stimulation subjects; 
the Mann-Whitney U test for this effect was 
81 (p < .10, one-tailed). 

Other data from this experiment support a 
similar conclusion with regard to subjective 
(reported) complexity. Specifically, the intra- 


TABLE 1 


MEAN LIKING FOR THE Six Most COMPLEX ‘TARTANS 
AND THE Six Least COMPLEX TARTANS FOR 
InpivipvAts WITH HiGH AND Low NEEDS 
FOR STIMULATION 


Stimulus complexity 


Level of need for |— 
stimulation 


Six least com- Six most com- 


plex tartans plex tartans 
Low (n = 15) 53.87 47.19 
High (» = 15) 47.57 54.43 


es uS E od 


$ 
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subject correlations between subjective com- 
plexity and liking over the set of tartans were 
significantly higher for subjects with a high 
need for stimulation than they were for sub- 
jects with a low need for stimulation; the 
Mann-Whitney test for this effect was 64 
(p < .025, one-tailed). Again, male and fe- 
male subjects did not substantially differ in 
either the strength or direction of this effect. 


Discussion 


The results of Study I support the hypoth- 
esis under consideration. As expected, high- 
need-for-stimulation subjects indicated a rela- 
tive preference for complex stimuli, while low- 
need-for-stimulation subjects indicated a rela- 
tive preference for simple stimuli. The inter- 
action between behavior on the kinesthetic 
aftereffects task and stimulus complexity 
clearly does not account for a substantial 
amount of the variance in the preference rat- 
ings used in this study. Nevertheless, these 
ratings are undoubtedly overdetermined, and 
the interaction between the need for stimula- 
tion (as operationalized here) and tartan 
complexity does seem to have been a signifi- 
cant factor. 


Stupy IJ—BrHAvior IN AN IMPOVERISHED 
ENVIRONMENT 


Study I considered only the subjects’ re- 
ported evaluations of stimuli which varied in 
complexity, interest, or intensity. However, 
the theory sketched here also suggests predic- 
tions concerning differences in the behavior of 
individuals who differ in their measured need 
for stimulation, Study II deals explicitly with 
this issue. The hypothesis of this investiga- 
tion was that high-need-for-stimulation sub- 
jects are particularly likely to provide their 
own stimuli in situations which are objectively 
dull and uninteresting. 


Method 


Subjects. Subjects were 31 undergraduates (14 
males and 17 females) drawn from the University 
of Michigan subject pool. 

Experimental procedure. As soon as the subject ar- 
rived at the laboratory, he was administered the 
kinesthetic aftereffects measure, He then was taken 
to a moderate-sized room equipped with a one-way 
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mirror, seated at a small table, and asked 
“for a few minutes while I get some : 
ready." The subject was then left ak 

minutes, During this waiting period he w 1 
through the one-way mirror by an observer; 


observer’s task was merely to record each behavior 
performed by the subject and the time at which it 
was performed. In no case was the obser aware 
of the subject's score on the kinesthetic 'reffects 
measure. 

Following this period, the subject was trken by 
the experimenter to another room and was asked to 
estimate (to the nearest minute and second) how 
long he had been waiting for the experim in the 
one-way mirror room. This question was included 
here on the assumption that subjects with a high 
need for stimulation would experience the amount of 
time he waited as longer than would sv with a 
low need for stimulation (Ryan & Fo: 1967). 
Finally, the theory and hypotheses involved in the 
study were explained to the subject, and he was 


allowed to question the experimenter abou! «ll Eg 
of the procedure. (s Y 
Measures. The observers’ code sheets were rated 
blind by two raters for the amount of ac.ivity ex- 
hibited by the subjects during the waitiij; period. 
A 10-point scale was used by both raters, with a 
score of 10 being given to subjects. who deft their 
seats in order to explore the experimental room and 
a score of 1 being given to subjects who essentially 
waited quietly throughout the 74-minute period. This 


rating procedure is obviously highly intui ive. Never- 
theless, the two independent judgments correlated .90 
over the sample of 31 subjects, suggesting that the 
ratings probably constitute a reasonable measure of 


h subject 
i by the 


the subjects’ behavior in the situation, E 
was assigned the average of the ratings give 
two judges. 


Findings J 


| 

Overall, a significant correlation (7 = .38, 
p < .05) was found between the subjects’ 
need for stimulation and their rated activity 
in the experiment. High-need-for-stimulation 
subjects thus were significantly more likely 
than low-need-for-stimulation subjects to look 
through their purses, read magazines, walk 
around the experimental room, or (in oné 
case) leave the laboratory in order “to look 
for the experimenter.” A small negative cor- 
relation (r — —.19, not significant) was also 
observed between measured need for stimula- 
tion and the estimate given by the subjects 
of the number of seconds that they had been 
waiting for the experimenter. There were no 
noteworthy sex differences with regard t9 
these findings. 
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Discussion 
The observed correlation between need for 
stimulation and rated behavior is a reasonable 


confirma f the hypothesis involved. As 
expected, subjects high in the need for stimu- 
lation did scem particularly likely to provide 
in their í behavior the stimulation which 
was lack n this relatively impoverished 
environm This result is congruent with 
unpublished data discussed by Eysenck (1967, 
p. 116). I : unpublished study, high-need- 
for-stimu n subjects were found to be par- 
ticularly likely to cope with mild sensory 
deprivation by moving around and tapping 
their hands and feet (or to avoid it by going 
to sleep ) 

The observed small relationship between 
judged elapsed time and need for stimulation 
is somewhat surprising, particularly since 
Ryan and Foster (1967) have reported that 
high-need-for-stimulation subjects do experi- 
ence elapsed time as longer than do low-need- 


for-stimulation subjects. Perhaps the relatively 
large amount of activity exhibited by the 
present high-need-for-stimulation subjects 
made the time seem to pass quickly for them, 
while the relatively reduced amount of ac- 
tivity exhibited by the present low-need-for- 
stimulation subjects made the time drag out 
for them, This explanation seems reasonable, 
and it does not conflict with the Ryan and 
Foster data, since in their study all subjects 
were required to sit quietly during the ex- 
periment. 


Srupy TIT— BEHAVIOR IN A GROUP 
DISCUSSION 


The direction of the expected relationship 
between need for stimulation and talking in 
a group discussion is debatable. One might 
argue that individuals who have a high need 
for stimulation would be likely merely to 
listen during a discussion, since this would 
“maximize” the objective stimulation avail- 
able to them, However, this prediction seems 
to ignore the dynamics of group discussions. 
If one or more subjects in a group remain 
quiet during a conversation, the effect of their 
silence surely will be to diminish the overall 
stimulation available. Indeed, in a relatively 
small group (three or four), the effect of one 


or two quiet individuals on the conversation 


might be disastrous—as many hostesses know! 
The present hypothesis is thus that indi- 
viduals with a high need for stimulation are 
particularly likely to exhibit high levels of 
talking during a group discussion. Their high 
verbal output would increase the amount of 
conversation in the group. It would also in- 
crease the number of communications di- 
rected toward themselves (Collins, 1960), and 
it would increase the proprioceptive and 
kinesthetic stimuli which they receive from 
their own verbal behavior. All of these out- 
comes would presumably be favorably re- 
ceived by individuals with a high need for 
stimulation, and this should serve to maintain 
high levels of talking in these subjects. 


Method 


Subjects. Subjects were 44 undergraduates (24 fe- 
males and 20 males) at the University of Michigan, 
all drawn from the psychology subject pool. 

Experimental procedure. Subjects arrived at the 
laboratory in same-sexed groups of 2-4. When the 
entire group was assembled, each subject was taken 
by an experimenter to a small individual cubicle, 
and the kinesthetic aftereffects measurement was ob- 
tained. All of the subjects were then brought into a 
large discussion room. They were seated together at 
a table and were asked to work together on a “brain- 
storming” task for 8 minutes. The rules of this pro- 
cedure were simple, The subjects were given a prob- 
lem and were asked to state all ideas which the 
problem suggested to them, They were encouraged to 
emit ideas freely, and they were told that no idea 
should be rejected merely because it seemed far- 
fetched. The specific problem used was the “thumbs” 
problem, used previously by Taylor, Berry, and 
Block (1958). This problem asks subjects to state 
those difficulties or changes which would result from 
every person’s being born with an extra thumb on 
each hand. This topic was chosen because Taylor 
et al, found that it elicited the greatest interaction 
among brainstorming subjects. When the discussions 
were completed, all subjects were thoroughly de- 
briefed. 

Measures. While the subjects discussed the problem, 
two experimenters sat inconspicuously in a corner 
of the room and merely counted the number of state- 
ments made by each subject in the group. If a sub- 
ject emitted two (or more) ideas in a single verbal 
intervention, the number of specific ideas was tallied 
by the experimenters. If a subject merely reacted to 
an idea suggested by another individual (“Hey, that’s 
great”), this was coded as a statement, although no 
ideas were presented. These ratings were made inde- 
pendently by the experimenters, each of whom had 
previously scored one of the subjects on the kines- 
thetic aftereffects procedure. Although the possibility 
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of experimenter bias clearly exists, the magnitude of 
the interjudge reliability coefficient (r = .95 over the 
sample of 44 subjects) suggests that the measure in- 
volved was reasonably objective and bias free. Each 
subject was assigned the average of the ratings given 
by the two judges. Because the various groups dií- 
fered considerably in the average number of state- 
ments made, each individual's verbal output score 
was transformed to a z score on the basis of his 
group's mean and standard deviation. 


Results 


Overall, a small but significant correlation 
(r = .25, p < .05, one-tailed test) was ob- 
served between the subjects' need for stimula- 
tion and their verbal output during discussion. 
However, the male and female subsamples 
differed considerably in the strength of this 
relationship. For male subjects, the correla- 
tion between need for stimulation and verbal 
output was .45 (p < .05, one-tailed test); for 
female subjects the same correlation was only 
02. The difference between the two correla- 
tions approached (but failed to meet) ac- 
cepted levels of statistical significance (z — 
1.43, p < .15, two-tailed test). 


Discussion 


The data clearly support the major hy- 
pothesis of this study, at least for male sub- 
jects. This finding also supports Eysenck’s 
(1955, 1967) contention that high-need-for- 
stimulation subjects are likely to be social 
extraverts. 

The failure of this relationship to obtain 
in female subjects as well is baffling, and no 
intuitively compelling explanation of this sex 
difference is immediately available. Perhaps 
the female subjects were more sensitive than 
their male counterparts to the distribution of 
participation and attempted to “give everyone 
a chance to talk." "This presumed tendency 
would decrease the correlation between verbal 
behavior and the need for stimulation in fe- 
male subjects, and it is congruent with the 
popular picture of the behavior of females in 
discussions. However, there is no evidence in 
this study which compels one to accept this 
argument. In any event, the failure of the 
kinesthetic aftereffects measure significantly 
to predict verbal behavior in females does not 
necessarily indicate that the need for stimula- 
tion operates differently for the two sexes. 
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There was no appreciable difference between 
female and male subjects in the rel ‘tionship 
between need for stimulation and appropriate 
dependent variables in either of the other ex- 
periments described above. 


Srupv IV—ArrENTION to Comp! AND 
SIMPLE STIMULI 

A final derivation from the present theory 
is that subjects’ tendencies to attend to dif- 
ferent stimulus configurations are, in part, a 
joint function of their needs for stimulation 
and of the complexity of the stimuli. Specifi- 
cally, one would expect that subjects with a 
high need for stimulation are particularly 
likely to attend to relatively complex ‘imuli, 
since these stimuli offer the interest whi 


high-need-for-stimulation individuals are said 
to seek. Study IV offers a direct test of thi: 

prediction. In addition, Study IV allows one. 
to determine what effect, if any, the presence 
of multiple, conflicting inputs has vpon the 
attention of individuals who differ in need 
for stimulation. 


Method 


Subjects. Subjects for this study were 48 
undergraduates drawn from the University oi 
gan psychology subject pool. 

Experimental procedure. Two auditory stimulus in- 
puts were employed in this experiment. The first of 
these stimuli was a relatively simple monologue, 
“Soviet Youth: Myth and Reality” (Sherman, 1965). 
The monologue dealt with the attitudes and experi- 
ences of one Russian man, Yuri, and lasted for ap- 
proximately 5 minutes, The second of the stimuli was 
a relatively complex dialogue especially prepared for 
this experiment. The dialogue was between two male 
college students, and it dealt with themselves, their 
respective fiancées, their fiancées’ orientations toward 
graduate school, their relationships with their fiancées, 
and their orientations toward their respective future 
marriages. Like the monologue, the dialogue lasted 
for approximately 5 minutes, Both stimulus programs 
were recorded on magnetic tape and were presented 
to the subjects by means of a tape recorder.’ 

Two experimental conditions were employed. In 
the first, the subjects were told that two tape- 
recorded messages would be sequentially presented to 
them. They were told that one of these inputs 
would be a monologue and that the other would be 
a dialogue, and they were encouraged to “pay par- 
ticular attention to the monologue." Order of presen- 


* The author is indebted to Greg Kelch and Jim 
Plafkin for their assistance in creating the two tape 
programs employed here. 


iwo messages was counterbalanced 
Twenty-three subjects, run indi- 
d in this condition, 

condition, the subjects were told 
ecorded messages would be presented 
to them through the two speakers of 
tape recorder. Like subjects in the 


that two tape 
simultaneou 
4 stereophc 


first condition, the subjects were told that one of 
the programs would be a monologue and that the 
other would a dialogue, and they were encour- 
aged to “pay particular attention to the monologue.” 


ution was included to determine whether 
or not the hypothesized tendency of high-need-for- 
stimulation subjects to attend to the complex dialogue 
would interfere with their ability to receive a focal 
stimulus, regardless of its complexity. Twenty-five 
individually run subjects were used in this condition. 

Following presentation of the two messages, the 
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subjects were asked to answer multiple-choice factual 
uestions about the monologue and dialogue. Twenty 
tions were selected from the monologue con- 
tent; 14 were selected from the dialogue. Retention 
of factual content thus was the operational definition 
of the su attention to the two stimulus inputs. 
When the ects had finished responding to this 
factual questionnaire, the experimenter explained all 
of the ma: tions and hypotheses to them. 


Measures. The 20 questions directed toward the 
monologue the 14 questions directed toward the 
dialogue were scored for “number correct.” These 
Scores represent the two dependent variables of this 
study. 

Results 


The basic findings of Study IV are pre- 
Sented in Table 2. As can be seen, there was a 
Strong tendency for individuals with a high 
need for stimulation to retain more of the 
dialogue than was retained by individuals low 
in this motive. This difference was observed 
in both the sequential presentation (¢ = 2.23, 
P < .025) and the simultaneous presentation 
(t = 3.16, p < .001) conditions. On the other 
— hand, there was no tendency for either group 

of subjects to show significantly increased re- 
tention of the monologue in either of the con- 
ditions used here, High-need-for-stimulation 
Subjects thus appeared to attend more closely 
to highly complex stimulus patterns than did 
low-need-for-stimulation subjects, but the 
available data do not indicate that this dif- 
| lerence was repeated for less complex stimulus 
patterns, 

Further, the data do not indicate that the 
attention patterns of the two groups of sub- 
- Kcts differed as a function of the experi- 
Mental condition (simultaneous presentation 
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TABLE 2 


MEAN NUMBER OF CORRECTLY ANSWERED cu ^ 
POR THE SIMPLE MONOLOGUE AND THE COMPLEX —— 


DIALOGUE IN Two PRESENTATION. CONDITIONS 
BY SunJEcrs with Hick AND Low NEEDS 
FOR STIMULATION 


1 
| Presentation condition 


Level of 

need for Sequential Simultaneous 
stimulation | 

| Monologue | Dialogue | Monologue | Dialogue 

Low 

x 15.67 9.92 

SD 2.46 2.56 

n | 12 12 
High | 

x 16.82 | 12.00 

SD 2.08 .54 

n 1 | 11 


Note.—Four subjects in the simultaneous presentation cone 
dition did not answer any questions relevant to the dialogue. 
Their data for the monologue were omitted in these analyses, 


and sequential presentation). To be sure, the 


subjects did considerably better on both com- - 


munications when the messages were presented 
sequentially ( < .001). However, there was 
no interaction effect between measured need 
for stimulation and presentation condition for 
either of the two messages. The relevant F 
ratios for the Kinesthetic Aftereffects Per- 
formance X Presentation Condition interac- 
tion were .64 for the dialogue and .01 for 
the monologue, both clearly nonsignificant. 


Discussion 


The data from Study IV generally support 
the hypothesis involved. To the extent that 
retention of factual material is a meaningful 
operation for attention to this material, the 
high-need-for-stimulation subjects in the sam- 
ple did seem to attend more closely to the 
complex dialogue than did the low-need-for- 
stimulation subjects. This conclusion is 
strengthened by the fact that no significant 
differences were obtained with regard to the 
more simple monologue. 

The statistical insignificance of the two 
interaction effects also suggests an interesting 
conclusion. Specifically, this “nonfinding” 
seems to imply that the high-need-for-stimula- 
tion subjects’ increased attention to the com- 
plex dialogue in the simultaneous presentation 
condition does not interfere with their con- 
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current ability to attend to the more simple 
monologue. That is, subjects with a high need 
for stimulation hear about as much of the 
monologue in the simultaneous presentation 
condition as do subjects low in this motive, 
even while they are hearing substantially more 
of the dialogue. This finding is somewhat 
reminiscent of the conclusion of Silverman 
and Gaardner (1967), that high-need-for- 
stimulation subjects show a higher rate of 
saccadic eye movement than is shown by low- 
need-for-stimulation subjects, and of the find- 
ing (discussed by Eysenck, 1967, pp. 87-93) 
that high-need-for-stimulation subjects are 
particularly likely to vary their objects of 
fixation over time. Perhaps, in the simulta- 
neous presentation condition, subjects with a 
high need for stimulation managed to shift 
their attention so quickly that they were able 
to receive major parts of both inputs. Sub- 
jects with a low need for stimulation, who 
presumably varied their attention more slowly, 
seemed unable to receive as much of the two 
messages; however, their performance ap- 
parently did not deteriorate on the input to 
which they were encouraged to attend. 


CONCLUSIONS 


The present data suggest that the need for 
stimulation, as measured, accounts for a sig- 
nificant amount of the variance in individuals’ 
behavior in a broad range of situations. For 
instance, in the present data there are sug- 
gestions that kinesthetic aftereffects perform- 
ance relates to the subjects’ attitudes, social 
behavior, activity level, and preferred objects 
of attention. Further, data from previous 
studies indicate that the need for stimulation 
is a determinant of subjects’ delinquent acts 
(Petrie et al., 1962), responses to perceptual 
deprivation (Petrie, Collins, & Solomon, 
1958), and athletic participation (Ryan & 
Foster, 1967). The scope of the findings indi- 
cates that the need for stimulation is similar 
in its generality to traits such as field de- 
pendence (Witkin, Dyk, Faterson, Good- 
enough, & Karp, 1962), repression-sensitiza- 
tion (Byrne, 1964), leveling-sharpening 
(Holzman & Klein, 1954), and category width 
(Pettigrew, 1958). 

In addition, it is interesting to note that 
the present analysis seems to be remarkably 
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close to one part of Pavlov's neurological 
typology. Pavlov believed that dogs (and, by 
extension, people) could be differen\isted in 
terms of the relative “strength” or “weakness” 
of their nervous systems. “Strong” individuals, 
in Pavlov's analysis, are able to respond ef- 
fectively to extremely intense stimuli. Further, 
strong subjects are felt to be competent in 
dealing with massed stimulation. “Wes” sub- 
jects were considered to be unable to cope 
with intense or massed stimulation; con- 
versely, weak individuals are believed to be 
particularly adept in dealing with faint stimuli, 

Gray’s (1964) interpretation of this part of 
Pavlov's system is virtually identical to the 
analysis made here. Gray proposed that weak 
and strong individuals differ in their internal 
reactions to stimuli of specific intensities. A 
weak nervous system, he suggested, ‘ vonds 
to stimulation with the more intense excitatory 
process [p. 160].” This assumption would 


account for a weak subject’s relative effective- 
ness in dealing with faint stimuli, since the 
internal representation of such stimuli would 
be relatively intense in weak subjects. It 
would also account for a weak subject’s rela- 
tive difficulties in dealing with powerful stim- 
uli, since the internal representations of these 
stimuli would presumably be beyond the 
“threshold of transmarginal inhibition" in 
these subjects. Conversely, Gray (1964) 
argued that a strong nervous system “acts as 
if it damped down stimulation |p. 282]." 
Again, this assumption effectively accounts for 
the observed performance of strong subjects. 
Gray’s comments will be recognized as re- 
statements of the first assumption of the 
theory employed here. 

If Gray’s analysis of Pavlov’s position is 
correct, and if the theory proposed here is 
accurate, then strong individuals should have 
a high need for stimulation. At least some 
findings suggest that this inference may be 
accurate. For instance, Petrie (1967) has 
found that high-need-for-stimulation subjects 
(classified by the kinesthetic aftereffects pro- 
cedure) are particularly able to withstand 
painful experiences (e.g., as in childbirth); 
this result suggests that the stimuli involved 
are “damped” by their (presumably strong) 
nervous systems. Other workers have indi- 
cated that extraversion (which, as indicated 
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» is correlated with the need for stimula- 
lates to measures of strength in the 
system. Siddle, Morrish, White, and 
(1969) have reported a rank-order 
lion of —.52 (N — 15) between extra- 
and visual sensitivity in a threshold 
eriment. As expected, extraverts (who have 
| presumably high need for stimulation) were 
elatively strong, in th e that they were 
nsensitive to faint stimuli. However, Siddle 
al. suggested that hig! Is of neuroticism 
disrupt this relationship. Mangan and 
orman (1969) have reported that extra- 
verts show particularly rapid habituation of 
the orienting response (2 phenomenon which 
ould characterize strong subjects); and Man- 
gan and Farmer (1967) have indicated that 
extraverts, like strong individuals (Gray, 
1964), respond relatively slowly to low-in- 
tensity stimuli. (Surprisingly, Mangan & 
Farmer interpreted the latter finding as evi- 
e that introverts were the strong sub- 
ts.) It should be mentioned that Gray 
(1967) and White, Mangan, Morrish, and 
iddle (1969) have recently concluded that 
extraversion is only similar to, and not identi- 
with, the Pavlovian notion of strength. 
However, since extraversion is probably not 
perfectly correlated with the need for stimula- 
tion, there may still be a close relationship 
ween the concepts of strength and need for 

. Stimulation.* 


*Since the present paper was completed, a number 
of studies have appeared which are generally rele- 
| Vant to this speculative neurophysical argument. And 
many, although not all, of these investigations have 
Provided supportive data. Spilker and Callaway 
(1969) reported that individuals with a high need 
j for stimulation (kinesthetic aftereffects reducers) ex- 
hibited smaller averaged electroencephalogram (EEG) 
-Tesponses to various visual stimuli than did indi- 
- Viduals with a low need for stimulation. Buchsbaum 
and Silverman (1968) presented similar data: how- 
ever, in this second investigation the predicted rela- 
tionship between the need for stimulation and the 
Oked EEG response was not observed for rela- 
ely faint stimuli. Similarly, Silverman, Buchsbaum, 
4nd Henkin (1969) observed that in female sub- 
. jets, individuals with a high need for stimulation 
Showed relatively small EEG responses to intense 
stimuli (but not to faint stimuli). Paradoxically, 
lowever, Silverman et al. did not observe such a 
Pattern in male subjects. Furthermore, in contrast to 
the prediction which would be made from the argu- 
ment sketched here, Silverman et al. found that rela- 
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Naturally, the results of the four studies 
reported here do not exhaust the implications 
of the present theory. Indeed, the scope of 
the findings suggests that these experiments 
have only begun to indicate the various areas 
in which the need for stimulation may sig- 
nificantly influence behavior. The data do, 
however, suggest a substantial amount of 
validity for the theory (and for the kines- 
thetic aftereffects measure) used here; they 
also indicate that a need for stimulation may 
account for an unexpectedly large amount of 
the social behavior which we observe in our- 
selves and others. 


tively sensitive subjects (as determined by a series 
of threshold measures) exhibited not heightened 
EEG-evoked responses to the stimuli used, but 
rather diminished responses. Clearly, the last word 
has yet to be said on this issue. 


REFERENCES 


BerLYNE, D. E. Conflict, arousal, and curiosity. New 
York: McGraw-Hill, 1960. 

Brennen, W. M., Ames, E. W., & Moore, R. W. Age 
differences in infants’ attention to patterns of dif- 
ferent complexities. Science, 1966, 151, 354-356. 

Bucussaum, M., & SILVERMAN, J. Stimulus intensity 
control and the cortical evoked response. Psycho- 
somatic Medicine, 1968, 30, 12-22. 

Byrne, D. Repression-sensitization as a dimension of 
personality. In B. A. Maher (Ed.), Progress in 
experimental personality research. New York: Aca- 
demic Press, 1964. 

Coruws, B. E. The interaction of status and com- 
munication: Some hypotheses and an empirical 
test, Unpublished master’s thesis, Northwestern 
University, 1960. 

Cronsacu, L. J., & Meeut, P. E. Construct validity 
in psychological tests. Psychological Bulletin, 1955, 
52, 281-302. 

Demper, W. N, & Eart, R. W. Analysis of ex- 
ploratory, manipulatory, and curiosity behaviors. 
Psychological Review, 1957, 64, 91-96. 

Eysenck, H. J. Cortical inhibition, figural after- 
effect, and theory of personality. Journal of Ab- 
normal and Social Psychology, 1955, 51, 94-106. 

Evsencx, H. J. The biological basis of personality. 
Springfield, Ill.: Charles C Thomas, 1967. 

Fiske, D. W., & Mappi, S. R. The functions of 
varied experience. Homewood, Ill: Dorsey Press, 
1961. 

Gray, J. A. Pavlov's typology. Oxford: Pergamon 
Press, 1964. 

Gray, J. A. Strength of the nervous system, intro- 
version-extraversion, conditionability, and arousal. 
Behaviour Research and Therapy, 1967, 5, 151— 

169. 


r 


134 


Hersuensox, M, Munsincer, M., & Kesses, W. 
Preference for shapes of intermediate variability in 
the new born human. Science, 1964, 147, 630-631. 

Horzwaw, P, S, & Krxrx, G. S. Cognitive system- 
principles of leveling and sharpening. Jourmal of 
Personality, 1954, 22, 375-394. 

Manoa, G. L., & Farmer, R. G. Studies of the rela- 
tionship between neo-Pavlovian properties of 
higher nervous activity and Western personality 
dimensions: I. The relationship of nervous strength 
and sensitivity to extraversion. Journal of Exferi- 
mental Research in Personality, 1967, 2, 101-106. 

Maxcax, G. L., & O'Gorman, J. G. Initial amplitude 
and rate of habituation of orienting reaction in 
relation to extraversion and neuroticism. Journal of 
Experimental Research in Personality, 1969, 3, 
275-282. 

Perare, A. Individuality in pain and suffering. Chi- 
cago: University of Chicago Press, 1967. 

Perre, A., Corts, W., & Soromoy, P. Pain sensi- 
tivity, sensory deprivation, and susceptibility to 
satiation. Science, 1958, 128, 1431-1433. 

Pere, A, McCuttocu, R, & Kazprs, P. The per- 
ceptual characteristics of juvenile delinquents. Jour- 
nal of Nervous and Mental Disease, 1962, 134, 
415-421, 

Perricrew, T. F. The measurement and correlates of 
category width as a cognitive variable, Journal of 
Personality, 1958, 26, 532-544. 

Ryan, E. D., & Foster, R. Athletic participation and 
perceptual augmentation and reduction. Journal of 
"d and Social Psychology, 1967, 6, 472- 
476. 

SackETT, G. P. Effects of rearing conditions upon 
the behavior of rhesus monkeys (Macaca mulata). 
Child Development, 1965, 36, 855-868. 

Sates, S. M. Stimulus complexity as a determinant of 
approach behavior and inspection time in the 
hooded rat. Canadian Journal of Psychology, 1968, 
22, 11-17. 

SmrRMaN, G. Soviet youth: Myth and reality. In 


MET 


STEPHEN M. SALES 


E. Erikson (Ed.), The challenge of youth. Gardem 
City, N. Y.: Anchor Books, 1965. 

Sore, D. A. T, Moomisn, R. B., Wurre, K. D, & 
Maweax, G. L. Relation of visual sensitivity to 
extraversion. Journal of Experimental Research im 
Personality, 1969, 3, 264-267. 

SILVERMAN, J, BucuspAUM, M, & Hrsxiw, R 
Stimulus sensitivity and stimulus intensity con- 
trol. Perceptual and Motor’ Skills, 1969, 28, 71-78. 

SILVERMAN, J., & Gaarpner, K. Rates of saccadic eye 
movement and size judgments of normals and 
schizophrenics. Perceptual and Motor Skills, 1967, 
25, 661-667. 

SriLKER, B., & Carraway, E. “Augmenting” and “re- 
ducing” in averaged visual evoked responses to 
sine wave light. Psychophysiology, 1969, 5, 49-57. 

Tavron, D. W., Berry, P. C., & Brock, C. H. Does 
group participation when using brainstorming fa- | 
cilitate or inhibit creative thinking? Administra- 
tive Science Quarterly, 1958, 3, 23-47. 

Virz, P. C. Preferences for rates of information 
presented by sequences of tones. Journal of Ex- 
perimental Psychology, 1964, 68, 176-183 

Virz, P. C. Affect as a function of stimulus varia- 
tion. Journal of Experimental Psychology, 1966, 
71, 74-79. (a) 

Vi7z, P. C. Preference for different amounts of visual 
complexity. Behavioral Science, 1966, 11, 105- 
114. (b) 

WALKER, E. L. Psychological complexity as a basis 
for a theory of motivation and choice. Nebraska 
Symposium on Motivation, 1964, 12, 47-97. 

Warre, K. D, Mancan, G. L., Morrisa, R. B, & 
Supe, D. A. The relation of visual aiter-images 
to extraversion and neuroticism, Journal of Ex- 
perimental Research in Personality, 1969, 3, 268- 
274, 

WirkiN, H. A., Dyk, R. B., Farerson, H. F., Goop- 
ENouGH, D. R, & Karp, S. A. Psychological dif- 
ferentiation, New York: Wiley, 1962. 


(Received December 22, 1970) 


JESSE JONES VOLUME FILES 
Send orders to: 

JESSE JONES CORP. 

P.O. Box 5120 

Philadelphia, Pa. 19141 


SLOVULSAV 
TVOIDOTOHDASA 
XSIDO'TOHOAS4 
NVODHINV 


= 
a 
< 
Q 
© 
8 
fe) 
> 


Only $3.50 ea. 
3 for $10.00 
6 for $19.00 P.P.D. 


Now, the journals of the American Psychological Association can become an attractive 
permanent part of your professional library. These famous Jesse Jones volume files, 
especially designed to keep your copies orderly, readily accessible for future references — 
guard against soiling, tearing, wear Or misplacement of copies. 

This durable file will support 150 Ibs. Looks and feels like leather and is washable. The 
23-carat gold lettering makes it a fit companion for the most costly binding. 


The Journal of 


SOCIAL uo 
HE. 


The New Left and The Old 


Issue Editor: Armand L. Mauss 


The Lost Promise of Reconciliation: New Left vs. Old Left 


OX EOR EAM Armand L. v 
The New Left and American Politics after Ten Years .............4.... R. F 
Political Libertarianism at Berkeley: An Application of $ 
Perspectives from the New Student Left ................... Henry C. Fir 
Criteria and Contingencies of Success in a Radical 5 | 
Political Movement .... N. J. Demerath II, Gerald Marwell, and Michael T. A: | 
Majority Involvement in Minority Movements: 
Civil Right, Abolition, Untouchability ........ Gary T. Marx and Michael Use: 
Old and New Left Activity in the Legal Order: An Interpretation ..... Nathan Hakm 
Romantics and Renegades: Political Defection and the Radical Left ... Gilbert Abcarian 
New Left as Right: Convergent Themes of Political 
Discontent 


She acera rra acs be dog PAS ere eee David R. Schweitzer and James M. Elden 


Comments and Rejoinders 


Fred Arnstein. . , Comment on “Three Not-So-Obvious Contribution of Psychology 
to Peace” by Ralph K. White 


Ralph K. White. . . Rejoinder 
Fred Arnstein. . . Response to White's Rejoinder 
Ralph K. White. . . Counter Response 


H. Edward Ransford.. . Comment on "Internal-External Control and Black 
Militancy” by John R. Forward and Jay R. Williams 


John R. Forward and Jay R. Williams. . . Rejoinder | 


Order from: 


1971 ANNUAL SUBSCRIPTIONS RATES 
Journal of Social Issues 


INSTITUTIONS $15.00 
Post Office Box 1248 INDIVIDUALS 9.00 
Ann Arbor, Michigan 48106 SINGLE COPIES 3.00 


published quarterly by 
The Society for the Psychological Study of Social Issues 
(a division of the American Psychological Association) 


D 


JOUNAL of PERSONALITY 
and SOCIAL PSYCHOLOGY 


Volume 3. Number 2 ae b, 


CONTENTS X 
E. ORN 


The Urbən Alienations: Some Dubious Theses from Marx to Marcuse ...............-- 135 
Velvin Seeman 


Respons. ity, Reparation, and Self-Protection as Reasons for Three Types of Helping .. 144 
ncy N. Thalhofer 
io ommitment and Certainty upon Exposure to Supportive and Nonsupportive 
nf tion ases vedic ERES eT me cte ERIS SESS UAE NP ARN iia MNA 152 
tarles F. Behling 
Commi: tion Schedule and Cooperative Game Behavior .............. esses 160 
rman H. Voissem and Frank Sistrunk 
. p | Reinforcers: Classification, Interrelationship, and Some Differential x 
r CLES .. vete AREE Tere ee eee eee eee eee eee eee eee tret t rar nm 
i L. Turner, Edna B. Foa, and Uriel G. Foa 
Effect "hwarting on Cardiac Response and Physical Aggression ........... sees 181 
endan Gail Rule and Lynn Stewart Hewitt 
Some i ictive Conditions for the Occurrence of Psychological Reactance ............ 188 
sla Grabits-Gniech 
Pygm:! «9 Analyzed: Toward an Explanation of the Rosenthal-Jacobson Findings ...... 197 
Pamela C. Rubovits and Martin L. Maehr 
Effects of Agreement and Correctness on Relative Competence and Conformity ...... 204 


Ronald F. Ettinger C. J. Marino, Norman S. Endler, Sheldon H. Geller, and 
Taras Natziuk 


Tests of Preperformance and Postperformance Models of Satisfaction with Outcomes ... 213 
Robert. S. Sobel 


Changes in the Connotations of Color Names Among Negroes and Caucasians: 1963-1969 222 
John E. Williams, Richard D. Tucker, and Frances Y. Dunham 


Attitude Base as a Moderator of the Attitude Similarity-Attraction Relationship ...... 229 
Thomas R. Batchelor and Abraham Tesser 
Reduction of Physical Aggression: The Effects of Four Strategies ...... esee 237 
| Richard Pisano and Stuart P. Taylor 
Humor, Anxiety, and Task Performance ....... eee . 243 
| : Ronald 'E. Smith, James C. Ascough, Ronald F. Ettinger, and Don A. Nelson 


Attitude Change, Commitment, and Ego Involvement .........eee rererere 247 
Ramon J. Rhine and William A. J. Polowniak 


Situational Factors Contributing to a Psychological Advantage in Competitive 


i Negotiations ....... LT Nn Sc DECEM EE: 251 
Jerry I, Shaw 
Saat or Manvecarers “ADGRBTRO TE E oa ronas iorsin iy ee neas 242, 246 


| PUBLISHED MONTHLY BY THE AMERICAN PSYCHOLOGICAL ASSOCIATION 


— MÀ 


Swarthmore 
ANTHONY GREEN W. 


College 
ALD 


University of Kentucky 


GrnALD A, Ment 
Universit, 
Norman Mi 
University o 


SOHN 
f California, Berkeley 
R 


Southern California 


Murray S, MIRON 
Syracuse University 
ALBERT PEPITONE 
University of Pennsylvania 
Dean Prurrt 
State University of New York at Buffalo 
MILTON ROSENBERG 
University of Chicago 
Davin ROSENHAN 
Swarthmore College 
Davi J, SCHNEIDER 
P Amherst College 
ELAINE WALSTER 
University of Wisconsin 
OBERT ZAJONC 
University of Michigan 
OBERT ZILLER 
‘ University of Florida 
PHILP Zimparno 
Stanford University 


Volume ? August 1971 Number 2 
Journal of Personality 
and ? 375 
` px 
Social Psychology . ' M 
f Burea 
Editor: Jon T. LANZETTA, Dartmouth College 
ssociate Editors: CHARLES Kresier, University of Kansas 
Rosert Kreck, Dartmouth College 
SEYMOUR ROSENBERG, Rutgers University 
Consulting; Editors: Issue Consultants : 
NorMan NDERSON ACK BAIR 
" University of California, San Diego J Dartmouth College 
LBERT | i 
T2 5 Jonn Bareroor 
pm di ME Carleton University, Ottawa 
Univer f Massachusetts DuONARD Benkowrtz 
Jack Br University of Wisconsin 
Univer: J California, Berkeley MERRILL CARLSMITH 
Jack Br Stanford University 
eae ersity Jonn DARLEY 
RIE Bri NPENBRENNER Princeton University 
J ea iversity RICHARD DECHARMS 
pe E Washington University 
Extr kon o Rocers ELLIOTT 


Dartmouth College 
Norman FEATHER 

Flinders University of South 

Australia 

Martin FISHBEIN 

University of Illinois, Champaign 
JACQUELINE GOODCHILDS 

Mentoris Company, Encino, California 
PETER GUMPERT 

Columbia University 
Davin HAMILTON 

Yale University 
WinLiAM HAYTHORN 

Florida State University 
Harvey HORNSTEIN 

Columbia University 
GEORGE JERNSTEDT 

Dartmouth College 
CnuanLES MCCLINTOCK 

University of Santa Barbara 
Davin SHAPIRO 

Harvard Medical School 
Ewart SMITH 

Mentoris Company, Encino, California 
RICHARD TEEVAN 

State University of New York at Albany 
ELAINE WALSTER 

University of Wisconsin 


JBLISHED BY 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, Inc. 


The Journal of Personality and Social Psychology publishes original research report the 


areas of social psychology and personality dynamics. Only a limited number of the : tted 
manuscripts can be accepted, and priority is given to reports which, in the judgmen the © 
Editors, make substantial and significant contributions to the literature. Methodological les, 
studies primarily concerned with the development of measuring instruments, replicat and 
reports of negative results or failures to replicate published work are acceptable if are 
judged to make a substantial contribution to knowledge. Where possible, manuscripts ese 
categories should be brief (not more than nine typewritten pages) so that they can be pu as 


short reports. 


The Journal is issued monthly, four volumes per year. The subscription price is $40.00 a ally 
(foreign, $40.50). Single issues are $4.00. 


Manuscripts should be submitted in duplicate to: 


Professor John T. Lanzetta, Editor, Journal of Personality and Social Psychology, Depar of 
Psychology, Dartmouth College, Hanover, New Hampshire 03755. 


Manuscripts must conform in style to that described in the Publication Manual of th 
can Psychological Association (1967 revision). (a) An abstract of 120 words or less, t: 
separate page, must accompany each manuscript. Instructions for preparation of the a^^ 
in the Publication Manual (pp. 11-12) and in the American Psychologist (1961, 16, 833) 
title should be 15 words or less. If this length is exceeded, the Editors will use their jud 
shortening the titles of accepted reports. (c) Duplication of information in text, tables, 
should be avoided. (d) Figures should be of high quality and must not exceed 84 * 1! 
Figures will generally be reduced to single-column width. Symbols and letters should 
enough to be readable after reduction. Accepted papers are normally published in order oí r 
Early publication of a manuscript can be arranged by adding pages to the journal. The a 
bears the full cost of the added pages (APA office will quote current charges). Regular and early 
publication manuscripts are judged by the same standards, The length of regular manuscripts 


oY & Bae p 


should normally not exceed 45 typewritten pages. Longer papers will be considered for rev lar 
publication if the length is judged necessary for effective communication. Manuscrip! hich 
would exceed 60 typewritten pages and present a well-integrated and definitive research : m 


and theoretical development will generally be considered for publication in the Journal < f- 
sonality and Social Psychology Monograph series. 


The following policies govern reprinting or other forms of reproduction of materials 
righted by the American Psychological Association: Written permission must be obta 
copying or reprinting all tables, figures, and text of more than 500 words in length. P 
organizations who obtain permission to reproduce any such copyrighted materials must 
on the first page of the reproduced materials the exact copyright notice which appeared or 
Association’s original publication. There will be a charge of $10.00 per page for such permissions 
except when the requester is the author of the article to be reprinted, or the article is to be repro- 
duced in a limited number of copies solely for instructional purposes, 


HELEN ORR BARBARA R. RICHMAN Henry S. Sarmon Rosert D. Gorman 


Managing Editor Technical Editor Advertising and Subscription Manager 
Business Manager 


Subscriptions, address changes, and business communications should be sent to: 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 
1200 Seventeenth St., N. W. 
Washington, D. C, 20036 


Changes of address must reach the Subscription Office by the tenth of the month to take 
effect the following month. Undelivered copies resulting from address changes will not be replaced; 
subscribers should notify the post office that they will guarantee second-class forwarding postage. 
Other claims for undelivered copies must be made within 4 months of publication. 


PUBLISHED BY 

THE AMERICAN PSYCHOLOGICAL ASSOCIATION, Inc. 
Prince and Lemon Sts., Lancaster, Pa, 17604 

and 1200 Seventeenth St., N. W., Washington, D. C. 20036 


4 Second-class postage paid at Lancaster, Pa. 
Copyright © 1971 by the American Psychological Association, Inc. 


THE URBAN ALIENATIONS: 
DUBIOUS THESES FROM MARX TO MARCUSE? 


MELVIN SEEMAN * 


epartment of Sociology, University of California, Los Angeles 


accepted propositions, concerning four types of alienation (selí- 
feelings of powerlessness, social isolation, and cultural estrange- 
reviewed in the light of evidence from studies conducted in the 
, France, and Sweden. The propositions in question emphasize (a) 


nal and social consequences of modern alienated labor, (b) the co- 


various forms of alienation (ie. their typical appearance as a 
(c) the lack of community in urban society, and (d) the break- 


commonly held values in metropolitan life. In each case, the evidence 
erious doubt about the established wisdom concerning alienation. 


of Person ial Psychology 
#971, Vol. 19, N 43 
jely 
ient, 
re 
ate 
f 
The decad 60s was the era in which 
alienation en s a preeminent social and 
psychologica rhe crescendo of protest 
Seemed to c he wisdom of that word 


liagnosis of our condi- 
popularly adopted as the 

ent epoch. It has become 
routine to deñ iv troubles in the language 
of alienation to seek solutions in those 
terms. But si ires are sometimes hard to 
read, someti; irious, and sometimes too 
casually and iscuously used. They ought 
to be examined with care, and I propose to 
examine established preconceptions 


in the intel): 
tion; and it 
Signature of : 


wi 


some 


about alienation in the light of 10 years of 
analysis and re 


earch which has been aimed 
ition of that idea. 

My title hints in various ways at the main 
features that have characterized this work of 
Mine over the past decade. To speak of the 
urban alienations is to imply a concern for 
Social change, an interest in the purported 
effects of the emerging industrial and mass 
Society on the character and quality of social 
ife; and to speak of alienations is to empha- 
Size a conception of alienation which is plural- 


E ue address to the Division of Personality 
the racial Psychology, presented at the meeting of 
Bes E Psychological Association, Miami 
Bebted orida, September 1970. The author is in- 
s i three colleagues at the University of Cali- 
draft: Am, Angeles, for helpful reading of an earlier 
i i rnold Kaufman (philosophy), Gerald Shure 

ee ology), and Samuel Surace (sociology). 

on tues for reprints should be sent to the au- 
ORE fee met of Sociology, University of Cali- 

» Los Angeles 90024, 
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istic at root—that is, a conception that is 
wary of assuming the kind of thoroughgoing 
unity or consistency in social behavior which 
would encourage us to postulate, as Marcuse 
did (1964), a “one-dimensional man." To 
speak of dubious theses about alienation is to 
suggest an interest in secularizing this more or 
less “sacred” concept; it is to suggest that 
there is something here about which an em- 
pirical demonstration has to be made—the 
critical, evocative, and even romantic spirit 
that has infused the literature on alienation, 
whatever its very valid uses in some respects, 
being no substitute for clarity and rigor. And 
to speak of an idea that spans a century and 
a quarter, jrom Marx to Marcuse, is to recog- 
nize that we are dealing with an intellectual 
tradition that has shown remarkable endur- 
ance and thus to acknowledge that those, like 
myself, who take up the task of secularization 
must do so without compromising either the 
intellectual scope or the humanistic concern 
that generated and sustained the idea of ali- 
enation. 

What, specifically, are the theses at stake 
holding some promise of achieving such large 
aims? Perhaps the most important thesis con- 
cerns the centrality of work experience, the 
imputation being that alienation from work 
“Gs the core of all alienation [Schaar, 1961, 
p. 264]” and that the consequences of alien- 
ated labor color the life space of the individual 
in a profound and disturbing way. One can 
discern this theme in each of the three major 
streams of thought that constitute our legacy 
regarding the idea of alienation—in the early 
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Marx; in mass society theory; and in existen- 
tialism. It was Camus who wrote that *With- 
out work all life goes rotten; but when work 
is soulless life stifles and dies [cited in Fried- 
mann, 1961, p. i|”; but it might well have 
been Marcuse for whom the abolition of alien- 
ated labor was a cardinal necessity. It was a 
necessity for Marx, too, whose argument re- 
flects what Kaufman (1965) has called *the 
principle of the sufficiency of unalienated la- 
bor [p. 146|"—that is, the assumption that 
“the resolution of the alienation of labor was 
a sufficient condition of the resolution of all 
the other problems of alienation [p. 146]." 
And although de Tocqueville (1945), one of 
the early critics in the mass society tradition, 
never used the words “alienated labor," he 
lamented it nonetheless, arguing that indus- 
trialization and the division of labor led to 
the degradation of the worker (the workman, 
he wrote, “becomes more weak, more nar- 
row-minded, and more dependent [p. 159]"). 
We tend to put the point today in more mod- 
ern psychological language, so that Korn- 
hauser (1959), writing explicitly as a mass so- 
Ciety theorist, makes much of the effect of 
"self-estrangement" in the work place: the 
absence of a sense of personal worth and solid 
identity through work occasions the volatile 
activisms of contemporary mass movements. 

It all seems very reasonable, this notion 
that alienated work is bound up with assorted 
other alienated sentiments and life styles, and 
one does not have to be an old socialist to 
think so. Freud (1958) thought so, too 
(though in a quite different perspective from 
the standard Marxian one, at least insofar as 
work was inherently repressive, rather than 
a source of satisfaction). Writers of the most 
varied theoretical persuasion seem to agree 
that “a man's work is one of the most im- 
portant parts of his social identity [Hughes, 
1958, p. 43]"; and, if so, important conse- 
quences ought to flow from lack of fulfillment 
in that sector, What troubles me is that my 
own evidence, coming out of studies in the 
United States, France, and Sweden, does not 
coincide with this argument concerning work 
alienation. 

If we are to discuss that evidence, however, 
it becomes important to be very clear about 
what I mean by “work alienation,” how I 
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conceive of the other alienatio: is 
posed to be associated with, and it kind 
of specific consequences of alien: labor | 
have examined. I mean by work alienat 
something very close to what M meant. 
—namely, engagement in work v | is no 
intrinsically rewarding. As I note ny com 
ceptual paper on alienation son ars 
(Seeman, 1959), one of the conc: eani. 
that can be assigned to the elu idea 
"self-estrangement" is that it ses (i 
defined by) engagement in activi is ng 
rewarding in itself; in that res; ork 
simply one of the locales or activit whid 
alienation as self-estrangement c experi 
enced (as it can in housekeeping casua 
conversation, making love, or li T 
convention address—or making onet). Mi 


dex of work alienation was derivec 
analysis from a larger set of items o*iginall 
developed by Roper and reported ; 
sively by Blauner (1964) in his study 6 
factory workers. It consists of seven itemi 
which ask, in a variety of 3 Ms \ f 
individual’s job is engaging : awarding 
itseli—for example, Does le E. 
the job too simple to bring out his 
ties? Does the job really provide : 
try out one's own ideas? Are there o 
ties to make independent decision 
pretty routine work? All of this tel! 
of course, about other potential satis 
(all extrinsic) like pay, promotions, fri 
benefits, security of employment, workin 
conditions, and the social rewards on the jo 
(which is why work alienation and job satis 
faction are not the same thing). 
If alienated labor is crucial, this measut 
ought to correlate with other indexes of alien 
tion, for example, with the person’s genera 
ized sense of control over his own destiny- 
what Rotter (1966) calls internal versus € 
ternal control, and I have called *powerle: 
ness"; and it ought to correlate with thos 
extensive off-the-job consequences that dead 
ening work is supposed to lead to. You Cal 
5 Marx wrote: “What constitutes the alienation 
labour? First, that the work is external to 
worker, that it is not part of his nature; and thal 
consequently, he does not fulfill himself in his wot 
but denies himself, . . . It is not the satisfaction © 


need, but only a means for satisfying other ni 
(cf. Bottomore, 1963, pp. 124-125]." 


or isi 
othing 


easily genera ir own list of such conse- 
quences sin inciples usually invoked 
to predict t! e straightforward, and as 


far as I ca there are only three such 
“principles o ation": (a) the princi- 
ple of frustri gression, which holds that 
alienated w ilds frustrations that find 
release elsew ence, the prediction, e.g., 
that alienate should correlate with eth- 
nic hostility ve family relations, etc.) ; 
(b) the pri of social learning, which 
holds that gh alienated work, people 
learn the les dependence, lack of con- 
trol, passivi d low self-esteem that they 
apply elsev (hence, the prediction that 
alienated la ould correlate with political 
disengagem: ow political knowledge, ir- 
regular voti: ); and (c) the principle of 
substitution ^ holds that since alienated 
work fails i vide the requisite intrinsic 
satisfactions of -reativity and personal mean- 
ing, worker: iven to seek substitute, and 
generally sh: satisfaction elsewhere—for 
example, in striving and display, in 
frenetic sociability and leisure pursuits, etc. 
This is the ‘ovic, more or less, that Marcuse 
(1968) had in mind when he commented on 
our affluent | xploitative mode of produc- 
tion: “To the degree to which this sort of 


Work appears 2s superfluous, senseless, and 
unnecessary, while necessary for earning a liv- 
ing, frustration is built into the very produc- 
tivity of this society, and aggressiveness is 
activated [p. 262]." My puzzle, however, is 
that when such propositions about alienated 
Work are put in testable form, the supposed 
Consequences do not materialize. In the Swe- 
dish study (Seeman, 1967a), involving the 
Whole range of manual and white-collar oc- 
Cupations in the male work force in Malmö, I 
found that work alienation did not correlate 
with any of a set of measures chosen so as to 
represent the three principles of generaliza- 
tion: for example, work alienation did not 
Correlate with ethnic hostility, or with the 
Fespondent’s score on status striving, or with 
Voting behavior or level of political informa- 
tion as determined by an objective knowledge 
test (all this with appropriate controls ap- 
Pid -education, eg.; and in a context 
th other significant findings that suggested 

^t the failure of correlation was not 
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strictly methodological). It is conceivable, of 
course, that Sweden is a very special case 
where the effects of work alienation are tem- 
pered by the overall supportiveness of the 
surrounding social system—by the highly 
organized, relatively stable, fundamentally 
democratic, and economically advanced social 
order that modern Sweden represents. How- 
ever, entertaining severe doubts that Sweden’s 
socialism (such as it is) explained matters, I 
repeated the study both in the United States 
and France, with basically the same negative 
results. 

The American instance is doubly interesting 
since there were two separate studies, one of 
which sampled black workers only and focused 
on consequences (more precisely, correlates) 
in intergroup relations (for sampling details, 
see Ransford, 1968; and for the data on both 
blacks and whites, cf. Seeman, Bishop, & 
Grigsby, 1971). The consequences we had in 
mind were captured (again, with the aid of 
factor analysis) in three related, but discrim- 
inable, measures of the person’s attitude to- 
ward both Negroes and whites: (a) an index 
of black solidarity called “identification with 
Negroes” based on four items like ^I look at 
everything from a Negro point of view” or 
“The idea of complete integration bothers me 
because Negroes would have to give up some 
of the sense of belonging together; (b) “so- 
cial distance," a four-item Bogardus-type 
scale, ranging from marrying to eating with 
whites; and (c) “white hostility,” consisting 
of three items on the order of "Sometimes I 
hate white people." 

What is striking among these employed 
blacks is how little work alienation explains in 
comparison with another brand of alienation 
— namely, powerlessness. For both manual 
and white collar workers, out-group hostility 
and in-group solidarity were unrelated to 
work alienation, while the correlations with 
powerlessness were rather impressive. A spe- 
cific example: among manual blacks (AN = 
172), the correlation between identification 
with Negroes and work alienation was —.04, 
but for powerlessness, the correlation was .47 ; 
among white collar blacks (AN = 122), the 
same correlations were .17 and .37, respec- 
tively. These results repeat, in their own way, 
what the Swedish study found, for there, too, 
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powerlessness, but not work alienation, corre- 
lated significantly with ethnic prejudice; and 
the same tendencies, somewhat more weakly, 
are found in the French data and among 
American white workers. Nor is prejudice a 
unique variable; political knowledge works in 
the same way: low knowledge tends to go 
with high powerlessness, but is unrelated to 
work alienation. Thus, it is the powerlessness 
measure, not work alienation, that tells us 
who among the blacks will be hostile to whites, 
or who will be strongly identified with the 
black community as such; and who among the 
whites will reject Jews, or Estonians, or Ne- 
groes, or Algerians, and who will be politically 
uninformed. 

The sum of the matter is that alienation in 
work simply does not seem to make the dif- 
ference it is supposed to. Interpreting that 
fact is considerably less straightforward. Em- 
pirically inclined critics will note that the 
outcomes tested constitute an insufficient list 
—for example, I have gathered very little 
evidence about the quality of family life, 
which is where alienated work may take its 
toll; and (perhaps more importantly) the 
consequences of work alienation may reveal 
themselves in longer term, cumulative civil 
disorders—the wildcat strike or the student 
disorders from the Free Speech Movement 
through the events of May in Paris. Even 
here, however, the limited evidence suggests 
that it is powerlessness rather than the ali- 
enated labor of boring classrooms that pro- 
vides the steam of such protest. Recent (un- 
published) data from the University of Cali- 
fornia (Los Angeles and Berkeley) indicate 
that participation in political protest demon- 
strations is not related to complaints about 
the impersonality of the university or about 
the character of work in the classroom, but is 
related to the sense of low control. I am re- 
minded of the interpretation by Seale and 
McConville (1968) of the motives of thou- 
sands of French intellectuals who joined the 
protest of May: “it was a revolt against rul- 
ing bureaucracies, administrative machines, 
professional apparatuses. It found expression 
in an urge to run one’s own affairs . . . [p. 
226].5.* 

4 The findings in this recent (1969) student sur- 
vey by the University of California, Los Angeles, 
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The more philosophically inclined cri 
would, with Marcuse, turn this evidence g 
mine into the ultimate indictment, a: indig 
ment both of bourgeois social science 
ing given up, in its empiricism, any 
the critical truth about society, and : 
ment of a society that can manage .»e 
ploited worker so that he adjusts: a n 
(1968) put it, he lives “his sickness . 
without noticing that he is sick 
when he sees himself and is seen 
and normal [p. 250]." In these : 
failure to find consequences of wor! 
tion represents a subtler and deep: f 
sion, and the more formidable alien: n. 
is from his own false consciousness | òt 
worker ultimately must be nw Y 

This is not the place to debate these isst 
at length. I recognize both empirical limit 
tions and philosophic difficulties in the stan 
I have taken. But at least one thing ovght 
be made clear. An indictment of liena 
work does not depend upon the existence { 
negative consequences outside the work i 
I am certainly not convinced by th 
evidence that such consequences w 
found (the problem, of course, i 
whether there are such conseq 
what kind)—indeed, my own insti) 
assume that 8 hours a day at unr 
tasks cannot, as it were, pass unnoticed. 
consequences or not, if one defines self-# 
trangement as I do—as engagement in BW 
trinsically unrewarding activity—one is hard 
obligated to be uncritical about 8 conce 
trated hours a day of such estrangement si 
ply because many people learn to live with: 
That viewpoint may seem to be no difiere 
from Marcuse’s, but it puts a very differé 
ground under the matter—not least of 4 
because I find it unnecessary to make appe 
to the idea of “false needs" into which # 
masses are manipulated against their own tH 
nature. It seems to me a much more straigl 
forward matter of recognizing that pe 
hold differing attitudes about what is des 
able and what is possible in work, or € 


Survey Research Center are not unique, as i$ 
gested in the comment by Gales (1966): "Con 
erable dissatisfaction with many aspects of the W 
versity was found to exist, but was on the W 
unrelated to support for FSM [p. 14].” 
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where, and onc ot need the metaphysics 
of true needs ise consciousness to say 
9. 
F That metap imately is self-defeating 
- in two respect , as with Freud, there is 
the ambiguit) inpredictability that the 
position entai s one, for example, pre- 
dict aggressio t from the worker who is 
a "cheerful Mills, 1951, p. 233]?" 
Marcuse's an both “Yes” and “No,” * 
and although ied intention is to “offer 
some hypoth thing more [Marcuse, 
- 1964, p. xvii s not very clear how such 
double-edged eses can be tested. It 
should be un: l, however, that it is not 
s) much the ple of false consciousness 
- that is being ed here, but the careless 
and ideologic f it—in fact, I have used 
something ví ich like that principle to 
examine thre : of self-deception which, 
following Sart: 0), are taken as exam- 
- ples of “inau ity” (Seeman, 1966b). 
The second uty with the metaphysic 
- ef false needs it it encourages an under- 
- rating and ov »plifving of the character of 
men as they a casts the majority as one- 
dimensional : of a system which 


“claim(s) the 
only one dime: 


‘re individual. . . . There is 
9 n, and it is everywhere and 

in all forms [Marcuse, 1964, pp. 10, 11]." 
| And that brings me to my second dubious 
thesis—namely, the doubt I entertain about 
the unity or coherence of the syndrome of 
alienations from which contemporary men are 
said to suffer. On empirical grounds, I think 
that this image of a tight bundle of urban 
alienations is too pessimistic and too “totali- 
larian” by far. I refer to the image of a gen- 
tral and encompassing metropolitan malaise 
that combines the sense of alienation in work, 
powerless in politics, dislocation from com- 
E values, generalized distrust, and the like. 
i not doubt that all these sentiments some- 
E s occur together, but what impresses me 

ow discriminating, rather than “packaged,” 


5 
EL riga while arguing (as I have noted 
Vales at the frustration of alienated labor ac- 
am aggression, Marcuse (1968) added: “And to 
ae to which the society in its very structure 
Zens di aggressive, the mental structure of its citi- 
adjusts itself: the individual becomes at one and 


* same time more i i 
i e pliable and 
Submissive [p. eee and more pli 
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people can be in their responses. Some exam- 
ples are as follows: 

1. The correlation between the two brands 
of alienation already discussed—between work 
alienation and powerlessness—is regularly 
quite low. For manual workers in the various 
samples I have examined, the correlations 
are .02 in France, .05 in Sweden, .03 among 
whites in the United States, and .01 for black 
Americans. 

2. More interestingly, these two alienations 
correlate with other variables in opposite and 
discriminating ways. Thus, among the manual 
blacks, an index of occupational striving 
(composed of items asking, in effect, what 
values—friendship, family, health, neighbor- 
hood, etc.—the respondent would be willing 
to sacrifice for occupational advancement) 
correlated highly and positively (.53) with 
work alienation, but the correlation with pow- 
erlessness was negative (—.26): those who 
were engaged in highly routinized work were 
more ready to give up important values in the 
interest of improving their work situation, 
but those who felt generally powerless in 
controlling their own destiny were /ess willing 
to give up such values in the interest of mo- 
bility. It makes sense, but it is hardly a uni- 
tary package of alienations. 

3. I have already indicated that powerless- 
ness consistently correlates with intergroup 
attitudes, while work alienation does not. At 
the very least, if one is to make the argument 
that society somehow induces the worker not 
to “notice” his actual alienation, the argu- 
ment has to come to terms with the apparent 
fact that the worker seems to notice one kind 
of alienation more than another. 

4. Even without comparing alienations—if 
we keep an eye only, let us say, on powerless- 
ness—people typically reveal a rather dis- 
criminating alienation. It is not amy interest 
that is affected by the sense of low control 
(disinterest in international affairs is corre- 
lated with powerlessness, interest in discussing 
one’s work is not; cf. Seeman, 1966a). It is 
not any learning that is affected by powerless- 
ness—people who differ in their internal-ex- 
ternal scores show most sensitivity in learn- 
ing that involves control-relevant informa- 
tion: knowledge about politics is significantly 
correlated with sensed powerlessness; knowl- 
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edge about culture and the arts is not (See- 
man, 1967b). It is not any situation that will 
produce the negative effect of powerlessness 
on learning: among reformatory inmates who 
were offered information about prison pro- 
cedures respecting parole, powerlessness had 
no effect where the prisoners had long terms 
remaining to be served, but among the short 
termers (roughly 2 years or less remaining, 
where such information is situationally more 
relevant) low powerlessness was associated 
with significantly better learning (Seeman, 
1963). These are among the evidences that 
make me wonder about the totalized versions 
of alienation (usually thoroughgoing in their 
pessimism as well) that abound in the litera- 
ture. 

I wonder, too, about a third thesis that 
reflects the same pessimism, but concerns 
still other brands of alienation, namely, the 
thesis of the “lost community.” It is a thesis 
which reflects nostalgia over the stable life of 
the preindustrial world, where there was a 
local community and an extended family to 
provide comprehended and shared social 
bonds. There is probably a large element of 
romanticism in this recollection, but the un- 
recoverable history aside, how do matters 
stand today with regard to community? That 
depends, of course, on what you mean by 
"community," there being, it seems to me, 
three different types of social bond that the 
word regularly refers to. Community means 
either (a) a common fate, determined by ob- 
jective interdependence and interaction, often 
(but not necessarily) territorially delimited 
(e.g. the ghetto community or the university 
community); (5) a common identity, deter- 
mined by personal sentiments of belonging 
and attachment—what we usually mean when 
we speak of the “sense” of community (again, 
by no means necessarily tied to a locality, but 
also to family, neighborhood, or work col- 
leagues wherever located); or (c) common 
aims, determined by agreement in value— 
that is, a community more or less in the sense 
of shared norms and goals. 

It is the latter two that commentators 
chiefly worry about—on the one hand, the 
sense of loneliness and rootlessness in the 
urbanized society, and on the other hand, 
the danger that with the loss of traditional 
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values we “can no longer achieve the consen 
sus needed for collective life | Keller, 1968, 
55]." These concerns are about tho alien 


tions I have elsewhere (Seeman, 1^ 


“social isolation" and “cultural rang 

ment"—the former being definable in ‘erms oj 
expectancies for social acceptance exclu 
sion, and the latter in terms of si rein 
forcement values in a given circle ciety. 
My doubt is about the validity oí pessi- 
mism on these matters that is pical 
among mass society theorists. Ot! have 
long since raised similar doubts; wing 


the evidence on kinship, neighbor 
friendship ties, Greer (1962) conclu 


“few urban subareas approach the ano» ymity 
and social fragmentation of the stereo’: pe [p. 
97 | vi 

My own data make me think that s not 
the “lost community" (ie. the abs. ce 0 
sentiments of social solidarity) thai is 
urban problem, but the sense of “lost control 
(i.e powerlessness). I find very i 
dence that people are terribly concer sed about; 
their level of social engagement, even 


where there is detachment, there is very little 
evidence that it means for the urban dweller 
what we apprehensively suspect. For example, 
in a sample of 400 Los Angeles workers, only 
10% agreed with the statement “I often feel 
lonely”; in another sample in Los Angeles, all 
black, less than 20% disagreed with the state- 
ment “I am very popular with my friends.” 
When the questions concern the person’s 
occupational community, 50% of both the 
manual and nonmanual workers say that they 
have close friends in the same line of work; 
53% have friends at the same work place, 
and there is evidence that these friendships 
involve a good deal of off-the-job socializing. 
More significantly, the presence or absence of 
the sense of community as represented in 
these indexes on generalized loneliness or 
occupational collegiality seems to have very 
little to do with the negativities that we feat 
may result: for example, in the Los Angeles 
data, the score on occupational community 
does not even correlate with job satisfaction 
(—.09) or with the Srole anomia scale (which 
I interpret as a measure of generalized des- 
pair and distrust; the correlation was —.02); 
much less with ethnic prejudice or status 
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“striving. The same is true, apparently, at the 
university: for example, questions like “How 
lonely were you during the past month?” (and 
yarious related tions on loneliness) show 
no relation to student activism in the study 
at the Universit California, Los Angeles 
and Berkeley, ! igreement or disagreement 
with the pow item, “Becoming a 
- success is a ma of hard work; luck has 
- little or nothi: lo with it" discriminates 
the activists fi he nonactivists." In sum, I 
think that we ! tended to overestimate the 
extent and sig nce of personal isolation, 
and we have : estimated the satisfactions 
and positive fi ins that people can find in 
weak social t inovetter, 1969). 

Perhaps we ! © overestimated the ex- 
tent of our t with community when 
that is taken mean value consensus. I 
know that it i w fashionable to condemn 
the so-called ' nsus school" of sociolo- 
gists and hi is (eg. Parsons and 
Schlesinger) fo: underplaying the elements of 
conflict and di ition in American life, and 
Iam not eager to be a latecomer to that es- 
tablishment. 5: ! am intrigued to discover 
consensus of : d emerging out of a set of 
Studies in Los eles that show a fairly high 
degree of agreement in values and attitudes 
across the major lines of cleavage that are so 


fundamental in our thinking about urban life: 

*In various college studies, high powerlessness 
tends to go with activism (as it does in these data). 
But low powerlessness is associated with militance 
among blacks, At least two important distinctions 
are needed to help interpret such findings. First, and 
Most obviously, there is the meaning of militance 


that can sometimes refer to activism that is rela- 
tively planned, organizationally based, and instru- 
mental as compared with the more or less spon- 
ag and expressive militance that we have seen in 
civil disorders. One would expect that low 
Powerlessness would encourage the former and high 
Ee the latter (cf, Ransford, 1968, for con- 
m of the tie between high powerlessness and 
2 * proneness), The second distinction concerns 
inds of powerlessness. There is some evidence that 
onis, discriminate between what might be called 
enl versus. "personal" powerlessness—between a 
Am m (of bureaucracies, racial discrimina- 
Ee) that is unresponsive and hard to move, 
kil personal world (of friendships, talents and 
Mn S that is a quite different matter of per- 
n, Hcacy (cf. Gurin et al, 1969, for a related 
tne These distinctions have not been well 
» Or even well conceptualized, as yet. 
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across race; sex, occupation, etc. To a degree 
that I certainly would not have expected, the 
blacks and whites, the manual and nonmanual 
workers, fathers and mothers, suburban and 
central city residents show little difference in 
their views concerning a wide range of issues 
—for example, the values they would like to 
see their children accomplish in life and in 
school; the importance of status striving for 
themselves; sentiments of generalized trust in 
others; preferences in family decision-making 
processes; commitments concerning conform- 
ity, materialism, the primacy of self-interest 
over social welfare; and the like.’ 

These consistencies (especially those across 
race and social class) made us begin to have 
second thoughts about that popular phrase 
“internalizing middle-class values.” I do not 
intend to press the point concerning value 


+ 


similarity too hard—for one thing, because | 


there are numerous studies that report im- 
portant differences (e.g., social class differ- 
ences in the deferral of gratification, or the 
emphasis on security rather than creativity in 
work), But I do intend to caution by way of 
these data that the theme of value alienation 
can be easily exaggerated—partly because 
what looks like value difference is often 
enough a difference in expectancies or a dif- 
ference in one’s perceived circumstances, 
neither of which says very much about values. 
Thus, in our own data, we do indeed find dif- 
ferences—for example, black fathers and 
mothers are much more likely than whites to 
say that “reaching a higher social standing” 
is a very important accomplishment desired 
for their child; blacks tend to score more ex- 
ternal on the internal-external items; and 
one could proceed to numerous other differ- 


7In a summary of their research on black and 
white youth and their parents, TenHouten and Ten- 
Houten (1970) wrote: “Thus, black and white fami- 
lies were found to differ little in their basic con- 
jugal and parental role behaviors. . . . In general, the 
youth in this study present a rather consistent pic- 
ture across race and socioeconomic level. They ap- 
pear to have reasonable plans and aspirations, de- 
cided confidence in themselves and in education as a 
means of success. In general, they show a non- 
pathological set of attitudes toward themselves and 
their families. Their attitudes toward and reported 
behavior toward their present jobs show little vari- 
ation by race and family socioeconomic status [pp. 
2, 16]." (Cf. also TenHouten, 1970.) 
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ences of a similar kind—for example, expres- 
sions of lack of confidence in the equality of 
justice or in the responsiveness of political 
institutions. But it is hardly likely that such 
differences tell us very much about the ab- 
sence of a value community: they tell us more 
about the unequal distribution of status and 
power in the community, and about the prob- 
lem that people have of getting into a position 
to achieve what are the commonly recognized 
values or to influence the subtlety of their 
shading in American life. That is not to deny 
the existence of a youth counterculture, or to 
ignore the revulsion that some intellectuals 
express for conventional Americana, but there 
is no advantage either in confusing (as I 
think often happens) power claims and con- 
flicts, or unhappy assessments of one’s life 
chances, with alienation in basic values. 
Thus far, I have implicated four different 
varieties of alienation in these remarks—self- 
estrangement in work, feelings of powerless- 
ness, social isolation, and value estrangement, 
and I have exhausted neither the varieties of 
alienation nor the doubts that can be raised 
about their use in analyzing the social psy- 
chology of our time. I have not touched, for 
example, upon one of the central themes in 
mass society theory; namely, on the impor- 
tance of autonomous voluntary organizations 
that can mediate between the individual and 
the massive state or corporation, providing 
some means of exerting influence, and some 
sense of control in a large scale, bureaucrat- 
ized society. The dim view expressed fairly 
recently by C. Wright Mills (1959) holds that 
these intermediate associations (the labor 
unions, the professional bodies, etc.) have 
themselves now become agents of further ali- 
enation, We find, he asserted, “no intermediate 
association in which men feel secure and with 
which they feel powerful [p. 308]”; they have 
been replaced by “centralized organizations 
which become in turn new means of power 
against the individual [p. 308]." But that 
does seem an overly dim view when the evi- 
dence shows that in the three quite different 
countries I have examined—in France, Swe- 
den, and the United States, where labor un- 
ions play substantially different roles in widely 
varying cultural contexts, membership in such 
mediating organizations is regularly associated 
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with lower scores on the powerless: cale 
suggesting that the mediation effec! ven 
been lost altogether (cf. Neal & Sceman 
1964; Seeman, 1966a). i 
The thread, I suppose, that bi thes 
doubtful theses I have enumerated i> my re 
sistance to the “one-dimensional pe:s'mism® 
(Graubard, 1968) that typifies « ich ol 
the literature on alienation. It is a style 
builds in assumptions concerning a’ cnation 
that are open to serious question s one 
can see, assumptions about the rena 
of various alienated attitudes, about the gem 
eralization of these attitudes across ‘he indi 
vidual’s life space, about the defeativ effe 
of large-scale organization, about i iabili- 
ties of social isolation, about the scope a 
intensity of value conflict, about the centrali 
ity of work experience for the per self- 


image, about the profound frustration 4 
underlies the daily experience of the « “lin 
worker, and the like. I do not see, ‘ul 
ample, why one cannot recognize the d 
monotony and physical imposition of | 
manual labor, while suspecting at the 
time that there is a substantial con f 
subtle snobbery involved in the inte ctual’s 
dismal view of much manual work. 1 : 
think that many of us would carry this more 
optimistic perspective to the lengths attributed 
by John Fowles (1969) to his protagonist, the 
Victorian gentleman of The French Licuten- 
ant's Woman, who concluded that “the lower 
orders were secretly happier than the uppel 
They were not, as the radicals would have one 
believe, the suffering infrastructure groaning 
under the opulent follies of the rich; but 
much more like happy parasites [p. 292].” 
It would take an intolerable optimism and 
a blind social psychologist not to see the gravi 
ity of our problems, the public resistance to 
their resolution, and the massive inattention 
to them that a technicized social science al 
lowed. If some current models are too one 
dimensional and pessimistic by a good dea 
the fact remains that there were concentratio l 
camps, that there are disreputable qualities in 
American life (including a widely disrespected 
war, at the moment), and there will be furthe 
urban violence born of frustrated hopes ang 
an unrectified sense of powerlessness thal 
many people, white and black, rightly fee 
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i, present, and future create 
r the sociologist interested 
alienation, and for social 
ally. It is the tension be- 

tween keepii ofessional craft that is 

worthy of t! ne, and at the same time 

making inve s that illuminate on a 

scale comme; e with this known history 


That kind o 
a special tens 
in the prob! 
psychologists 


and likely fu! 

The tensi: es dangers on both sides: 
the danger i: erving the craft is that the 
work deterior into a kind of alienation in 


technical rules, limited in 
»ersonal involvement, and 
\ careers, The danger on the 
that this deteriorates, too 


itseli—bour 
vision, devoid ^í 
largely orie: 
side of rele 


—into a subile anti-intellectualism that is 
impatient v anything but the immediate, 
or into a kint self-indulgence that empha- 
sizes stance analysis so that what be- 
comes crucial is one’s identification as a radi- 
c realist humanist, or whatever. My 


secularization of work on 
;tinue to be achieved while 
'se dangers, which is to say 
lemonstration can be success- 
scope and human concern. 


hope is tha 
alienation ca 
avoiding bo! 
that clarity an 
fully wedded to 
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RESPONSIBILITY, REPARATION, AND SELF-PROTEC7 


AS REASONS FOR THREE TYPES OF HELPING ' 


NANCY N. THALHOFER * 


Department of Sociology, Eastern Michigan University 


A 2X4 X 2 factorial experiment, using 96 male and 96 female undergraduates 
assessed the effects of uniqueness of another's dependency, involvement in 
harm, and sex on three types of helping: help relevant to dependency, help 
relevant to harm, and help irrelevant to both dependency and harm. Types of 
help were found to be specific to appropriate conditions, rather than inter- 
changeable. Unique dependency produced more help relevant to dependency 
than nonunique dependency (p< 01), a result interpreted in terms of dif- 
fusion of responsibility. Help relevant to harm was greater with greater 
degree of the subject's involvement as the source of harm (P < 001), a result 
explained as reparation. Differences in involvement in harm produced differ- 
ences in irrelevant help (p < 05). The form of these differences was interpreted 
as self-protection. Sex interacted with uniqueness of dependency for help 
relevant to dependency (p < 05), with a greater difference between uniqueness 
of dependency conditions for males than for females, This was interpreted, with 
further evidence, as indicating that females subscribed more to a norm to help 


N 


those dependent, regardless of the condition of this dependency. 


Helping has been attributed to a social 
responsibility norm, to help those dependent 
or in need, and a number of studies have 
shown more help given for conditions of high 
than low dependency of another (Berkowitz 
& Daniels, 1963; Berkowitz, Klanderman, & 
Harris, 1964; Goranson & Berkowitz, 1966). 
The social responsibility norm was assumed 
to represent altruism rather than externally 
imposed demand, because help occurred with- 
out expectation of return benefit (Daniels & 
Berkowitz, 1963) and appeared indepen- 
dently of symbolic reward (Berkowitz et al., 
1964), although Jones (1970) indicated less 
help to one dependent if the helper had more 
choice whether to help or not. 

Other work has identified uniqueness of a 
potential helper’s situation, when he alone 
can give aid, as an important variable. Re- 
searchers have found that both the amount 
and speed of reporting an emergency were 
lower with larger group size (Darley & La- 
tané, 1968) and that subjects were less likely 


1 This study is based on a dissertation submitted 
to the University of Michigan. Appreciation is due to 
the dissertation chairman, Robert B. Zajonc, and to 
James M. Dabbs and John P. Kirscht for their en- 
couragement and suggestions. 

? Requests for reprints should be sent to the 
author, Department of Sociology, Eastern Michigan 
University, Ypsilanti, Michigan 48197. 


to give aid in or report an emerg: when 
with others than when alone (Lats: Dar- 
ley, 1968; Latané & Rodin, 1969). These find- 
ings were interpreted both in terms of the 
diffusion of responsibility possible wher many 
people, rather than one alone, are aware of an 
emergency and the greater inhibition of action 
in front of others, to avoid the embarrassment 


of overreacting to an ambiguous situation. In 
a field experiment, Piliavin, Rodin, and Pili- 
avin (1969) observed no differences in help- 
ing according to the number of people who 
witnessed an emergency in a subway, possibly 
because there were never only a few indi- 
viduals in the coach and because a high inci- 
dence of help occurred to the victim, who was 
both clearly visible in his plight and in close 
proximity to bystanders. Schwartz and 
Clausen (1970) found similar evidence that 
a greater number of potential helpers did not 
reduce helping, except when someone who 
seemed more competent or more likely to take 
initiative in helping was among that number. 
Thus, determinants of diffusion of responsibil- 
ity have been modified to include more than 
sheer number of others available to help. 

A number of studies have identified repa- 
ration, for which the victim of harm is repaid 
by the harm-doer, and guilt, when the harm- 
doer atones for a misdeed without compensat- 
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reasons for helping. Ber- 

r (1967) found that the vic- 
lting from one's own error 
ften for compensation than 
but that someone in need 
ften for compensation than 

| need, The concept of guilt 
nan, Wallington, and Bless 
for a subject's compliance 
help from the experimenter 
: had lied or caused incon- 
ther. Carlsmith and Gross 
vidence for helping derived 
restitution and guilt, when the 
first caused harm, but not from 
en need of another was the 
ever, Rawlings (1968) re- 
ence in help to a victim ex- 

c shock according to whether 
ed this distress or merely 

id Tilker (1970) presented 
'ne's responsibility for admin- 
nd degree of protest feedback 

` shock, indicating his need, as 


ariables incre:sing help to the victim. 
One unanswered question concerns the 


substitutability of one type of help for an- 
imple, different studies have 
veral different responses from one 
Oo caused harm to another: noncompensa- 
ory help to the victim (Carlsmith & Gross, 
969), irrelevant help to a nonvictim (Freed- 
man et al., 1967), and compensation to the 
victim which reverses harm (Berscheid & 
Walser, 1967). The degree of substitutabil- 
among types of helping appropriate to 
terent conditions was examined here by 
allowing the subject the opportunity for sev- 
_ “al different types of helping, as well as 
choosing whether to help or not. 

A second question concerned extending the 
ditions for which the number of potential 
elpers makes a difference in incidence of 
helping, Studies of this variable have used 
Quite similar basic experimental situations: an 
emergency involving illness or injury for 
ch subjects are free to assume responsibil- 
l. It seems plausible that the number of 
tential helpers would also make a difference 
more commonplace situation, when there 
NO emergency, but only another's depen- 
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dency as the target for helping. Further, if 
the number of helpers is important, it would 
make a difference in helping even when a 
specific request for help, which could increase 
overall obligation to aid, is made, rather than 
leaving subjects free to assume this responsi- 
bility. 


METHOD 
Overview oj Design and Procedure 


A2X4X2 design was used to assess the effects 
of three variables on three forms of help. Indepen- 
dent variables were uniqueness of dependency, in- 
volvement in harm, and sex of the subject. Depen- 
dent variables were help relevant to dependency, 
help relevant to harm, and help irrelevant to both 
dependency and harm. A boy, attending a school 
for mentally disturbed children, was both dependent 
and harmed according to experimental conditions. 
Subjects could either offer or not offer each of the 
different types of help when given a specific oppor- 
tunity to do so. 


Subjects 


Subjects were 96 male and 96 female introductory 
psychology students at the University of Michigan, 
who participated in experiments as part of the 
course requirement. 


Procedure 


The experiment was entirely self-administered to 
groups of subjects in booklet form, with experimental 
conditions given as written information. Subjects 
were randomly assigned to conditions, with 24 sub- 
jects in each of the eight experimental groups and 
sex fairly evenly divided within each group. 

Uniqueness of dependency. Half of the subjects 
received the following unique dependency instruc- 
tion: 


No one in the group of people you see around 
you, except you, will be reading this material and 
information about this particular boy. You should 
keep this in mind when you read the material and 
answer questions about it. You alone are reading 
about this particular boy. 


The other half of the subjects, assigned to the 
nonunique dependency condition, were informed in 
the same way that many others in the group besides 
themselves were to read material about the boy. 
All subjects read a report from the director of a 
school for disturbed children, explaining a problem 
concerning a young boy: that one of his teachers had 
complained about his behavior and had recom- 
mended that he be removed to a more punishment- 
oriented school. The staff of the school felt that this 
recommendation was a mistake and was due to the 
teacher’s inexperience and dislike for the boy. All 
subjects read an interview with the teacher which 
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presented a derogatory picture of the boy and were 
asked to rate the boy on a scale from 1 to 15 on 
the likelihood that he would benefit from his school 
experience. This rating was the cause of harm to 
the boy in one involvement in harm condition. 

Involvement in harm. For a condition in which 
the subject was the source of harm, the subjects 
read the following: 


Unfortunately, the teacher's recommendation 
and complaints about the boy have forced our 
board to act. The board has agreed to randomly 
select one rating by an impartial judge to deter- 
mine whether the boy stays here or is transferred 
to a school which is oriented more toward punish- 
ment and rehabilitation than therapy. The thera- 
pists and staff feel that transfer at this time would 
be most detrimental to the boy, because he has 
made great progress in his short time here, and 
transfer would probably cause him to revert to 
former behavior, After all, he is a mentally dis- 
turbed child. 

However, we are sure he will be rated favorably 
enough by an impartial judge, because this judge 
will be given all the information about the situa- 
tion: the boy's problem, the teacher's inexperience, 
and the staff's general reaction to it. Also, most 
people can spot inconsistencies in the teacher's 
description and are made suspicious by the ex- 
tremity of some of her statements. Therefore, we 
have agreed to let this rating decide. 

Your rating will be used for this purpose. Please 
write the number of your rating here. 


On another page, the information continued. 


The cutoff point our board has decided upon, 
taking into consideration the nature of the situa- 
tion, is a rating of 12. If your rating is 12 or over, 
the boy remains at this school, If your rating is 
below 12, because of this low rating, the boy will 
be dismissed from here and transferred to the other 
school. 

We could not tell you before that your rating 
would be used for this purpose, because we wanted 
a fair and unbiased judgment from you. 


A condition in which another, similar to the sub- 
ject, was the source of harm was introduced by the 
same first paragraph as the condition above. The 
subject then read: 


We thought the impartial judge chosen, who is a 
second-semester sophomore at the University of 
Michigan, would rate the boy favorably enough, 
because he was given information about the situ- 
ation: the boy's problem, the teacher's inexperi- 
ence, and the staff’s general reaction to it, Also, 
most people can spot inconsistencies in the teach- 
er's description and are made suspicious by the 
extremity of some of her statements. However, he 
has given the boy too low a rating. The cutoff 
point our board decided upon, taking into con- 
sideration the nature of the situation, was a rating 
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of 12, and this judge's rating was below ‘hat. Be- 
cause of this low rating, the boy will be -:smissed 
from here and transferred to the other xol. 


For a condition in which an extereo! jactor, 
bureaucratic machinery, was the source o m, the 
subject read the same first paragraph as cribed 
in the subject-harms condition, except t the 
board was presented as having alread; le the 
transfer decision by itself without any j s rat- 


ing. This information followed: 


However, there is really nothing we ca: 
the board's decision is made because o! 
our administration is set up. It woul 
least a year to get the bureaucratic m 

operation which could reconsider or : 

decision. Because of this decision, the ! 

dismissed from here and transferred t 

school. 


For a control condition, in which no 
mentioned, the subject read: 


aints 
serious 


The teachers recommendation and cor: 
about the boy have made our board gi 


consideration to the situation. To avoid any 
possible detrimental effects on the boy oard 
has decided to transfer him to a near! hool 
which is exactly like ours in terms of i enta- 
tion toward therapy. Two of the people ! who 
have worked closest with him will be tr:v-ferred 
along with the boy. The therapists an ff feel 
that this transfer at this time would cen» the 
boy's progress because of superior facilities and a 
more supportive atmosphere at the othe: school, 
as well as avoid any residual negative tional 
climate here. After all, he is a mentally disturbed 
child. Because it is a more favorable situstion for 


him in every respect, the boy will be trinsierred 
to the other school, 


Following this, a few minutes of interpolated activ- 
ity, a matching-letters-to-numbers task, was re- 
quired of all subjects, A short statement before in- 
troduction of the dependent variables reminded the 
subjects that the boy was either uniquely or non- 
uniquely dependent upon them. 

Help relevant to dependency. Subjects were given 
a chance to respond twice in terms of help relevant 
to dependency, First, if the subjects were willing to 
give either some time or money to help the boy, 
they were asked to fill out a form giving their name, 
address, and telephone number so they could be 
contacted about their offer to help. Second, subjects 
could give a few dollars which were due them for 
their participation in the study to either themselves 
or the boy, indicating the destination of this money 
by checking a box. 

Help relevant to harm. Help relevant to harm was 
the opportunity to rerate the boy on the same 1-15- 
point scale which was used previously in rating: 
Help was measured as the amount and direction 9 
change of this rating relative to the previous one 


- was free to as 


- troduced as anc 


REASONS FOR THREE Types or HELPING 


indicating help. The subject 
at his second rating might 
he transfer decision.’ 
urm and dependency. Assess- 
«nt to both harm and depen- 
: end of the experiment and 
»ething outside the experiment. 
the chance to respond twice in 
lp. First, they were asked to 
would be willing to participate 
in future exp without pay or academic 
credit, Second, jects were told they had no 
further obligat i could leave, but if they were 
willing to comp! an additional questionnaire, in- 
tudy, they were told that this 


with positive change 


have some effec 
Help irreleva: 
ment of help i 
dency was mad 
was presented 
Subjects were gi 
terms of irrelev: 
check a box if 


would be appre 


RESULTS 

Uniqueness o endency and Help Relevant 
to Dependenc 

The two measures of help relevant to de- 
pendency were offering time or money and 
offering money Jue oneself. To obtain interval 
data amenable to parametric test, an index 
was constructed by assigning numerical scores 
to the combination of both measures of help 
for each su! ^ score of 1 was given for 
no help on either measure, a score of 2 for 
help on one o! the two, and a score of 3 for 
help on both. The degree of relationship be- 


tween the two measures of help, assessed by 


the phi coefficient, was .45 (p< .01). 

Table 1 presents the means for the index 
of help relevant to dependency for each of 
the Uniqueness of Dependency X Involvement 
in Harm x Sex of Subject conditions. Analy- 
Sis of variance results are summarized in Ta- 
ble 2. There was an effect due to uniqueness 
of dependency , with more help offered for con- 
ditions of unique than nonunique dependency 
(F = 729, dí = 1/176, p < .01). There were 


"The implication that a second rating by the 
Subject would affect the harm was probably most 
E When the subject was the source of harm. 

he subject might presume a weaker connection 
m his second rating and the harm for the 

er experimental conditions of involvement in 
on e a mistake had been made, and any 
c Table information about the boy might be 
eat in a review of the harmful decision, Al- 
oig implied reversibility of harm by the second 
in 3 Was not varied independently of involvement 
hip i Situations of reparation are similar, in that 
An 9 reverse harm can often come only from one 

9 is the source of harm. 


TABLE 1 


MEAN SCORE FOR AN INDEX OF HELP RELEVANT TO DE- 
PENDENCY FOR UNIQUENESS OF DEPENDENCY X IN- 
VOLVEMENT IN HARM X SEX OF 
SUBJECT CONDITIONS 


iti Sub- | Other Red- | Ni 
Condition pest | harms stave | harm | Total 
Unique dependency 
Female 2.17 | 2.25 | 2.23 | 238 | 9.03 
Male 2.33 | 2.33 |-2.00 | 1.91 | 8.57 
Both sexes 4.50 | 4.58 | 4.23 | 4.29 | 17.60 
Nonunique dependency 
Female 2.27 | 1.89 | 2.07 | 1.75 | 7.98 
Male 1.85 | 1.60 | 1.80 | 1.83 7.08 
Both sexes 4.12 | 3.49 | 3.87 | 3.58 | 15.06 
Both conditions 
Female 4.44 | 4.14 | 4.30 | 4.13 | 17.01 
Male 4.18 | 3.93 | 3.80 | 3.74 | 15.65 
Both sexes 8.62 | 8.07 | 8.10 | 7.87 | 32.66 
TABLE 2 


ANALYSIS OF VARIANCE OF AMOUNT OF HELP MEA- 
SURED BY AN INDEX or HELP RELEVANT. 
TO DEPENDENCY 


Source df MS F 

Uniqueness of depend- 

ency (A) 1 | 474 7.209€ 
Involvement in harm (B) 3 ia RE 
Sex of subject (C) 1]| 1.30 2.00 
AXB 3:1 «39 <4 
AXC 1 | 4.15 6.19* 
BXC 3| .08 | <1 
AXBXC 3117 567 1.03 

Error 176 | .65 

* p <.05, 

hp c.01 


no differences attributable to the involvement 
in harm variation. 


Involvement in Harm and Help Relevant 
to Harm 


The measure of help relevant to harm was 
a change score between the first and second 
ratings of the boy. Table 3 presents the mean 
change score for each of the Uniqueness of 
Dependency X Involvement in Harm X Sex 
of Subject conditions. Analysis of variance 
results are summarized in Table 4.4 Although 
an effect due to involvement-in-harm condi- 
tions was observed (F — 9.33, df — 3/169, p 
< .001), the size of the mean of the no-harm 


* Because change scores could not be obtained for 
seven subjects who failed to make the second rat- 
ing, these subjects were not included in the analysis. 
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TABLE 3 


MEAN Amount oF HELP RELEVANT TO Harm MEA- 
SURED BY CHANGE SCORES FOR UNIQUENESS OF 
DEPENDENCY X INVOLVEMENT IN Harm X SEX 
oF SuBJECT CONDITIONS 


Sub- | Red- 
iti : Other No 
Condition ject .| tape Total 
harms | Parms | harms | harm. 
Unique dependency 
Female 2.92 | 1,91 1.23 | 4.92 | 10.98 
Male 3.08 | 2.08 —.64 | 2.40 6.92 
Both sexes 6.00 | 3.99 +59 | 7.32 | 17.90 
Nonunique dependency 
Female 4.27 | 2.43 | —.14 | 3.00 9.56 
Male 3.09 | 1.60 1.40 | 2.67 8.76 
Both sexes 7.36 | 4.03 1.26 | 5.67 | 18.32 
Both conditions 
Female 7.19 | 4.34 1.00 | 7.02 | 20.54 
Male 6.17 | 3.68 416 | 5.07 | 15.68 
Both sexes 13.36 | 8.02 1.85 |12.99 | 36.22 


condition warrants explanation. The mean for 
this condition was nearly as high as the mean 
in the subject-harms condition, although a 
condition of no harm should yield the least 
help if help relevant to harm, as seems plausi- 
ble, diminishes with less involvement of the 
subject in harm. However, even with the high 
incidence of help in the no-harm condition, a 
Schefíé post hoc comparison showed more 
help when the subject was the source of harm 
than for all other involvement in hard condi- 
tions combined (p < .05). Because the no- 
harm condition was a control condition and 
presented the boy's present situation and his 
transfer as beneficial to him, whereas all ex- 
perimental conditions of involvement in harm 
described the boy's circumstances as unfortu- 
nate, the measure of help relevant to harm, 
which was a rating of how beneficial the boy's 
school experience would be, might have re- 


TABLE 4 


ANALYSIS OF VARIANCE OF AMOUNT OF HELP MEA- 
SURED BY CHANGE ScorEs FOR HELP RELEVANT 
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TABLE 5 
MEAN SCORE FOR AN INDEX OF IRRELEVA TELP FOR 
UNIQUENESS OF DEPENDENCY  INVOLY CENT IN 
Harm X Sex or Supyect Conditions 
7 ] 
Sub- Red- 
E : Other | 
Condition "ct f .| tape |, Total 
er harms harms | hr 
m | 
Unique dependency | 
Female 2.00 | 2 | 8,54 
Male 2.00 | 1.83 7.92 
Both sexes 4.00 91 | 16.46 
Nonunique dependency 
Female 2.36 | 1,89 | 2.14 8.14 
Male 2.08 | 1.60 | 2.30 | 815 
Both sexes 4.44 | 3.49 | 4.44 16.29 
Both conditions | 
Female 4.36 | 3.97 | 4 16.68 
Male 4.08 | 343 4.39 | 16.07 
Both sexes 8.44 7.40 8. | 32.75 
flected the greater favorableness oí boy's 
situation in the no-harm condition. With the 
no-harm condition weighted at zero, «14 thus 
not contributing to the trend described by 
involvement in harm conditions, :cheffé 
post hoc comparison showed that the config- 


uration of differences indicated less help was 
given as the subjects involvement as the 
source of harm was less (5 < .01). 


Involvement in Harm and Irrelevant Help 
The two measures of irrelevant help, of- 
fering to participate in future experiments and 
remaining to participate in a nonrequired ex- 
periment, were combined into an index score 
in the same manner as the two measures of 
help relevant to dependency. The phi co- 
efficient, assessing the relationship between 


the two measures, was .23 (5 « .01). Means 
for the index of irrelevant help for each condi- 


TABLE 6 
ANALYSIS OF VARIANCE OF AMOUNT or HELP MEA- 


TO HARM SURED BY AN INDEX OF IRRELEVANT HELP 
Source df | MS F Source | a | Ms | F 

Uniqueness of depend- Uniqueness of depend- | | 

ency (A) — 1 Al} <1 ency (A) 1 00 1 
Involvement in harm (B) 3 |81.54 9.33* Involvement in harm (B) 3 | 1.11 2,85* 
Sex of subject (C) 1|1653| 1.89 Sex of subject (C) i 1 2k | «1 
AXB 3 | 469| «1 AXB 3 | 30] 100 
AXC 1| 743| <1 AXC | 1| 24] <1 
BC 3 | 360| «1 BXC 3| 36| «1 
AXBXC 3 |1556| 178 AXBxC 3 47 1.20 

Error 169 | 8.74) Error 176 | 39 

| | 


| 
| 
| 
| 


*p <.001. 
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tion appear in Table 5. Table 6 presents a 
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TABLE 7 


MEAN SCORE FOR AN INDEX OF HELP RELEVANT TO 
DEPENDENCY OFFERED By MALES AND FEMALES FOR 
UNIQUENESS OF DEPENDENCY CONDITIONS 


summary of analysis of variance results for 
this index. The analysis showed an effect for 
involvement in | conditions (F = 2.85, 
dj = 3/176, p 5). The Scheffé method of 
post hoc comparison showed the following 
configuration differences by involvement- 
in-harm condi! , in descending order of 
magnitude of ^-bing: red-tape harms, sub- 
ject harms, no harm, and other harms (p < 
05). 
Sex of Subject and Helping 

Although therc were no main effects due to 
sex for the major dependent variables, there 
was an intera between sex of the subject 
and uniqueness of dependency (F = 6.19, df 
=1/176, p < ; see Table 2). The means 
related to this ‘teraction are presented in 
Table 7. The action indicated the follow- 
ing: Althoug i‘ sexes offered more help 


for unique than nonunique dependency condi- 
tions, females save more help than males for 
the latter conditi There were sex differ- 
ences for some questions concerning the sub- 
jects’ reactions to the experiment and their 
subscription | Iping norms. Females liked 
the boy they read about more than males 
(F = 13.54, dj = 1/171, p < .001); and fe- 
males more than males thought both that one 
should help another he has harmed (F = 
12.98, dj = 1/172, p< .001) and that one 
Should help another in need (F — 13.72, df 
= 1/170, p < .001). 


n. 


Relationships among Dependent Variables 


The measure of help relevant to harm, a 
change score, was uncorrelated with the index 
of help relevant to dependency (r = .025, n 
= 183, ns) and showed a significant negative 
Correlation with the index of irrelevant help 
(r= ~.168, n = 183, p < .05). The index of 
help relevant to dependency was highly cor- 
related with the index of irrelevant help (r = 
446, n= 189, p < .001). 


Discussion 


Type of Helping as Specific to Condition 


gd relationship among the different 
rms of helping would suggest that type of 


Condition Female | Male E. 

Unique dependency 2.26 244 4.40 
Nonunique dependency 2.00 1.77 3.77 
Total 4.26 3.91 8.17 


help was specific to conditions. Although help 
relevant to harm had no positive relationship 
with the other types of help, there was a high 
degree of relationship between irrelevant help 
and help relevant to dependency. Possible ex- 
planations for this were that both measures 
were derived from requests for volunteered 
aid, and that both kinds of help may be 
related to feelings of responsibility, since they 
both presumably filled some need. In spite of 
this relationship, the observed differences in 
helping were appropriate to conditions. Since 
seemingly inappropriate help has been ob- 
served in other research, this suggests that an 
inappropriate form of help occurs if it is the 
only form of help possible. 


Diffusion of Responsibility and Uniqueness 
of Dependency 

One explanation for the effect of uniqueness 
of dependency on help relevant to dependency 
was that responsibility to help is diffused 
when others as well as oneself are in a position 
to help. Responsibility can be conceived as 
full responsibility when only one person’s 
help is required to fill the need of another, 
and many helpers could cause confusion or 
harm. This is the situation of many emergen- 
cies, and this kind of situation was created in 
most of the research varying the number of 
others able to help. For the present study, 
many helpers would be appropriate rather 
than redundant, because the boy could pre- 
sumably use as much money and time as 
many subjects could offer. Therefore, the 
results suggest that diffusion of responsibility 
can be applied to a situation of nonemergency, 
in which the obligation is less than that of 
full responsibility: responsibility to do some- 
thing to aid another, even a token gesture, 
without canceling the other’s need. In addi- 
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tion, the difference obtained here between 
uniqueness of dependency conditions when a 
specific request for help occurred showed the 
apparent importance of the uniqueness of de- 
pendency variable over specificity of request 
for help, which at high levels could increase 
obligation to help, in determining helping. 


Reparation and Involvement in Harm 


Reparation is a possible explanation for 
the differences in helping which were observed 
according to involvement in harm conditions, 
since nontrivial harm occurred, and help was 
allowed which could, by implication, however 
nonsalient, reverse the harm in the three ex- 
perimental conditions. Although harm was 
assumed to be necessary for reparation, it was 
apparently not sufficient to produce a high 
level of helping without the other two factors 
of the subject’s direct involvement in harm 
and the more directly implied reversibility of 
harm in this situation. One difficulty in the 
interpretation of reparation seems to be the 
possible confusion between help by reparation 
and help from responsibility. Harm alone 
probably creates some dependency, and harm 
with help possible to reverse it is like a situ- 
ation of responsibility in its implied obliga- 
tion. Therefore, one’s own involvement as the 
source of harm is a variable which can dis- 
tinguish help by reparation from help from 
responsibility. 


Self-Protection and Involvement in Harm 


The form of the differences in irrelevant 
help for involvement in harm conditions sug- 
gested an interpretation in terms of self- 
protection, Irrelevant help might counteract 
feelings of worthlessness by bolstering the self- 
image (Carlsmith & Gross, 1969) or serve as 
self-punishment in expiation of guilt (Freed- 
man et al., 1967), Not helping could mean 
avoidance of retaliation by avoiding proximity 
to a victim (Freedman et al., 1967), avoid- 
ance of further obligation to help (Jones, 
1970), or avoidance of implication of guilt. 
Greatest irrelevant help occurred when harm 
came by bureaucratic fiat. This condition 
posed little threat for the subject’s self-image, 
but it might have suggested help as atone- 
ment for a mistake imposed by society, and 
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the subject had no reason to avoid implica- 
tions of guilt. Least irrelevant help was 
found when one similar to the subject was the 
source of harm. Since another actu caused 
the harm, there was little reason for either 
bolstering the self-image or self-punishment, 
and the possible guilt by associa‘ might 
have created avoidance of volunte: . Some 
irrelevant help occurred when the ect was 
the source of harm, and although there was 
reason for guilt, remaining in the e mental 
situation meant that the subject was a possi- 
ble target for blame or further ob! m. 


Sex Differences in Helping 


Although females subscribed more than 
males to both reparation and respcssibility 
norms about helping others, the only : of 
help that bore some relationship to «ex was 
help relevant to dependency: uniqve.ess of 
dependency conditions made a greate“ differ- 
ence for males than for females. If ‘emales 
subscribed more to a norm to he!» one in 


need, the two uniqueness of dependency vari- 
ations should make little differents in help 
for females, because dependency occured in 
each. Other research has suggested that males, 


even if they do not actually offer more aid, 
are expected to help more in emergencies ac 
cording to conceptions of the male role (Pili- 
avin et al., 1969; Schwartz & Clausen, 1970). 
Here, males appeared to respond to the differ- 
ences in expectation implied by the uniqueness 
of dependency variations, while females apr 
parently felt obligation to help, regardless of 
condition. 
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EFFECTS OF COMMITMENT AND CERTAINTY UPON 
EXPOSURE TO SUPPORTIVE AND NONSUPPORTIVE 
INFORMATION 


CHARLES F. BEHLING* 
Lake Forest College 


Mills and Ross’s selective exposure hypotheses are examined in a 2 X 2 X2 
factorial experiment manipulating choice certainty, subjective commitment, and 
informational supportiveness. The findings for 120 high school subjects repli- 
cated the results for individual mean differences which Mills had previously 
cited as supporting choice-certainty predictions. These comparisons were con- 
founded between levels of certainty and supportiveness, however, and cannot 
be unambiguously interpreted. A more precise test of choice-certainty hypoth- 
eses is offered by examination of interaction effects across the entire factorial 
design represented in the present experiment. Analyses of these predicted inter- 
actions failed to support Mills and Ross’s position and in several cases pro- 
duced results which were in an opposite direction to predictions, 


Mills and Ross (1964) have developed a 
theory predicting that variations in subjects’ 
attitude certainty and commitment will lead 
to selective exposure to information. Mills 
(1968) has contended that this theory may 
be utilized to explain many of the seemingly 
discrepant findings summarized in Freedman 
and Sears’s (1965) critical review of the 
selective exposure literature. It was the pur- 
pose of the present paper to examine the 
alleged explanatory power of the Mills and 
Ross position and to present new research 
evidence relating to it. 

Mills and Ross’s theory proposes that sub- 
jects will actively seek out information which 
is expected to increase their certainty about 
an issue and will actively avoid information 
which is expected to decrease certainty. Sev- 
eral hypotheses were formulated to specify 
the predicted effect of subjective commitment 
on exposure to information. 


1. When people are highly committed to a 
position, the less certain they are that it is 
the best one, the more they will tend to 
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of the final version of this report is deeply 
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expose themselves to information sfpporting 
the position, and the less they wil! tend to 
expose themselves to nonsupportiv: infor- 
mation. 

2. When people are not comm:! 
position, the more certain they ar: 
the best one, the more they will ten: 
themselves to information support: 
sition, and the less they will ten 
themselves to nonsupportive infor: 


] to 8 
it it is 
) expose 
the po- 
expose | 
m. 


ils and 
or no 


Three recent studies derived from ^i 
Ross’s theory provide either wee. 
support for the information-seeking »y pothe- 
ses. Mills (1965a) found no significant differ- 
ences for the effects of commitment and cer- 
tainty variations on subjects! ratings of 
interest in supportive and nonsupportive ad- 
vertisements. Mills and Ross (1964) found 
some support for their hypotheses concerning 
supportive information, but this finding ws 
apparent only on the basis of correlational 
data, and not on the basis of their experimen- 
tally manipulated groups. Mills (1965b) T€ 
ported support for the high-commitment, sup- 
portive-information prediction only after the 
data were adjusted on the basis of di fierential 
initial preferences for messages. No signifi 
cant differences were found in this study ™ 
regard to the high-commitment, nonsuppo™ 
tive-information prediction. 

Two other choice-certainty experiments 1% 
ported results which seem somewhat mort 
substantial statistically, but the theoret!@ 
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à 


interpretation results is highly ambig- 
uous. Mills (1 
a list of produ 


equired subjects to rank 
' the order of their desir- 
| the subjects that they 
o choose to receive one 
zift. “Certain” subjects 
uld choose between “A” 
ch was ranked 10 prefer- 
ertain” subjects were told 


ability and the 
would be per: 
of the produc! 
were told that 
and the produ 
ences above it 


that they coul se between “A” and the 
product which ranked immediately below 
it. Prior to co: ng themselves to a choice, 


however, the biects were asked to read 
and evaluate : dvertisement for “A.” It 


was found tha! «certain subjects spent sig- 
nificantly m xe reading the ad than 
certain subjec id these results were inter- 
preted as refi attempts by uncertain 
subjects to increase their choice certainty 
and by certai bjects to avoid decreasing 
their certainty. 

It may be seen that the variables involved 
in this study were confounded across treat- 
ment conditions. Certain subjects represented 


a high-certainty, low-commitment, nonsup- 
portive-information treatment, while uncertain 
subjects represenied a low-certainty, low- 
commitment, ortive-information condi- 
tion. Under such circumstances it is extremely 
difficult to isolate the experimental manipula- 
tions which account for the observed results; 
4 variety of explanations may be offered for 
the findings. The confounded comparisons are 
hot systematically interpretable in terms of 
Mills and Ross's theoretical position. 

Using a procedure similar to the former 
study, Mills and Jellison (1968) tested sub- 
Jects in two conditions: “certain relevant” 
and “certain irrelevant." The certain-relevant 
condition was identical to the certain condi- 
tion of the previous study. In the certain- 
"relevant condition, however, the subjects 
chose between the product ranked immedi- 
ately below “A” and the product ranked nine 
Preferences above “A.” As in the other group, 
these Subjects were asked to read an adver- 
tisement for “A” It was found that certain- 
Televant subjects spent less time reading the 
Message than did those in the certain-irrele- 
vant condition, and this result was interpreted 
l 4S reflecting the “relevant” subjects’ efforts to 
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avoid having their certainty reduced by 
exposure to nonsupportive information. 

It may be noted that the “irrelevant” infor- 
mation may in fact have been relevant to the 
subjects’ choice. Information about the merits 
of a product ranked one preference higher 
than a choice product may well provide some 
information about the relative merits of the 
lower ranked product. In this case, the ir- 
relevant information may more precisely be 
considered as information which is at least 
partly nonsupportive. Moreover, it can be 
suggested that this information may be some- 
what less nonsupportive than the information 
which specifically endorsed the lower ranked 
product (ie. the information which was 
offered to the relevant subjects). 

This experiment does little to explicate the 
Mills and Ross theoretical position. Neither 
certainty nor commitment was varied be- 
tween subjects. The findings suggest simply 
that the subjects preferred “less nonsuppor- 
tive" to “more nonsupportive" information, 
and this prediction is in no way unique to 
the Mills and Ross theory. 

On the basis of this series of studies, it 
may be seen that research evidence does not 
provide convincing support for the Mills and 
Ross position. The experimental findings 
generally offer either very weak statistical 
support for the theory, or they are mot 
interpretable in terms of the theoretical form- 
ulations. It seems, therefore, that the claims 
of wide explanatory power for the theory 
(Mills, 1968) have not yet been supported by 
experimental evidence. The present paper de- 
scribes a more direct test of the above theo- 
retical hypotheses in a 2 X 2X2 factorial 
experiment, in which dual levels of certainty, 
commitment, and informational supportiveness 
were manipulated. The method of this experi- 
ment was similar to that of the two previous 
studies (Mills, 1965c; Mills & Jellison, 1968) 
which obtained strong statistical results, but 
the present experiment made nonconfounded 
comparisons, as well as the comparisons in- 
vestigated by Mills (1965c). In this way, re- 
sults of nonconfounded tests of choice-cer- 
tainty theory may be compared with tests 
which have previously been cited as support- 
ing the theory. 
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METHOD 
Subjects 


Subjects were 120 students from a boys’ parochial 
high school in Nashville, Tennessee. Fifteen subjects 
were randomly assigned to each of the eight experi- 
mental treatments. 


General Procedure 


The experiment was introduced to the subjects as 
a “market research survey” which measured “con- 
sumer reactions to several new products.” Each sub- 
ject was given a message of approximately 250 words 
which explained the alleged purpose of the survey. 
In order to determine the subject’s initial preferred 
reading speed, the amount of time the subject spent 
reading this message was unobtrusively recorded. 

As the first “survey task,” the subject was given a 
sheet describing five products and was asked to rank 
the products “according to how much you would 
like to have them at the present time.” The products 
to be ranked were an alarm clock, an automatic 
camera, a desk lamp, an electric fan, and a transistor 
radio. The products were identified by brand name, 
functional description, and suggested retail price. 

After the ranking of desirability was completed, 
the subject was told: 


Now we have an idea about how much you would 
like these products after you've first heard about 
them. We're also interested in people's reactions to 
the products aíter they've had a chance to really 
become familiar with them. It would be nice if I 
could give you all of the products to become 
familiar with them. But obviously this is impossi- 
ble. However, I can offer you one of the products, 
and if you're willing to take the products for a 
period of six months and at the end of that time 
answer some questions about the product, then 
the product will be yours to keep. What we do is 
to give you a questionnaire to complete and we 
include a self addressed stamped envelope for you 
to send to us. Would you be willing to take one of 
the products? [Mills & Jellison, 1968, p. 60] 


After the subject agreed, he was instructed: 


We can’t give you an unlimited choice of all the 
products but we try to give people as much choice 
as possible and in this case I can give you a 
choice between two products [Mills & Jellison, 
1968, p. 60]. 


Independent Variables 


At this point, differences between the various ex- 
perimental conditions were introduced. Certain sub- 
jects were informed that they could choose between 
the products which they ranked first and last in 
preference. On the basis of the discrepancies in rank- 
ings and the nature of the product descriptions, it 
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was expected that subjects offered thi ice 
feel relatively confident that they w " 
product was best. Uncertain subject: e told tha 
they could choose between the produ rked fing 
and second in preference. It was expe hat thes 
subjects would be less confident in | g whid 
of the choice products was more d 
“Committed” subjects were require ake tha 
decision between the products imr tely 
receiving these instructions. It was st to thes 


subjects that they could not chang deci: 
once they had made it. Each commit! 
required to write and sign his choice « 
of his ranking sheet. The amount o! 
subjects spent in making their choice 

“Uncommitted” subjects, on the ot! and, 
informed that they would not be asked to make 
decision between the products until the end of 
experiment. It was stressed that: 


You have plenty of time to make i-cision, # 
don't make up your mind yet. Yo: 
additional information later which r 
in making your choice. It is to y 
not to commit yourself to one of : 


until you have to. 


Dependent Variables 


Each subject was then required to r 
questionnaire items. First, he was as: 
on a 20-point scale how certain he v 
which of the choice products was b: 
was asked to indicate on another 20 
much effort he was willing to exert to í 
information which favored his preferi 
Before receiving this questionnaire, i: committi 
subjects were told: 


I'm going to give you a short questionnaire whid 
asks some questions about your choice products 
Your responses to these questions i» no way Com 
mit you to choose one or the other of the prod: 
ucts. You will not be required to make a fm 
choice until the end of the experiment. 


Committed subjects received only the first sente 
of these instructions. 

In order to avoid possible experimenter effet 
the next experimental task was self-administer' 
the subjects. Each subject was given an envelope #! 
was asked to follow the directions which were € 
closed, These directions stated: “It is of interest 
know your reactions to written messages about 
products. Please read the enclosed message 
evaluate it on the accompanying form.” ; 

Subjects were given a message of two typewrill? 
pages (approximately 500 words) to read. “Supp? 
ive” subjects received a report supporting the hig? 
est ranked of their choice products; “nonsuppotl ; 
subjects received a report supporting the low 
ranked choice. In each case, however, message 


the product names were 
time that the subject spent 
unobtrusively recorded. 
iained two 20-point ques- 
ibjects’ responses. The first 
w much he enjoyed reading 
item asked how useful the 
z the product. 
itted subjects were required 
between the products, and 
ıt the subjects spent in making 
ed. 


was asked to state his expla- 
surpose, and he was partially 
the nature of the experiment. 
rly cautioned that “it is often 
fulfill participants’ choices for 
ubject was pledged to secrecy 
of the experiment. 
ber of subjects had been inter- 
ter met with the subjects in a 
| the purpose of the experiment 
s involved in it. Each subject 
d of $1 for his participation in 
imum amount of time which 
rviewing a subject and debriefing 


triple-order i 


y; commitment 


teraction between levels of cer- 

and information supportiveness 
the basis of Mills and Ross’s 
That is, "committed, 
w were predicted to spend more 
=A reading supportive information and less time 
ng nonsupportive information than “committed, 
P subjects, Also, “uncommitted, certain” sub- 
were predicted to spend more time reading sup- 
ive information and less time reading nonsup- 
e information than “uncommitted, uncertain” 
jects. Predictions for the subjects’ rankings of 
fading enjoyment were analogous to the reading 
ime hypotheses. 


RESULTS 
cess of Manipulations 


Several measures relevant to the success of 
€ certainty manipulation were included in 
experiment. The first of these measures, 
amount of time required by the subjects 
Choose between their gift products, indi- 
es that certainty was successfully manipu- 

- Decision time for high-commitment 
Jects was recorded immediately after the 
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certainty treatment, and it may be interpreted. 
as a direct referent of subjective certainty. In 
this condition, low-certainty subjects required 
significantly more time to reach a decision 
than did high-certainty subjects (M — 19.17 
and 11.07 seconds; F = 9.67, df = 1/58, $ 
< 01). For low-commitment subjects, de- 
cision time measurement was confounded 
across certainty groups by intervening ex- 
posure to supportive or nonsupportive infor- 
mation and other experimental treatments. 
Nevertheless, the small mean differences in 
decision time for these subjects were in the 
predicted direction (low-certainty M — 20.53, 
high-certainty M = 19.67; F = .03, df = 1/ 
58, ns). The other experimental referent of 
subjective certainty, the subjects' rating of 
their certainty, failed to distinguish between 
high- and low-certainty treatments. The low- 
certainty subjects! mean rating was 14.68, and 
the high-certainty mean was 14.08. This re- 
flects a very small difference in the direction 
opposite to predictions (F — .60, df — 1/118, 
ns). In view of the strong decision time effect 
for high-commitment subjects, it may be con- 
tended that the experimental manipulations 
successfully varied the subjects’ levels of cer- 
tainty. This conclusion is weakened, however, 
by the failure of the other certainty measures 
to produce significant results. 

Subjects’ ratings of informational useful- 
ness indicate that utility was appropriately 
controlled in the experiment. Supportive in- 
formation was rated as more useful than non- 
supportive information (M = 18.05 and 
1645; F=7.95, dj = 1/112, p<.01), but 
no other differences in utility ratings were 
found between treatments. It is important to 
note that the observed supportiveness effect 
may be offered as an explanation for differ- 
ences between individual experimental groups, 
but the presence or absence of Mills and 
Ross's predicted interactions cannot be ex- 
plained by such a main effect. 

The subjects’ rankings of their relative 
preferences between the five experimental 
consumer products corresponded closely to 
their quoted retail prices. The most preferred 
products were the radio, camera, and lamp, 
and the relative order of these preferences 
was the same as the order of the small differ- 
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TABLE 1 
ADJUSTED MEAN READING TIME 


Commitment-supportiveness Low-cer- High-cer- 
condition tainty Ss | tainty Ss 
Supportive information 
w commitment 145.53 | 15186 
High commitment 125.45 | 143.45 
Nonsupportive information 
Low commitment 132.99 133.84 
High commitment 128.31 


128.33 


Note.—In seconds. 


ences in their quoted retail prices. Also, as ex- 
pected on the basis of prices, the fan and 
clock were the least preferred items. The clock 
was generally more preferred than the fan, 
although its quoted retail price was slightly 
higher. 


Subjects’ Exposure Preferences 


As measures of the subjects’ preferences for 
supportive and nonsupportive information, the 
time which the subjects spent reading these 
types of information was recorded, and, sec- 
ond, the subjects’ ratings of their enjoyment 
of the information were obtained. The results 
of these measures are reported both for the 
individual (confounded) groups which Mills 
(1965c) cited as lending support to his theory 
and for the entire set of treatment groups in- 
vestigated in the present experiment. 

It will be recalled that Mills (1965c) found 
that subjects in the low-certainty, low-com- 
mitment, supportive-information condition 
spent significantly more time reading informa- 
tion than did high-certainty, low-commitment, 


TABLE 2 


ANALYSIS OF COVARIANCE OF 
READING TIME 


Source df MS F 
Certainty (A) 1 1,167.99 1.16 
Commitment (B) 1 2,751.19 2.75* 
Supportiveness (C) 1 3,438.91 3.43* 
AXB 1 221.44 .22 
AXC 1 1,034.20 1.03 
BXC 1 621.06 62 
AXBXC 1 292.85 .29 

Error 111 1,002.11 
*p <.10 
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nonsupportive-information subjects. This re- 
sult was replicated by the present ex; 
The mean reading time for the first g 
154.80 seconds; for the latter grou 
129.87. This difference is significant at the 
.05 level (F = 4.55, df = 1/28). 


The results of the enjoyment mese for 
these groups are significant in the same lirec- 
tion. The mean rating for the low-cer sinty, 


low-commitment, supportive group was 16.37; 
for the other group, it was 13.57. Th 
ence is also significant at the .05 level (F = 
5.84, dj = 1/28). 


Despite the results for these confounded 
groups, the next sections of this stud; dem- 
onstrate that support for Mills and Ross's 


hypotheses is not found when the dat 
the entire set of experimental groups are ex- 
amined. 

Reading time. Analysis of covarianc> was 
employed as a method of examining 1 ading 
time data for the entire set of exper’ ;ental 
groups. The covariate was the subjec!.^ jni- 
tial preferred reading rates. Tests of the slopes 
of covariance regression lines rejected the hi 
pothesis that the slopes could be considered 
equal to zero (F = 93.60, df = 1/104, p< 
01). The correlation between „the covariate 
and criterion scores was .48. It is therefore 
indicated that the covariance method o'‘ers à 
more precise analysis of differential treatment 
effects than an analysis disregarding original 
rates. (It may be pointed out that an analy- 
sis disregarding original rates was computed 
for these data, and the results did not yield 
the pattern of relationships predicted by Mills 
and Ross.) 

Reading time means (adjusted on the basis 
of the covariate initial reading rate) are pre 
sented in Table 1. i 

It may be observed that only the difference 
between certainty treatments in the low-com- 
mitment, supportive-information condition ap- 
proximates the relationships predicted by 
Mills and Ross. As the data in Table 2 indi- 
cate, no differences between treatments 
reached the .05 level of significance. 

Enjoyment ratings. Subjective enjoyment 
ratings, the auxiliary measure of informational 
preferences, yielded results which were quite 
weak, but which were generally in the direc 


CoviTMENT, CERTAINTY, AND EXPOSURE TO INFORMATION 


LE.3 


or ENJOYMENT 


Low-cer- | High-cer- 
ainty Ss tainty Ss 

informatio: | 
mitment 16.37 | 14.70 
iitment 15.27 14.63 
16.27 13.57 
14.57 15.13 


and Ross. As Table 3 
neans, except those of 
upportive-information 
cted direction of pref- 
mean differences are small, 
the effects reached an 
tatistical significance 


ddition to the above 
rent included a measure 
nly to preferences for 
Subjects’ ratings of 
xey were willing to exert 
;emselves to supportive 
fable 5), however, did not 
: in the direction pre- 
Ross. Analysis of the 
unpredicted significant 
ihe commitment variation 
6); low-commitment subjects indi- 
more willingness to exert effort to find 
tive information than did high-com- 
nent subjects. 
blematic data. Fourteen of the 120 sub- 
made "incorrect" choices between their 
oducts; that is, they chose to receive 


TABLE 4 


ANALYSIS OF VARIANCE OF 
Enjoymenr RATINGS 


df | MS F 

1 36.85 3.27* 

1 3.17 28 

1 3.85 34 
[^n 34.67 3.08" 
| 1 .05 01 
| jx 2.00 18 
es bad eel 83 
| 112-1. 3027 
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TABLE 5 


Mean Ratines or WILLINGNESS TO Exert EFFORT 
TO ACHIEVE EXPOSURE TO SUP- 
PORTIVE INFORMATION 


Commitment 


condition High-certainty Ss 


Low-certainty Ss 


14.67 
13.52 


15.62 
13.42 


Low 
High 


the lower ranked of the two products. One 
additional subject indicated apprehension 
about the purpose of the experiment. Because 
of the possibility that these problematic re- 
sponses may have systematically influenced 
the experimental findings, statistical analyses 
of the dependent variables were computed 
with both the inclusion and the omission of 
these subjects, The general pattern of results 
reported above was not altered when the 
problematic data were omitted; Mills and 
Ross’s predicted relationships were still not 
found for any of the dependent variables. 


DISCUSSION 


Tt was contended earlier that the previous 
experiments which pertain to Mills and Ross's 
position provide minimal support for the pre- 
dictions of the theory. Interpreted in the con- 
text of these experiments, the results of the 
present study suggest that the theoretical po- 
sition may possess very weak predictive 
power. The findings of one of the few studies 
(Mills, 1965c) which obtained strong statis- 
tical results relevant to the theory were repli- 
cated in the present experiment. When atten- 
tion is directed to the entire set of treatments 
in the present study, however, Mills and 
Ross’s predictions fail to hold. These results 
strongly suggest that the confounded differ- 


TABLE 6 


ANALYSIS OF VARIANCE OF 
ErroRT RATINGS 


Source df MS F 
Certainty (A) 1 5.42 27 
Commitment (B) 1 84.17 4.23* 
AXB 1 8.27 Ev 

6 19.90 


Error 11 


* p <05. 
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ences cited earlier as support for the theory 
(Mills, 1965c, 1968) may, in fact, not repre- 
sent differences due to the pattern of interac- 
tions predicted by Mills and Ross. 

On the basis of the present findings, a sim- 
ple utility explanation may be offered for the 
confounded differences; it is not necessary to 
rely on concepts involving choice certainty. It 
was found that subjects rated the utility of 
supportive information significantly higher 
than that of nonsupportive information. It 
may be noted that since the content of the 
supportive and nonsupportive information was 
identical, the utility ratings may reflect a type 
of generalized selective evaluation of sup- 
portive information. This finding could be em- 
ployed to explain mean differences between 
the low-certainty, low-commitment, support- 
ive-information group and the high-certainty, 
low-commitment, nonsupportive-information 
group. This main effect for supportiveness can- 
not be utilized, however, to account for the 
absence of Mills and Ross’s predicted triple- 
order interaction across the total set of groups. 

Reading time measures produced two re- 
sults of marginal significance, but choice- 
certainty concepts are not necessary to ex- 
plain either of them. Supportive information 
tended to be preferred to nonsupportive (p < 
-10), but the utility main effect cited earlier 
may account for this result. Low-commitment 
subjects tended to read information of either 
supportiveness type longer than high-commit- 
ment subjects (p < .10). This finding does 
not reflect exposure selectivity; instead, it may 
suggest a willingness by low-commitment sub- 
jects to expose themselves to any information 
which may be relevant to their impending 
decisions. 

Subjective enjoyment ratings produced re- 
sults of marginal significance (p < .10) for 
the main effect of certainty and for the Cer- 
tainty X Commitment interaction. The cer- 
tainty main effect reflects a tendency for low- 
certainty subjects to rate enjoyment of the 
information higher than high-certainty sub- 
jects. The Certainty x Commitment interac- 
tion indicates a tendency for low-certainty, 
low-commitment subjects to rate enjoyment 
higher than the other groups. These results 
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reflect the subjects’ enjoyment of 
portive and nonsupportive infor 


oth sup- 
on, and 


therefore they do not represent th: attitudi- | 
nally defensive selectivity predic: y Mills | 
and Ross. Instead, the results st an en- 
joyment by low-certainty subj of any 
type of information which is rel to their 
decisions. 

Mills (1965b, 1965c, 1968) pr d a re- 
formulation of the Mills and Ro ory by | 
suggesting that subjects’ levels o! certainty 
may not influence the degree t: h they 
avoid nonsupportive information his re- 
formulation, choice-certainty pre ns are 
expected to hold in the case of su ive in- 
formation, but not in the case of ipport- 
ive information. The results of resent 
experiment do not support even t^i* «°formu- 
lation, however. In addition to resu 
cited earlier, the experimental me whid 
was relevant only to supportive in’ "ation, 
subjects’ effort ratings, failed to | mean 
differences in the direction predicted by Mills 


and Ross. The effort ratings of low-commit- 
ment subjects were significantly higher than 
that for high-commitment subjects (# < .05). 
Again, it is not necessary to postulate choice- 


certainty concepts to explain this frcing. In- 
stead, it may be explained simply on ‘he basis 
of an apparent willingness of low-commitment 
subjects to expose themselves to any informa- 
tion which may be relevant to their impending 


decisions. 

A problematic element of the manipulation 
of commitment across certainty conditions is 
the possibility that high-certainty subjects 
may implicitly commit themselves to an al- 
ternative prior to the time that the experi- 
menter calls for decisions. A premature cont 
mitment explanation cannot appropriately be 
offered for the results of the present exper 
ment, however. Premature commitment Cam 
not apply to high-commitment subjects, a? 
the results of these subjects’ responses ge 
erally fail to support the choice-certainty pat 
tern of predictions. 

Although it is very difficult for experimental 
findings to disconfirm a theory, the results 0 
the present experiment may be considered 4 
serious challenge to Mills and Ross's position: 


s 


he entire set of groups tested in this 

t is considered, the results of no 
support the choice-certainty hypothe- 
the mean differences between the 


that predicted by Mills and Ross. It 
puld seem, therefore, that recent claims of 
le explanatory power for the theory (Mills, 
| may be unfounded. Instead, there pres- 
seems to be sufficient justification for 
gestion that the theory possesses very 
edictive power indeed. 
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COMMUNICATION SCHEDULE AND COOPERATIV! 
GAME BEHAVIOR ' 


NORMAN H., VOISSEM axp FRANK SISTRUNK * 
University of South Florida 


The effect on cooperative behavior of different schedules of communicatio 
opportunity was examined in a Prisoner's Dilemma game. Communication 
opportunity was provided by means of standard messages exchanged volun 
tarily after every trial, after every tenth trial, or only prior to the game. Th: 
major finding was the Communication Schedule X Trial Blocks interaction 
Ceoperation was low for all groups during the early trials. However, wher 
players were given an opportunity to make frequent communications, th: 
cooperation level increased significantly during later trials. There were no 
significant. differences by sex of the subjects, except for an interaction of sex 
and communication schedule with the C|CD state-conditioned propensity. 


The experimental game has proven to be a 
useful research device for producing meaning- 
ful social interaction under controlled labora- 
tory conditions. An important determinant of 
most social interaction is communication, and 
communication would be expected to have a 
significant effect on the behavior of players 
in the gaming situation. However, communi- 
cation variables have been manipulated in 
only a few gaming research studies. Although 
there are exceptions, the general finding has 
been that the cooperation level increases when 
some form of communication is allowed in a 
setting of neutral motivational orientation. 

At the midpoint of a 50-trial Prisoner’s 
Dilemma (PD) game, Scodel, Minas, Ratoosh, 
and Lipetz (1959) had subjects cease playing 
and allowed them to freely discuss the game 
with their partner. During the second half of 
the game, mutual defections decreased for this 
group, as compared with a no-discussion con- 
trol group. Bixenstine, Levitt, and Wilson 
(1966) modified the PD game for use with 
six-person groups. Casual communication was 
introduced into the study by providing a 15- 
minute recess at the midpoint of a 40-trial 
game. A significant increase in the level of 
cooperation occurred over the last 20 trials in 
a communication group receiving feedback on 
the other subjects’ choices, while no increase 


1 This paper was adapted from a master’s thesis 
submitted by the first author to the University of 
South Florida. 

2 Requests for reprints should be sent to Frank 
Sistrunk, State University System of Florida, 107 
West Gaines Street, Tallahassee, Florida 32304. 


was observed in a group where t dback 


was not provided. Gahagan ar deschi 
(1968) studied communication cre y, the 
degree to which a communicato: havior 
matched his intention as stated s com- 
munication., Cooperation level was her in 
trials that immediately followed communica- 
tion for the subjects who receiv highly 


credible messages than for the sub -cts who 
received the lower credibility messages. Evans 
(1964) investigated communication of prom- 
ises having different levels of enforceability 
and found that cooperation level decreased 
with decreases of enforceability. In 2 study by 
Swensson (1967), messages were sent prior 
to each trial in a 30-trial game. Cooperation 
level increased as the opportunity í 
munication increased from none, to 
to two-way message exchanges. Radlow and 
Weidner (1966) conducted a study using | 
extensive note exchanges prior to the start of 

the game. The cooperation level achieved 

under these conditions was approximately 

90%, significantly greater than their control | 
group, and a level which far exceeded the level | 
attained in most other studies having a 

individualistic motivational setting. Radlow - 
and Weidner stated that their motivational 
orientation was individualistic, but it ap- 
pears that it was biased toward a cooperative 
set, and this set seems to have a general tend- - 
ency to increase cooperation level. Deutsch | 
(1960) found that the cooperation betwee? | 
subjects playing a 1-trial PD game wa 

greater when they were allowed to exchange 
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g their selections than 
ranges. Deutsch (1958) 
operation increase com- 
eration level of a no- 
was greater for subjects 
lized note having more 
containing less informa- 
used the same set of 
;nd concluded that per- 
ncreased with informa- 

h had predicted, and he 
mutual trust is a requi- 
iperative choices. Another 
the finding of Deutsch 
vational sets affect the 
by interacting with the 
able. A cooperative set 
el of cooperation, and a 
luced a low level of co- 
ios with or without com- 
e. When an individualistic 
se^ was used, a significantly 
operation was observed in 
yas allowed communication 
in the no-communication 
the generality of Deutsch’s 
limited since he employed 
ame. Krauss and Deutsch 
) employed untutored and tutored 
commurication as levels of an inde- 
variable in the trucking game. The 
nce oí communication was stressed 
out discussing the communication content 
group. The tutored group was 
ed to focus its communication on 
tiating a fair solution. Bilateral threat 
ditions were also used; that is, both sub- 
controlled barriers which could be used 
st the other player. Subjects tutored to 
nmunicate fair proposals gained higher pay- 
‘than the untutored subjects. In an earlier 
(Deutsch & Krauss, 1960), the sub- 
of one group were given an opportunity 
Communicate by intercom whenever they 
ned. These subjects seldom used the com- 
nication opportunity, and their perform- 
ce Was not significantly different from that 
à no-communication group. Nor was the 
rmance of a group which was forced to 
mmunicate in a bilateral threat condition 


. Howeve 
g is severe) 


Io “ars u- 4.73. u^ 
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communication group. Krauss and Deutsch - 
concluded that communication opportunity — 
and forced communication, in themselves, - 
were not sufficient conditions to improve per- - 
formance; tutoring in fair bargaining was also 
required. 

Thus, a number of studies have demon- 
strated that various aspects of communica- 
tion affect behavior in the mixed-motive 
gaming situation. Credibility, promise en- 
forceability, one-way versus two-way, forced 
versus optional, and tutored versus untutored 
communication variables have been studied. 
Some variables have been shown to interact 
with communication variables. Increases of 
communication generally result in increases 
in cooperation level, but there are excep- 
tions. For example, Deutsch (1958, 1960) 
found that communication was not an 
effective influence under cooperative or com- 
petitive motivational sets. Different studies 
have operationalized the communication vari- 
able in different ways and used different types 
of games, and this may have influenced the 
results. A few studies have introduced free 
oral communication at some point during the 
game, but most studies have used sets of 
written message exchanges. The content and 
structure of these messages have differed 
among the studies. In the interest of limiting 
investigations to manageable complexity and 
size, most studies have held constant pos- 
sibly interacting variables, such as instruc- 
tional set. Other possibly interacting variables 
such as sex remain to be investigated. Indeed, 
there has been little integrative or systematic 
research on the effects of communication, and 
the precise way that specific aspects of com- 
munication affect cooperation in conjunction 
with other critical variables has yet to be 
demonstrated. 

The present study systematically investi- 
gated the effects on cooperative behavior of 
four schedules of communication opportu- 
nity: no communication, pregame communica- 
tion, communication every 10 trials, and com- 
munication every trial. An individualistic 
motivational set was used throughout the 
study, and the effects of communication were 
observed across trials and by sex of the dyads. 
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METHOD 
Subjects 


The 48 male and 48 female volunteer subjects 
were undergraduate students attending the University 
of South Florida. Students who were unacquainted 
or only slightly acquainted with each other were 
paired to form same-sex dyads and were randomly 
assigned to the experimental conditions. 


Procedure 


The players were seated in separate booths and 
operated individual consoles which contained re- 
sponse switches, a cumulative counter, and the game 
matrix. The payoff matrix values in imaginary dollars 
were: R=3, T=5, P=1, S=O (see Rapoport & 
Chammah, 1965). The Steele-Tedeschi (1967) index 
of conflict intensity for this matrix is .398, which 
would be considered a medium value. Thus, a stan- 
dard matrix of moderate conflict intensity was used 
to allow for more accurate comparison with other 
studies and greater generality of results. The out- 
come cell on the subject's console illuminated after 
each trial. Tape-recorded instructions avoided the use 
of competitive terms and provided an individualistic 
set. Previous studies have shown that feedback of 
only one’s own score, as compared with feedback 
of one’s own score as well as his partner’s score, 
influences the choices made (McClintock & McNeel, 
1966; McClintock & Nuttin, 1969). Therefore, 
counters displayed only a player’s own cumulative 
score, since this feedback is consistent with the indi- 
vidualistic set. After 3 standard practice trials, ques- 
tions were answered, and then the subjects were 
tested for their comprehension of the matrix. Indi- 
vidual instructions were given to those who made 
errors to ensure that they understood the matrix. 
Communication was not mentioned, nor was any 
communication allowed between the subjects of a 
control group. A second group of subjects could 
communicate only before the game began; a third 
group of subjects could communicate before the game 
started, as well as after every tenth trial; a fourth 
group of subjects were allowed to communicate 
before each trial. Subjects communicated by com- 
pleting one of five statements of intention on pre- 
pared message forms. The message set was: “I intend 
to choose . I expect you to choose " 
If you choose , I will choose . Whatever 
you choose now, lll choose on the next trial. I do 
not wish to disclose my intentions." Subjects could 
send an unlimited number of messages during each 
communication period. They were not informed 
of the number of trials required, but the game con- 
tinued for 101 trials. Data of the last trial were used 
only for propensity calculations. 


RESULTS 


The factorial mixed-effects analysis of vari- 
ance design used in this study employed com- 
munication opportunity and sex as between- 
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dyads variables and trial blocks as « repeat 
measure. The primary dependen! meas 
used were individual and mutual »eratig 
levels. Fisher's least significant d nce tes 
was used to evaluate differences | en pair 
of means of factors having significa! F ratios 
(p < .05). 
Individual Cooperative Responses 

The primary analysis compare: r leve 
of communication opportunity an » level: 
of sex, evaluated over first-half a ist-hall 
trial blocks. The dependent measure for this 
analysis was the proportion of individual co 
operative responses made by dyads. ihe com 


munication opportunity main effe as sig 
nificant as predicted (F = 2.98, dj = 3/40, 


« .05). Although there was a cons t trend 
of increasing cooperation with incre: sing op 
portunity for communication, the y sta 
tistically significant differences among 

main effect means were between com- 
munication on every trial and all other leve 


of communication. A trend analysis vev 
that these means arranged in th 
increasing communication opportuniiy cong 
tained a significant linear component (PS 
.01), indicating that the proportion of coop- 
erative responses was a monotonic function ol 
communication opportunity. The main effe 
of sex and all the interactions with the sé 
variable were not found to be significant 
trial blocks main effect was not s 

but the Communication Opportunity 
Blocks interaction was significant (F = 4 
df = 3/40, p < .01). This interaction showe 
that there was a significant increase in coop 


over that observed in the first half of 
game for the communication-every-trial group 
only (see Figure 1). A significant decrease W 
cooperation level occurred in the last half 0 
the game compared with the first half for tà 
pregame-communication group. The no-com 
munication and communication-every-10-trià 
groups did not exhibit any significant differ 
ences across trial blocks. Mean cooperatio 
levels are presented in Table 1. 

An ancillary analysis of the individual coop 
eration level was made using trial blocks € 
10 trials to determine if there were any ™ 


C NICATION ScHEDULE AND CooPERATIVE GAME BEHAVIOR 
TABLE 1 
MEAN TION OF INDIVIDUAL AND MUTUAL COOPERATIVE CHOICES As A FUNCTION 
or I )cks (50 Trrats CH) AND LEVEL OF COMMUNICATION OPPORTUNITY 
Individual cooperation Mutual cooperation 
CIA rom roi az — -— - 
schedule Trials Trials PA Trials Trials X 

0-50 51-100 (0-100) 0-50 51-100 | — (0-100) 

— | 
None 260 213 .236 072 4107 |  .089 
Pregame 344 243 293 180 423 51 
Every tenth tria! 306 316 311 A57 190 A73 
Every trial 101 | 574 A87 255 472 363 
328 s .332 166 223 .194 


: trial blocks. The trial 
»ugh significant in this 


teresting tren: 
blocks main efi 


analysis (F = 9/360, p < .01) be- 
fore applicat the Greenhouse-Geisser 
correction (M 166), was not found to be 
significant aft: correction. However, the 
data were ana for heterogeneity of co- 
variance and fovad to be homogeneous, indi- 
cating that t! rection was conservative 
and not actu iceded in this case. The 
cooperation lev: creased with the first few 


trial blocks 
trial blocks i 
plained drop ox 
A trend analys 
contained signifi: 
ponents (p < i 

Individual level of cooperation data for 
first trials only were also analyzed for com- 
Parison with the research literature which 
used one-trial games, but there were no sta- 
listically significant differences. 


(hen increased with later 
a U curve. An unex- 
;'red in the last trial block. 
of this curve revealed that it 
int quadratic and cubic com- 


rm 


| Mutual Cooperative Responses 


The total number of cooperative choices is 
à measure of cooperation attempts, whereas 
the number of mutual cooperative choices is 
à measure of reciprocated and therefore suc- 
cessful cooperative responses. An analysis 
Was made using mutual cooperative choices as 
et variable over trial blocks of 50 
E and resulted in a statistically significant 
dj nmunication opportunity effect (F = 3.56, 
tion- 3/40, b < 05). Only the communica- 
fete oe tial group was statistically dif- 
Tent from any of the other communication 
Eros and it had a higher level of coopera- 

" than the other groups. The sex variable 


was not statistically significant, nor were any 
of the interactions with this variable found to 
be significant. The trial blocks main effect 
was statistically significant (F — 5.31, df — 
1/40, p < .05), with the proportion of mu- 
tually cooperative responses increasing as a 
function of trial blocks. The Communication 
Opportunity X Trial Blocks interaction was 
also statistically significant (F — 5.36, df — 
3/40, p < .01), showing a significant increase 
in mutual cooperation level in the last half as 
compared with the first half of the trials for 
the communication-every-trial group only (see 
Table 1). 

An analysis of the first-trial data was also 
made, but it failed to show any significant 
differences. 


Earning Effectiveness 


Even though the data seemed to show that 
cooperation level increased as a function of 
communication opportunity, a question re- 
mained concerning the effect of communication 
on the level of earnings (game points). The 
actual earnings by dyad were analyzed using 
four levels of communication and two levels of 
sex as between-dyads variables over two trial 
blocks of 50 trials. No significant F ratios 
were obtained. 


State Stability 


It was expected that with increasing trials, 
the subjects! behaviors would tend to become 
less variable. State stability was defined as 
the proportion of trials in which both sub- 
jects made the same choice on a trial that they 
had made on the previous trial. There was a 
significant trial blocks effect (F — 69.04, df — 
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TABLE 2 
STATE-CONDITIONED PROPENSITIES BY Dyap 
cicc | CICD | CIDC | CIDD 
Communi- | Trust- | 
cation Sex | hi- | Forgive- | Repent- | Trust 
level wortete qoi gelb ance 3 
| ness | | 
| as | 
| x y z w 
None |Male | .517 .250 .289 ATI 
| Female| .375 +178 256 .202 
X 446 219 273 187 
Pregame Male .285 214 243 „220 
Female| .472 312 232 1139 
X 378 .263 237 .180 
Every tenth | Male -688 438 .256 166 
trial Female 519 .236 240 4454 
X .603 .337 -248 -160 
Every trial | Male -801 .195 .503 220 
Female| .792 .355 -403 367 
X .797 3225 453 444 


1/40, p < .01), with increasing state stability 
over later trials. 


State-Conditioned Propensities 


The state-conditioned propensities (see 
Rapoport & Chammah, 1965) are the condi- 
tional probabilities that a subject will choose 
cooperatively on a trial, given particular dy- 
adic outcomes on the preceding trial. The 
four propensities were empirically determined 
for the different communication levels and by 
sex and are presented in Table 2 and Figure 


C-100 


PROPORTION OF 
i 


COOPERATIVE RESPONSES 
ES 


L3 


FIRST HALF 
(150) 


LAST HALF 
151-100) 


TRIAL BLOCKS 


Fic. 1. Communication Opportunity X Trial Blocks 
interaction for individual cooperative responses. 
(C-100 = communication every trial, C-10 = com- 
munication every 10 trials, C-1 = pregame communi- 
cation, C-0 = no communication.) 
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ere ana- 
he inde 
1nication 


2. The data for each propensit 
lyzed in factorial analyses, us 
pendent variables of sex and co! 
opportunity. 


The probability that a playe: 1 choose 


cooperatively following a mutually perative 
trial (trustworthiness) was ificantly 
higher for the communication very-trial 
group than for the no-communicstion and 
pregame-communication groups (/ 5.19, df 
= 3/40, p<.01), with no sig int dif- 
ferences among the other co nication 
groups. 

The probability that a player v choose 
cooperatively following a trial in which he 
had chosen cooperatively and the oi «er player 
had defected (forgiveness) resulte in a sig- 


nificant Sex X Communication Opportunity 
interaction (F = 2.89, df = 3/40, ^ < 05). 
Males in the communication-evecy 10-trials 
group were significantly more _ giving" 
than the subjects in all other grows, except 
the pregame-communication females. 

The probability that a player wold choose 
cooperatively following a trial in wh «a he had 


defected and the other player had -nosen co- 
operatively (repentance) was sis uficantly 
higher in the communication-evers al group 
than in all other groups (F = 3.21, df = 3/40; 


p< 05). 

The probability that a player would choose 
cooperatively following a trial in w ich there 
was mutual defection (trust) was not signif- 
cantly influenced by communication oppor 
tunity or sex. 


Discussion 


The major hypothesis that cooperation 
level would increase with an increase in com 
munication opportunity was consistently SUp- 
ported by the data on the several dependent 
variables, Moreover, with the exception of the 
earning-effectiveness analysis, all analyses €? 
ploying trial blocks as a factor had either a 
significant trial blocks effect, a Communica- 
tion Opportunity X Trial Blocks interaction 
or both. This interaction was perhaps the 
most meaningful overall finding. A representa 
tive display of this interaction is presented or 
individual cooperative responses in Figure J 
Cooperation was low for all groups during © 


| 


when players were given 
an opportunity ke frequent communica- 
tions, the coopt level increased signifi- 
cantly during | ils. 

Earlier gam tuations involving com- 
munications (! ne et al, 1966; Rad- 
low & Weidne Scodel et al., 1959) had 
led us to expe ooperation level in the 
communication 10-trials group (as well 
Las the every- roup) would be signifi- 

cantly higher the cooperation level of 

the no-comm: | control group. How- 
ever, this exp | was not supported by 
the data. The ive, forced pregame mes- 
sage exchange | of the Radlow and Weid- 
ner study may increased the probability 
of agreement: : the players, and reach- 
ing these agi ts could have been the 
factor which ; ced the higher cooperation 
level, rather th ? existence of the message 
exchange per other two studies which 
had a single sunication period allowed 
free oral comi ition, and this could have 
been more effei n producing higher coop- 
eration than « tunities for standard mes- 
sage exchange 

Not only w: 


ly trials. Ho 


ow frequencies of communi- 
cation opportunity not effective in increasing 
the cooperation level in this study, but they 
also resulted in lower cooperation in the sec- 
ond hali of the zame, as compared with the 
first half, The communication opportunity 
every 10 trials was sufficient to prevent the 
Cooperation level from dropping in the last 
half of the game, which the groups having 
lower frequencies of communication oppor- 
tunity had a tendency to do, but the only 
group which noted a significant increase in 
Cooperation level above the level observed in 
the first half of the game was the higher fre- 
quency communication opportunity group. 
The cooperation level of all groups de- 
creased during the first trial block of 10 trials. 
This decrease in cooperation level for early 
trials is consistent with the findings of Swens- 
Son (1967), who allowed subjects to exchange 
Messages before each trial in a 30-trial game. 
His message set was more restrictive than the 
Message set used in the present study, and he 
observed a decrease in cooperation level over 
fach of three trial blocks of 10 trials. The 
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decreasing cooperation level in the present 
study leveled off at about 40 trials, forming a 
U-shaped function. 

Radlow (1965) observed a U-shaped coop- 
eration-level curve in a study investigating 
the effect of variation in the payoff matrix. 
He hypothesized that the initial drop in coop- 
eration level resulted from attempts to gain 
an unfair share of the payoffs on the early 
trials. He argued that cooperation level later 
increased because the subjects became dis- 
satisfied with the low payoffs they received as 
a result of mutual defection, and they at- 
tempted to arrive at the mutually cooperative 
payoff in later trials. This hypothesis seems 
consistent with the findings of the present 
study, with communication opportunity acting 
as an effective means of coordinating the tran- 
sition to mutual cooperation in the communi- 
cation-every-trial group. The communication- 
every-10-trials group also had this means of 
coordination available, but to a lesser degree, 
and it appears that this lower level was not 
sufficient to produce a significant behavioral 
difference from the behavior of the no- 
communication and pregame-communication 
groups. However, the cooperation-level in- 
crease over the later trial blocks tended to be 
larger in this group than in the no-communica- 
tion groups. As might be expected, the pre- 
game-communication group behaved more like 
the no-communication group, and the 
communication-every-10-trials group behaved 
more like the communication-every-trial group 
as the number of trials increased. The coop- 
eration level for the communication-every-10- 
trials group, however, remained substantially 
less than that of the communication-every- 
trial group, except for the second trial block. 

Apparently, the tendency to lock in on the 
mutually uncooperative outcome as trials in- 
creased was difficult to change unless there 
was an effective means of coordination. Credi- 
ble communication offers a powerful tool for 
simultaneously reorienting both subjects of a 
dyad toward mutually cooperative behavior, 
should one of them succumb to the tempta- 
tion of the uncooperative response. Also, the 
F ratios for the mutual cooperative response 
data were larger than the corresponding ra- 
tios for the individual cooperative response 
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COMMUNICATION OPPORTUNITY 


Fic. 2. State-conditioned propensities as a function 
of communication opportunity. 


data, indicating greater sensitivity of the 
former to the communication opportunity 
variable. 

The state-conditioned propensities are pre- 
sented as a function of communication oppor- 
tunity in Figure 2. Greater communication 
opportunity resulted in higher values of the x 
and z propensities. This indicated a higher 
level of choice reciprocation following the 
states pertinent to these propensities (CC and 
DC). That is, on a given trial, a player was 
likely to choose the same as the other player 
had chosen on the previous trial, provided 
that the other player had chosen C and pro- 
vided that he had an opportunity for fre- 
quent communication. The z propensity, how- 
ever, remained less than .5 over all levels of 
communication, The w and y propensities were 
low over all levels of communication and indi- 
cated highly reciprocated choices following 
the CD and DD states, regardless of com- 
munication. This would become more ap- 
parent if the w and y propensities were trans- 
formed into reciprocated choices by subtract- 
ing them from 1. (D|DD = 1 — C|DD; D| 
CD = 1 — C|CD.) The D choice is the pre- 
dominant choice following all state conditions, 
except for the CC (mutually cooperative) 
state. The probability of a subject choosing D 
after a state in which the other player chose D 
is quite high, regardless of communication 
level, as indicated by the 1 — y and 1 —w 
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values. Furthermore, the z curv- indicated | 
that the D response was also lic; following | 
the other player's C response , 1-2) 
when the subject had been rew: | by high 
points for the DC dyadic state fact, the 
only state after which there v i greater 
proportion of C responses than 'esponses 
was the CC state under high-co: :unication 
conditions. f 

There appear to be two major kinds of 
factors which might directly i nee the 
subject’s response, subsequent to » articular 
dyadic state. Most obvious is th nber of 
points gained. One would expect tho : petition 
of responses which gained high its. The 
other kind of factor is related to ine inter- | 
personal interpretation placed on ihe other 


player's behavior. For example, one might 
perceive that the other person coonerated, or 
took advantage of a situation. ! appeared 
that if either of these condition: red the 
subsequent D response, it was qui ikely to 
be made. The C response on the other hand 
was likely to be made only when one had not 
been punished (low points) or betrayed (when 


the other person chose D). At the same time, 
the opportunity for frequent communication 
modified these tendencies. E 
In general there was low cooperation m 
this study, as in most games, whicli : right bed 
explained by the two reasons for defection 
noted by McClintock and McNeel (1966): to 
maximize one's own gain (individualistic mo- 
tivation) or to maximize the difference be- 
tween one's own and the other person's gain i 
(competitive motivation). However, close €x 


amination of the data patterns plus interviews | 
with the subjects indicated a third jx sssibility, | 
that of systematically alternating cooperative 
and competitive responses to achieve the big | 
payoff on alternate trials. That is, they may | 
have attempted to maximize their own gal. 
by making defective responses, but in a © | 
operative pattern of alternation, even though 
this strategy does not provide the theoreti 
optimum gain (2R > T + S). Thus, develop- 
ment of functional strategies in a misperce?” - 
tion of the optimal strategy may have Com 
tributed to the overall depressed cooperatio? 
level in this study, as well as others in the 
literature. 


(" V 


ation was shown to be a 
- influencing cooperation 
ver, for various reasons, 
given the opportunity to 
communicate d make optimum use of 
the opportunity me subjects greatly re- 
stricted, or eve oided, the use of the avail- 
able communi channels. On the other 
hand, other subjects elected to use deceptive 
communication. They seemed to overlook the 
long-term cons es of deceit which could 
quickly change potential influence of the 
communication a relatively unimportant, 
or even a nega influence. A few subjects 
thought that uld be disadvantageous to 
disclose their intentions, since disclosure might 
make it possible for the other player to capi- 
talize on this rmation at their expense. 
They therefor se to use communication 
only occasional 

In conclusio he 


Overall, commun 
powerful varial 
ina PD game 
everyone who 


findings indicate that 
communication o portunity affects the course 
of behavior in PD game, but the data also 
support the argument of Krauss and Deutsch 
(1966), that t! dstence of communication 
opportunities ( not guarantee their effec- 
tive use, the purpose for their use, or, for 


that matter, their use at all. 
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INTERPERSONAL REINFORCERS: 
CLASSIFICATION, INTERRELATIONSHIP, AND SOME 
DIFFERENTIAL PROPERTIES ' 


JIM L. TURNER, EDNA B. FOA, and URIEL G. FOA? 
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A proposed order among six classes of interpersonal reinforcers or resources 
(love, status, information, money, goods, and services), plotted on the tw 

coordinates of particularism and concreteness, was tested in three interrelated 
studies dealing, respectively, with perceived similarity of reinforcers, thei 
exchange, and their structural invariance. The results supported the order b) 
showing that: (a) Reinforcers proximal in the order are perceived as similar 
and are substituted for one another more than the distal ones. (b) For each 
resource given there is one resource which is most frequently chosen for 
exchange. The probability of choosing other resources is inversely related to 
their distance from the most preferred one. (c) The intercorrelation pattern 
of resources is invariant across exchange situations. Implications of the find 
ings for interpersonal reinforcement, social exchange theory, and experimental! 
games studies are suggested. Some differential properties of the resources, 
related to their order, are discussed in reference to problems of behavior in 


the city. 


Most studies of social-interaction processes 
involve interpersonal resources as indepen- 
dent variables, outcomes, or both. The use of 
social rewards is particularly explicit in in- 
vestigations of social exchange, experimental 
games like the Prisoner’s Dilemma, and cer- 
tain studies of behavior modification, where 
one person dispenses the reinforcers for an- 
other’s behavior. 

It is becoming increasingly apparent that 
a general problem in these areas of research 
is inadequate conceptualization of the rewards 
which people provide for themselves and oth- 
ers. As Aronson (1969) has noted, in com- 
plex social situations almost any exchange is 
a potential reward, and yet many such inter- 
personal rewards are not transituational.* 
Thus, the investigator is left with the difficult 


1 This research was supported, in part, by Grant 
GS-2094 from the National Science Foundation (U. 
G. Foa, principal investigator). Thanks are expressed 
to Marjorie Marlin, for statistical advice, and to 
Gregory Donnenwerth, John Harvey, Bruce Wood- 
ward, and Pan Case, for serving as confederates. 

? Requests for reprints should be sent to Uriel G. 
Foa, who is now at the Department of Psychology, 
Temple University, Philadelphia, Pennsylvania 19122. 

3 Any resource is a potential reinforcer. Whether 
or not giving it or taking it away will constitute, 
respectively, actual reward and actual punishment 
depends on the circumstances, some of which are 
discussed in this paper. 
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problem of predicting the particular combina- | 
tion of events and situations which will, in 


fact, be rewarding. A case in poi. is pro- 
vided by a recent investigation o! the ex 
changeability of psychological and material 
rewards (Weinstein, DeVaughan, & Wiley, 
1969). Finding themselves unable to predict 
specific exchange outcomes, these authors 
stressed the need for a more stringent identi- 


fication and control of the various exchange 
alternatives inherent in the experimental 
paradigm. Similarly, Alexander and Weil 
(1969) have recently pointed to the ambigu- 
ous goal structure of games like the l'risoners 
Dilemma, in which subjects may direct thelr 
behavior toward rewards which were unin- 
tended and unprogrammed by the experi 
menter. These studies illustrate the problem 
of reward control in the subject. Other inves 
tigations have shown that the same question 
arises also with regard to the experimenters 
reinforcing behaviors. The considerable lit 
erature on “experimenter effects” deals pr 
cisely with the methodological and substan- 
tive issues stemming from failure to antic’ : 
pate the differential impact of unscheduled 

reinforcement contingencies provided by thé | 
experimenter. As Rosenthal (1969) has aptly 
noted, the major, systematic effects of thes | 
unintended influences on a wide variety ? | 


mply dismissed 
; that scien- 
clinicians, and teachers have a proclivity 
stematically delivering unscheduled re- 
has its own intrinsic interest, the theo- 
“implications are much broader in 
Tf rats can be induced to learn faster 
quent and gentle handling, if children's 


can no longer be si 


and if patients show symptomatic relief 
a function of their therapist's belief in his 
t procedure, it is apparent that inter- 
reinforcers need to be spelled out and 


studies of behavior modification, investi- 
have usually been content to dichoto- 
ze reinforcers into the broad classes of 
» versus “tangible” rewards. The rela- 
effectiveness of these two classes, how- 
er, is seldom predictable, and postdictive 
isons are often ambiguous or uncon- 
Ig (e.g, Locke, 1969; Reynolds & Ris- 
^ 1968). Moreover, studies of persons as 
cers seldom separate the agent's pres- 
"from other reinforcers, nor do they make 
ts presence contingent on a particu- 
ponse (Lott & Lott, 196°). In general, 
ë is known about the relationships between 
' of reinforcement, agent of reinforce- 
it, and reinforcer effectiveness. An ade- 
e classification of interpersonal reinforc- 
Constitutes a first step toward the solution 
ese problems. At the least, a taxonomy 
alert investigators to resources often 
ooked. It also provides a basis for gen- 


ng classes of interpersonal reinforcers and 
Yir differential effectiveness in various situ- 
- Interest in the interrelationship among 
forcers is also growing among learning 
ANperimentalists. Some of this work (Jenkins, 

0) focuses on the effects of order in the 
dimension. The beginning of a search 
More specifically psychological dimen- 
is, however, inherent in the notion of 
chy proposed by Premack (1965). It is 
ely the ordering of interpersonal rein- 
» OF resources, along psychological di- 
Ons which constitutes the main topic of 
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A TAXONOMY OF RESOURCES 


In a recent theoretical paper, Foa and Foa 
(1971) have proposed a taxonomy of inter- 
personal resources. Extending the previous 
work of Foa (1966), they suggest six resource 
classes of love, status, information, money, 
goods, and services, plotted on two conceptual 
dimensions: particularism and concreteness. 
This taxonomic system is viewed as a struc- 
tural model of the relative degree of cognitive 
differentiation among the various interpersonal 
resource classes. Although Foa and Foa have 
not provided precise definitions of these 
resource classes, attributes consistent with 
common usage have been suggested. Thus, 
“Jove” is an expression of affectionate regard, 
warmth, or comfort; "status" is an expres- 
sion of evaluative judgment, which conveys 
high or low prestige, regard, or esteem; “in- 
formation" includes advice, opinions, instruc- 
tion, or enlightenment, but excludes those 
behaviors which could be classed as love or 
status; “services” involve those activities on 
the body or belongings of a person which 
constitute labor for another; “goods” are 
tangible products, objects, or materials; 
“money” is any coin, currency, or token which 
has some standard unit of exchange value. 

This classification pertains to the meaning 
assigned to behavior rather than to behavior 
per se. Although the success of any given ex- 
change is predicated on a shared understand- 
ing and expectancy of appropriate action, it is 
often the case that the same behavior will 
vary in meaning across different contexts. At 
certain athletic events, for example, it is 
common to observe teammates giving encour- 
agement and praise to one another by patting 
each other on the posterior. T he same behav- 
ior between executives at a business confer- 
ence is probably quite rare and subject to a 
different interpretation. Likewise, a football 
player who receives a memo from a teammate 
commending his merit is likely to feel consid- 
erable ambiguity regarding the appropriate 
interpretation of this behavior. The point is 
that the meaning of a given behavior may be 
contextually specific, while the same meaning 
may be conveyed by different behaviors. 

If there is no one-to-one correspondence 
between behavior and meaning, it appears 
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Concreteness 
More More Less 
Love 
Particularism Services Status 
Goods Information 


Less Money 


Fic. 1. Classification of resources by particularism 
and concreteness. (The six resource classes are shown 
in italics. See text for explanation.) 


advisable to gain some understanding of each 
system before attempting to relate one to 
the other (Foa & Turner, 1970). Our present 
concern is with the structure of meaning. 


The Structure of Resources 


A hypothetical structure of the six resource 
classes was obtained by locating their position 
on two conceptual dimensions: particularism 
and concreteness (Foa & Foa, 1971). The di- 
mension of particularism derives from the 
writings of Parsons (1951) and Longabaugh 
(1963) and is similar to Blau’s (1967) no- 
tion of intrinsic and extrinsic rewards. It 
refers to the extent to which the value of the 
resource is influenced by the particular person 
who delivers it. In operant terminology this 
dimension may be thought of as the degree to 
which variables associated with the agent of 
reinforcement constitute discriminative stim- 
uli affecting the salience of the reinforcer. In 
general, this dimension ranges from love, the 
most particularistic resource, to money, the 
least particularistic one. It matters a great 
deal from whom we receive love, since its rein- 
forcing effectiveness is closely tied to the per- 
son stimulus. Money, however, is most likely 
of all the resources to retain the same value, 
regardless of the relation between, or charac- 
teristics of, reinforcing agent and recipient, 
Services and status are less particularistic 
than love, but more particularistic than goods 
and information. 

The second dimension is concreteness, rang- 
ing from concrete to symbolic. It specifies the 
form or type of expression characteristic of 
the various resources. Services and goods in- 
volve the exchange of some overtly tangible 
activity or product and are classed as con- 
crete. Status and information, on the other 
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hand, are typically conveyed by 


paralinguistic behaviors and thus hi 
symbolic. Love and money are excha 
both concrete and symbolic forms and oc 
therefore intermediate places on this d 
sion. The position of the six resource cl 
plotted on these two conceptual dimensi 
shown in Figure 1. 

The circularly ordered taxonomy genel 
by these dimensions is interpreted as a 


tural model of the cognitive « 
resources, which specifies th 
similarities and differences. Several hy 
can be generated from the pro 
The following ones are explore 
entation. 


$; Similarity-dissimilarity h wpothesiss 


order, the more they will be per gala 
lar. Conversely, the more two give 
are distal in the order, the more 
perceived as different. 

2. Exchange hypothesis: Given the 
that is most preferred in exchenge for a 
ticular resource, the likelihood of pre 
other resources decreases progressively 
more distal they are in the order from 
resource most chosen. d 

3. Structural invariance hypothesis: 
pattern of interrelationship among the 
resource classes follows their order andi 
variant with respect to the exchange sit : 


'These hypotheses are tested here im 
erence to the cognitive structure of the 
ject. Experimental work designed to 
order in overt behavior is now in progres 
will be reported separately. 


SIMILARITY AND SUBSTITUTION ! 


This study is concerned with the pl 
tual-cognitive differentiation of the SF 
source classes. Subjects received a serie 
messages, each representing a parti 
source class. Their task was to return, 
prearranged array of resource messag 
message most similar, as well as the 0! 
dissimilar, to each resource message ré 
Subjects were denied the option of retu 
messages from the same resource C8 
which the message received belonged. 


l tioned to create 


Ba q^ 24. 
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Method 


Subje: s in the study were 37 under- 
graduate le and 14 male) from introductory 
classes in y and education at the University 
of Miss: umbia. Volunteers were solicited 
from the 'stensibly to participate in a study 
of “Dyad rsonal communication under con- 
ditions o i cues" About half of the total 
subject p ame from each class, with similar 
ratios of n > females in each. 

Resour res, The items used to represent the 
six resour isted of short statements which 
were indivi typed on 3 X 5 index cards. Exam- 
ples of th ms are: I feel affection for you 
(love) ; you ings very well (status); here is my 
opinion (ir ition); here is some money for you 
(money) ; | a package for you (goods); I ran 
that errand ou (services). Three messages were 
written for source, giving a total of 18 mes- 
sages. 

Each me vas phrased to imply that a par- 
ticular reso was being given to the subject. 
Effort w to provide messages which were as 


general ible and to avoid phrasings which 
Suggested a ‘cular form or quantity of the re- 
source. We sought to avoid statements whereby 
the giving nue resource might imply the taking 
away of i resource, thus expressions like 


"favor" or ivice” were eliminated. 

All messag "presenting a given resource were 
pretested on ber group of subjects to see whether 
they were perceived as belonging to the same class. 


— 11) sorted the total deck of 18 

many different categories as they 
priate. Only 1 subject used more than 
erforming this task. Although sev- 
5 ly used less than six categories, 
in each cas was a result of combining neigh- 
boring resou nto the same category. The most 
Common tendency was to use six categories of un- 
equal numbers. Here again, it was always neighbor- 
mg resources that were combined. The most com- 
mon “error” was to combine love with status and/ 
®r goods with money. When further instructed to 


six categoric 
eral subje 


Sort the cards into six different categories of three 


Cards each, there was substantial agreement across 
Subjects that each triplet of messages belonged to 

e same distinctive class. 

Experimental setting and procedure. Subjects re- 
Ported individually to an office at the Center for 
Research in Social Behavior which had been parti- 
an experimental cubicle. The sub- 


- Ft was seated at a desk facing a small message slot 


fading to an adjoining office. The message slot was 
Shielded to Prevent any visual contact between the 
two rooms, The experimenter introduced each sub- 
Jeet to the experimental situation by reading a 
Standardized form which provided both a rationale 
and Instructions for the task. 

Subjects were first told that the experiment was 
amed at studying interpersonal communication when 
certain cues such as facial expression, tone of voice, 
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gestures, and a referential context were eliminated. 
It was then explained to the subject that another 
subject (actually an experimental confederate) would 
be sending him a series of messages through a slot in 
the wall. The “other subject” remained essentially 
anonymous, but was briefly described as being an 
undergraduate who was of the same sex and age as 
the subject. For each message delivered to him, the 
subject was asked to return two messages: the most 
similar and the most dissimilar to the message he 
received, with respect to general thought and feeling. 
Subjects were further informed that their partner 
was making the same judgments and that we were 
interested in whether their choices would show suf- 
ficient matching to indicate a mutual understanding. 
No time limit was imposed on the subjects, and they 
were instructed to be as accurate as possible. 

The sequence in which messages were received by 
the subject was randomized prior to the experimental 
session. The decks of messages available for the sub- 
ject to choose from were also previously randomized 
within resources. Each of these decks contained five 
messages, one from each resource class except the one 
the subject received on that trial. An experimental 
session consisted of 18 trials, that is, 1 trial for each 
message, 3 trials for each resource. The subject’s 
responses were recorded by the confederate, At the 
end of the session, the subject was interviewed as to 
the acceptability of the cover story and the difü- 
culties he may have encountered. Subjects were then 
cautioned not to discuss the experiment with any- 
one for a period of 3 weeks, and the true purpose of 
the study was then explained to them. 


Results 


The conditional frequency distribution of 
resources returned for each resource received 
was computed. Table 1 presents the frequency 
of the various resources, which were returned 
as being most similar to the one received. 
Since the proposed order is circular, the high- 
est frequencies should be in the two cells bor- 
dering the main diagonal. Lower frequencies 
were predicted for the cells two steps removed 
from the diagonal and the lowest frequency 
for the cell which is three steps removed, or 
most distant, from the diagonal. Examining 
both rows and columns in the matrix reveals 
that with the exception of three deviations, 
this is precisely the order obtained. 

Table 2 presents the subject’s judgments as 
to which resource was most dissimilar to re- 
source received. In this matrix the predicted 
order of frequencies is the converse of that 
obtained in Table 1. The highest frequency 
should fall in the cell three steps removed 
from the diagonal, with a corresponding de- 
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TABLE 1 
PERCENTAGE FREQUENCY DISTRIBUTION OF RESOURCE RETURNED AS BEIN 
Most Like RESOURCE RECEIVED 
Resource returned 
Resource uv 
Love Status Information Money | Goods | Se All resources 
Love = 65 10 0 2 2 100 
Status 62 -- 20 10 3 S 100 
Resource information 17 34 — 11 24 14 100 
Received money 0 16 8 = 60 16 100 
Goods 6 5 21 55 — 1 100 
Services 41 18 7 16 18 - 100 
All resources 21 23 11 15 18 1 100 
crease in frequency as one approaches the di- resource but only one resou: which is 
agonal from either direction. Again, with the farthest away, these subjective orts are 
exception of two minor deviations, the pre- consistent with the order. 
dicted order was obtained. Moreover, no other E R A 
ordering of the resources (including ordering DOE ESOU] 
them on each dimension separately) would The exchange hypothesis staics that the 
produce as few deviations as the predicted frequency with which the various resources 
one. are exchanged, in a given situation, will fol- 
To test the homogeneity of rank order in low the order proposed. Thus, n the res 
the distribution of frequencies, the Friedman source which is most preferred exchange 
two-way analysis of variance by ranks was for a certain resource, the relative frequency 


computed for rows and columns in both Ta- 
bles 1 and 2. The level of significance ob- 
tained in these tests ranged from .029 to 
.00013, indicating that the distribution of 
frequencies did not arise by chance and that 
the deviations are insignificant. 

In the postsession interview, a substantial 
number of subjects volunteered the informa- 
tion that judgments regarding the most simi- 
lar message were considerably more difficult 
to make than judgments as to which message 
was most unlike the one received. Since the 
circular order suggests two neighbors for each 


of choice for the other resources will decrease 
iollowing the order. To test this hypothesis, 
an instrument called the “Social luteraction 
Inventory" was devised and administered t0 
160 freshmen taking introductory psychology 
at the University of Missouri, C Jolumbia. 

In this inventory the subject is presented 
with six situations, in each of which he gives 
a certain resource to another person. For 
love, the situation is described thus: “You 
convey to a person that you enjoy being with 
him and feel affection for him." For status, 
the situation is: “You convey to a person your 


TABLE 2 
PERCENTAGE FREQUENCY DisTRIBUTION OF RESOURCE RETURNED AS BEING 


Most Unlike Resource RECEIVED 


= —— D—— —— 
Resource returned 

Resource Ex. — 

Love | Status Information | Money Goods Services All resources 

N : ERE 
Love — 4 19 44 26 7 100 
Status 3 — 17 24 28 28 100 
Resource information 18 10 =i 18 22 32 100 
Received money 51 9 19 — 3 18 100 
Goods 39 16 24 5 16 | 5 
Services 11 18 33 24 14 100 
All resources 20 10 18 19 16 17 100 
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TABLE 3 
PERCENTAG E NUMBER OF CHOICES OF RESOURCES, CONTINGENT UPON N RESOU RCE GIVEN 


Resource given 


Resource chosen in exchange 


Information | 


it is as follows: “You give a person certain 
objects that you possess.” Each situation is 
followed by 15 pairs of items, where each item 
‘in the pair belongs to a different resource 
lass. Since there are six classes of resources, 
the 15 pairs allow for the combination of 
tach resource with every other one. In each 
‘Pair the resource is represented by a different 
item, so that there are five items for each 
dass, The subject was instructed to choose in 
pair the item which he preferred in ex- 
Bener for what he had given. 

‘ - Examples of the items are as follows: For 
love: “The person indicates that he wants to 
be your friend.” “The person says he is fond 
Of you.” For status: “The person praises 
you." “You are told that the person has con- 
Ee in your abilities." For information: 
The person gives you the benefit of his 
familiarity with a certain subject.” “The per- 
‘on makes you familiar with new facts.” For 
oney: “You receive payment from the per- 
son.” “You receive cash from the person." 
For goods: “The person gives you some 
merchandise. ” “The person gives you a cer- 
an product.” For services: “The person 
tuns an errand for you." “The person bored 
imself available to do some work for you." 
To balance sequential effects, both the order 
presentation of the situations across sub- 
S and the order of presentation of paired 
mparison within each situation were ran- 
ized. The frequency of resources pre- 
ed as exchange was then calculated for 
of the six stimulus situations. 


ha 


A 
d m Lo ve Status | Money | Goods Services All resources 
Love (SET yee LIS 14]. TER 
Status | 26 27 21 | 2 | 8 16 | io 
Information | 23 21 25 | 5 10 16 | 100 
Money 11 | 14 15 | 22 | 16 22 | 100 
Goods | 21 (TEST 16 | PR) 18 19 | 100 
Services 20 | 17 18 9 11 25 100 
All resources | — 22 20 10:4 Tis | 12 19 ^| 100 
Tespect and esteem for his talents." For goods, Results 


Table 3 presents the percentage with which 
each resource was preferred in exchange for 
each resource given. Since in each stimulus 
situation a specific resource appeared only five 
times in the 15 paired alternatives, it could 
not be chosen more than one-third of the 
time. Consequently, the highest possible per- 
centage is 33%, while the lowest is, of course, 
Zero. 

The hypothesis that for each resource given 
the frequency of the resources desired in ex- 
change will follow the proposed order is well 
supported by these data. In the first row, for 
example, where the resource given is love, this 
resource is also the most chosen in exchange. 
Moving from left to right on the first row, the 
frequency of choice decreases for status, still 
further for information, and reaches the low- 
est point for money. Then it increases again 
for goods and more so for services. Thus, 
starting from the resources with the highest 
frequency, the pattern shows a decrease and 
then an increase, as the most preferred re- 
source is approached from the other side of 
the circular order. The same pattern is also 
found in the other rows of the table, with 
three deviations: the frequencies of love in 
exchange for information, services for money, 
and information for services are all higher 
than predicted. Here, as well as in the first 
study, deviations generally involve informa- 
tion, suggesting that this class of resources 
may require a closer scrutiny. 

The test of similarity has shown that the 
resource least similar to any given one is two 
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steps removed from the latter. In the exchange 
pattern of Table 3, however, the resource 
which is least exchanged is usually money, 
independently of the resource given. The only 
exception is when money was given: here, the 
least likely exchange is with love, the oppo- 
site of money. Although this feature of the ex- 
change pattern does not alter the order, it re- 
quires some explanation. The situations of 
the Social Interaction Inventory implicitly 
refer to an exchange between friends. In the 
institution of friendship, as in any social in- 
stitution, certain resources are more appropri- 
ate than others, Resources suitable for one 
institution may be scarcely used in another 
institution (Foa & Donnenwerth, 1971) so 
that each institution may be characterized by 
its own profile of the relative frequency of 
resources. The profile obtained for friendship 
is given in the bottom row of Table 3: the 
highest frequency is for love and the lowest 
for money. The remaining resources fell be- 
tween these two extremes, decreasing and 
then increasing gradually, according to the 
order. The profile of resource frequencies 
would be different for other institutions. In 
banking, for example, one would expect 
money to have the highest frequency, while 
the frequency of the other resources would 
decrease as one moves toward love. Thus, the 
distribution found in each row of Table 3 
depends not only on the resource given, but 
on overall institutional constraints as well. 
These constraints are also reflected on the 
most frequent resource in each row. Among 
friends it might be appropriate to reciprocate 
information with information or with services. 
At the drugstore, however, the appropriate 
exchange for the information contained in 
the daily newspaper is a coin. While institu- 
tion and role may prescribe which exchanges 
will occur most and least often, the frequency 
of the remaining exchanges appears to fol- 
low the order. 


INVARIANCE OF THE STRUCTURE 


The third hypothesis states that the pat- 
tern of interrelationships among the six re- 
source classes follows their order and is in- 
variant with respect to the exchange situa- 
tion. Resources proximal in the order will 


correlate higher than distal on: feature 
of the correlations pattern w change 
across exchange situations. To a test- 
ing of this hypothesis, the So craction 
Inventory was once again u~ t in a 
modified form, allowing for ex otal im 
dependence among the resp In the 
previous version the subject h hoices 
to allocate among the resourc: at the 
more he chose a resource, the f: re the 
choices left for the other ones. ! study, 
each statement pertaining to a p ur class 
of resources was presented sepa and the 
subject was asked to rate it : *-point 
scale, ranging from highly desira not at 
all desirable. For each resourc: e states. 
ments were rated. The score on source 
was obtained by adding the p of the 
three statements pertaining to it rating 
procedure was repeated for the imulus 
situations, that is, for each resource ven by 
the subject to the hypothetical person. 

The subjects were 120 college freshmen 
enrolled in an introductory course in sociol- 


ogy. Each subject received 36 ss, with 
each one indicating the desirablen f a par- 
ticular resource in exchange for ino resource 
given by the subject to the other pcrson. The 
means of the 36 scores were first computed, 
as done in the previous study. Their pattern 
was highly similar to the one given in Table 
3, thus providing a replication of t previous 
study. The scores were then inter orrelated. 


Results 


The hypothesis of structural invariance 
leads to the prediction that the pattern of 
intercorrelation will not change when either 
the resource given by the subject or the re 
source desired by him in exchange are varied. 
Two sets of 6 X 6 intercorrelation matrices 
were compared. In each matrix of one set, 
resource given was kept constant, while 
resource desired varied (e.g, when the SU? 
ject gives love, how much each resource is 
desired in return). In the other set, the 7€ 
source given varied, while the resource desi 
remained constant within each matrix, 
was, of course, different for each matrix (e. 
how much love is desired in return for 
resource given by the subject). From cach 
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TABLE 4 


Kesoveces Ax Given 


1 40 » 6 §2 

$2 26 M4 45 

i n 56 Sst 

42 66 5 

D 56 66 oo 

is St Eri [2] 

e examples are presented and 
4 the intercorrelation of pref- 


for ving status when various re- 
ts arc «ven. For example, the coeffi- 

f 61 the first row, second column, 
'rrelation between the desires 
status when love is given and when 
. The relative sizes of these 
nts as predicted; higher for re- 

we proximal in the order and 

listant resources. In the first 
able, for example, the correla- 

ve and status, two neighbors, 
high. 1t then decreases for informa- 
cases still more for money. With 
ods, the correlation begins to increase and 
| even higher for services. This pattern is 
peated in the other rows, with only two 
ions from the predicted order: the cor- 
on between status and goods is higher 
expected, and the correlation between 
mation and money is lower than ex- 
d. The lowest correlation, however, is not 
between resources two steps removed 
order. More often it is between re- 
one step removed, going in counter- 
e direction. 


TABLE 5 


1 CORRELATION AMONG PREFERENCES FOR GOODS 
- Wien DIFFERENT Resources ARE GIVEN 


IuTERPERSONAL RriNFORCENRS 


ELA > Peermeences roe STATUM 
x I 


TABLE 6 
Ivreecosertarion awoxo Prevexences ros 
p Duresest Raov Wes Love Is Givex 
= ul RA RG Po agra | 
M | Lave | Sates | Sit ian oma $E 
Love 7 a -0 n n» 
Status 71 PI "n «4 D 
Information| 17 23 2$ "à 6% 
-04 17? u AR 
Goods | 2 u 4 74 os 
Services 26 80 w A4 08 


Table 5 shows the intercorrelation among 
preferences for goods in exchange for the 
various resources given. It is readily apparent 
that this matrix also preserves the predicted 
pattern with higher coefficients for resources 
proximal in the order and lower ones for 
more distant resources. In this table there are 
no deviations, and the lowest correlation is 
usually found between resources which are 
two steps removed from one another. 

The remaining four tables for the other re- 
sources present a similar pattern, although 
sometimes with a few more deviations. Thus, 
the interrelationship among the resources 
given follows the order, irrespective of the re- 
source desired. 

We turn now to examine the intercorrelation 
among preferences for different resources when — — 
the given one is constant. As previously, we 
shall give two examples out of the six avail- 
able. Table 6 shows the correlation among 
preferences for various resources when love is 
given, In Table 7, the constant resource given 
js money. In both tables, the relative size of 
the coefficients follows the pattern previously 
noted. There are, however, a few more devi- 
ations: four in each table. Half of the devia- 
tions again involve the resource of informa- 


| 
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TABLE 7 


INTERCORRELATION AMONG PREFERENCES FOR 
DIFFERENT Resources Wnes Money Is Given 


Riven | Love | Status | Infor | Money| Goods oo 
EEMENEESS tors 

67; (53742 aS AD 

67 44/752 aR 345 

nation| 53 74 60. - 55- .54 
42-52 oe 65 ól 

45:7 49 ^ Somers 77 


49. 45 54 61 7 


Reed | Love | Status Infor- | Money| Goods | Sers 
Love 56 40 -22 22 21 
Status 56 38 -11 26 36 
Information|! 40 — 38 -04 35 33 
Money -2 -11 -04 49 32 
Goods 22 26 35 49 65 
Services 21 36 33 32 65 
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tion, as in some of the results discussed 
earlier. Tables for the remaining resources 
also appear to support the order. These inter- 
correlation tables suggest that the interrela- 
tionship among preferences of resources re- 
mains the same, irrespective of the resources 
given. 

In conclusion, the same order of relation- 
ship appears among the resources given and 
resource desired when the other resource in- 
volved in the exchange is varied. These re- 
sults provide evidence that the order of re- 
sources is invariant across exchange situations. 
Whether it will remain the same when the 
institution and/or the culture change remains 
to be seen. 


DISCUSSION AND CONCLUSION 


The results of these three studies support 
the circular order of interpersonal reinforcers, 
predicted by plotting them on the two dimen- 
sions of particularism and concreteness. Rein- 
forcer or resource classes proximal in the 
order appear more related and more likely to 
be substituted one for another than distal 
classes. Although preference for receiving a 
resource is contingent upon the resource given 
and perhaps also upon the institutional situ- 
ation, both of these factors seem to leave 
the order invariant. In discussing these re- 
sults, it seems appropriate to consider their 
limitations before examining some implica- 
tions and applications to further work. 

The present investigation was primarily 
concerned with the relative distance between 
resources as implied by their respective posi- 
tions in the circular order. Although no pre- 
dictions were made regarding the absolute 
distance between any of the resource classes, 
one might be tempted to infer from the results 
that certain proximal resources are closer than 
others. Examination of Table 1, for example, 
reveals that one neighboring resource is always 
returned with a considerable higher frequency 
than its counterpart. This aspect of the data, 
however, does not seem to have wide gen- 
erality. The degree of similarity between any 
two neighboring classes is likely to vary as a 
function of the particular items representing 
each class. Fondling and kissing are very 

concrete ways of expressing love and thus 
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probably closer to services tha ) status. 
Conversely, a verbal expression iffection, 
such as “I like you very much," is ily sym- 
bolic and more likely to be seen ilar to 
status than to services. Likewi insump- 
tion goods are probably neare: services 
than durable goods, while service he body 
will be nearer to love than serv! o one's 
belongings. 

The present data are limited > cogni- 
tive structure of interpersonal rei ers and 
do not extend to overt behavior. V person 
who is prevented from expressi ove be 
more likely to give status than i nation? 


Will an individual, barred from p 
saulting somebody, be more likel 
than to deceive him? Only furth 
show to what extent the order í 
categories is reflected in overt bel 


with respect to roles and institut m 


ly as- 
; insult 
rk will 
gnitive 

e 


An additional restriction of th 


tions. In the first study we tried to ^^ as non- 
specific as possible in selecting resource in- 


stances, the situational context w eft am- 
biguous, and the resource sende: mained 
essentially anonymous to all subjects. The 
intent of this nonspecificity was to allow sub- 
jects a freedom which would maximize the 
possibility of getting a distribution of re- 
sponses and thus minimize the pos ibility of 
obtaining very low or zero frequencies in 
many of the matrix cells. Had we, for exam- 
ple, told the subjects that the sender was their 
mother, the frequency with which love was 
seen as being most similar to services would 
have increased considerably, with a corre- 
sponding reduction in frequency for the other 
resources. Likewise, services received from à 
store clerk are more likely to be seen as simi- 
lar to the provision of goods, with other, re- 
sources only minimally related. By placing 
subjects in a task where both the sender and 
the context of social communication were un- 
defined, we sought to make a more general 
statement concerning the cognitive structure 
of resources in the communication process. In 
the remaining two studies, it was implied that 
the target person was a friend or an acquaint 
ance. The effects of this specific institution? 
situation are clearly seen in the marginal dis- 
tribution of resources in Table 3, which differ 


^ 


fable 1, where no particular 

was suggested. Specifica- 
tion modified the frequency 
distributior the order undisturbed. 
Repeating : idy for a different institu- 
tion may lil reserve the order, but the 
frequency ion will change. In fact, 
the freque: istribution changed, even 
within the s nstitution, depending upon 
the resource (see Table 3). By virtue of 
its invarian notion of order appears to 


from the on: 
institutional s 
tion of the 


be more ge: xd useful than the hierarchy 
of frequenc posed by Premack (1965). 
In a very ic situation, order and hier- 
archy of fr: es are identical. But, while 
the order a; to remain invariant across 
situations, irequency distribution does 
change. Consequently, the combined distribu- 
tion of two o re institutional situations is 
unlikely to reveal the order. The combined 
distribution he institutions of friendship 
and trade, í ample, will probably be bi- 
modal with peaks at love and money. Such 
distribution would suggest the wrong conclu- 
sion, that love and money are closely related. 

The results obtained show that preference 
for a given i yersonal resource varies across 


Situations. The effectiveness of such a resource 
as a reinforcer may likewise vary, so that a 
teinforcer disial from the most powerful one 
is likely to prove ineffective. Jecker and Landy 
(1969) indeed found that giving money, in 
various amounts, to the subject did not in- 


crease his liking for the giver, and perhaps 
even decreased it somewhat. This unexpected 
Tesult could have been predicted from the 
slightly negative correlation between pref- 
erences for money and for love when money 
Was given (see Table 7). 

The ordered classification of resources may 
change descriptive concepts such as “reci- 
Procity” (Gouldner, 1960) and “legitimacy” 
(Jones, 1964) into empirically testable propo- 
Sitions regarding the resource specificity of 
exchange patterns. The data of Table 3 sug- 
gest, for example, that exchange with the 
Same or with a proximal resource will be 
Viewed by the recipient as a more acceptable 
reciprocation than exchange with a distal 
tesource, Similarly, legitimacy will be higher 
when the resource that is overtly bidded for is 
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near the one covertly sought after. Proposing 
friendship may be more legitimate when what 
one really wants is status than when the con- 
cealed motive is to solicit money. 

The notion of ordered relationship among 
resources may also have significance for the 
study of “interpersonal conflict,” “bargain- 
ing,” and “conflict resolution” in experimental 
games. Presumably, due to the considerable 
problems associated with quantifying the less 
concrete resources, the rewards employed in 
such studies have typically been money, tok- 
ens, or "points." The general finding seems 
to be that real money in varying amounts, 
imaginary money, and points all result in 
similar behavior on the part of subjects 
(Vinacke, 1969). Although some investigators 
(e.g, Gumpert, Deutsch, & Epstein, 1969) 
have reported that different incentives result 
in different strategies, it is not always clear 
exactly which reward criteria the subjects are 
responding to. In dismissing the notion that 
points and imaginary money are trivial in- 
centives, Deutsch (1969) suggested that “It 
is precisely because ‘doing well’ and ‘being 
approved and well regarded’ by the experi- 
menter may be quite important to the sub- 
ject that the incentive of earning points is 
usually not an insignificant one for most sub- 
jects [p. 1086]." This is, of course, a reason- 
able assumption, but its relevance is marred 
by the fact that investigators do not always 
control such contingencies. Receiving approval 
or being liked by the experimenter may or 
may not be synonymous with the more ex- 
plicit rewards. It also seems apparent that 
subjects in such studies maintain “situated 
identities” (Alexander & Weil, 1969) and that 
they may adopt self-presentation strategies 
which bear no necessary relationship to com- 
petitive or cooperative responses. It would be 
of some interest in this regard to design pay- 
off matrices where competitive and coopera- 
tive tendencies lead to qualitatively different 
rewards, rather than differing amounts of the 
same reward, Extending payoff matrices (Thi- 
baut & Kelley, 1959) to include the whole 
range of resources could provide a meaningful 
test of systematic differences in the goal-seek- 
ing behaviors associated with various types 
of incentives. It seems possible that strategies 
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for obtaining various resources will be less 
similar the more these resources are distal in 
the order. If one plays only for money, for 
example, it will probably make no difference 
whether he gains at the beginning or at some 
later stage of the game. In playing for status, 
however, early success may be more decisive 
than later performance, as shown by a study 
of the primacy effect in attribution of ability 
(Jones, Rock, Shaver, Goethals, & Ward, 
1968). 

The investigation of behavior patterns as- 
sociated with each resource is complicated by 
the fact that in social-psychological research, 
as well as in everyday life, a particular rein- 
forcing event is often a combination of two 
or more resources. To cite a well-known ex- 
ample, a lactating monkey mother in nursing 
her infant provides maternal care (services) 
in which both nutriment (goods) and tactual 
comfort (love) are component reinforcers. 
Prior to Harlow’s (1958) classic study, how- 
ever, it was commonly assumed that nourish- 
ment (being the primary reinforcer) was the 
only salient resource contingency, and little 
justification was available for examining the 
differential effects of these different types of 
rewards. It was not until Harlow was able to 
demonstrate the overwhelming importance of 
“contact comfort” in establishing the infant’s 
attachment and affectional responses to its 
mother that an empirically viable distinction 
emerged. Similarly, broadly defined constructs 
such as social approval may obscure the dif- 
ferential effects of love and status in human 
interaction, Research findings reviewed by 
Aronson (1969) have, however, shown that 
the most gifted group members are not 
necessarily the most liked, and evidence pre- 
sented by Jones and Jones (1966) has demon- 
strated that respect and liking represented 
two distinct dimensions. 

Specification of the resource classes in- 
volved in a particular situation appears all the 
more necessary, since each class possesses 
different exchange characteristics. Some of 
them seem related to the position of the class 
in the order, so that proximal resources are 
more similar in these properties than distal 
ones. It appears appropriate to conclude this 
paper with a brief review of properties which 
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vary with the position of the irce in the 
order. 
Order-Related Properties of KR rces 

Our results tend to indicat: love and 
money, placed at opposite po f the par- 
ticularistic dimension, are the dissimilar 
resources with the lowest fr cy of ex- 
change. It appears that thes: urces also 
differ most with regard to « her prop- 
erties, while the other resour: d to fall 
between these two extreme: rding to 
their position in the order (Fo: oa, 1971). 
Stating the values for love an money on 
these properties will therefore fficient to 
provide an idea of the values ed by the} 
other resources as well: service: and status 
will be more similar to love tha formation 
and goods, while the latter c! will be 
more similar to money than thi ler ones 

Relationship between selj an her, The 
relationship between giving th ource to 
the other and giving it to self is positive for 
love, but decreases and becom« gative as 
one moves from love toward its opposite in 


the order, money. This predict: 
to the intuitive notion that giving love to 
others also requires self-acceptance (Fromm, 
1956; Rogers, 1961, p. 51). With regard to 


money, quite the opposite is true; the more 
we give away, the less we have for ourselves. 
In consequence, an exchange of money can be 


a zero-sum game, but an exchange of love 
cannot. 

Relationship between giving and taking. M 
love, there is usually a certain degree of am- 
bivalence, even in normal individuals: giving) 
love does not exclude the concurrent present 
of some hostility, or taking away of love 
However, giving and taking away money are 
unlikely to occur in the same act. 

Conservation. Money can be kept or saved 
for a long time for future exchange. Love; b 
the other hand, cannot be preserved 4 
separated from the exchange situation. There 
are no deposit or storage facilities for love. 

Time for processing input. Giving and ™ 
ceiving love cannot be done in a hurry, it rea 
quires time. Money, on the contrary, a 
change hands very rapidly. 
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Delay of re Love is a relatively long- 
term invest th rewards beginning to 
be reaped o ter several encounters; a 
friendship n: be “cultivated.” An ex- 
change of m ith some other resource, on 
the other han | be completed in a single 
encounter. 

Optimun size. It has been noted 
(Carpenter ) that in animal species liv- 
ing in groups, such as monkeys and apes, 
there is an nal group size, presumably 
related to put processing capacity of 
the species 1 this size is exceeded, ag- 
gressive be and other tendencies dis- 


ruptive of life greatly increase (Cal- 
houn, 1962). in the human species, the op- 


timal group for an orderly exchange of 
resources may vary with the resource class, 
being smallest for love and largest for money. 
Latané and Darley (1969) have indeed found 
that the larger the number of bystanders, the 
less likely it is that anyone of them will inter- 
vene in an emergency situation, Helping be- 
longs to the class of services, a neighbor of 
love in the or“er, and the optimal group size 
of this resource appears to be small, accord- 
ing to the findings of Latané and Darley. On 
the other hand, large groups are involved in 
trading in a stock or commodities exchange, 
and access to a large market is considered 
advantageous by businessmen. 


Consequences for City Life 


The last three properties (time required 
for processing, delay of reward, and optimal 
group size) converge in rendering the urban 
environment more suitable for the exchange 
of nonparticularistic resources than for par- 
ticularistic ones. The significance of processing 
time has been noted by Milgram (1970), who 
proposed that reducing the time allocated to 
each input is an adaptive response to the 
overload of interpersonal stimuli characteriz- 
ing an urban center. If processing of inputs 
requires more time for love than for less 
particularistic resources, it follows that in the 
City, love will be curtailed more than the 
latter ones. In an urban setting, many inter- 


“Fred M. Marcus. Natural mechanisms for the 
control of optimum populations: Do they exist? 
hpublished manuscript, 1969, 
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personal contacts occur only once, while love, 
unlike money, requires at least several en- 
counters to be exchanged. Finally, if optimal 
group size is smaller for love than for money, 
the large metropolitan crowds will again favor 
nonpersonalistic exchanges over personalistic 
ones. 

One of the consequences of this selective 
process of resource exchange is the facilita- 
tion of antisocial or asocial behavior in urban 
society. Particularistic resources, especially 
status, are powerful instruments of social 
control: a person who misbehaves is likely to 
lose status in the community long before he 
runs into conílict with the law and meets its 
less particularistic forms of punishment. The 
relative scarcity of particularistic exchanges 
in the city deprives society of informal means 
of social control, so that individuals behave 
less responsibly in the metropolis (Zimbardo, 
1969). The difficulties posed by an urban 
environment to particularistic exchanges will 
also result in isolation and alienation, since 
the feeling of belonging is provided by love, 
the resource with the highest positive relation- 
ship between self and other. 

In concluding, it is suggested that an or- 
dered classification of resources may provide 
a useful framework not only for laboratory 
investigation, but also for understanding some 
of the problems of society. Further research 
in this direction may, on the other hand, lead 
to refinements and modifications of the pro- 
posed model. 
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University of Alberta, Edmonton, Canada 


tudy 


examined the effects of various thwarting conditions on cardiac 


ponse and subsequent aggression. Subjects were asked to learn a list of non- 
ase syllables, with verbal reinforcement to be administered by a peer. 
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1971, Vol. 19, , 181-187 
| EFFECTS OF THWARTING ON CARDIAC RESPONSE 


cis received either an easy list with neutral comments from their peer, a 
ut list with neutral comments from their peer, or a difficult list with 
y comments from their peer. 
-, and high-thwarting conditions. The next phase of the experiment 
ed the subjects with the opportunity to reverse roles and teach their 
. list of syllables, with administration of shock as reinforcement. The 
ts indicated that insult (high thwarting) enhanced both cardiac and 
ive responses, as expected. Low- and moderately thwarted subjects did 
lifer in physiological activation, thereby providing no support for the 
ture that frustration per se produces a physiological readiness to re- 


These conditions served to induce low-, 


| aggressively. The finding that low-thwarted subjects delivered more 


ve responding) is emphasized. 


n instigation. 


The notion th 
sion has not 
in situations \ 
sponse folloy 


ut frustration leads to aggres- 
1 well supported, especially 
iere an overt aggressive re- 
e frustration but does not 
facilitate attainment of some nonaggressive 
goal (Berko & Le Page, 1967; Buss, 
1963; Epstein & Taylor, 1967; Loew, 1967). 
The effects which have emerged in the study 
9f the frustration-aggression hypothesis led 
Buss (1966) to contend that insult is the 
major antecedent of aggression and Berko- 
Witz (1965) to propose that frustration, de- 
fined as interference with ongoing goal re- 
Sponses, is most likely to result in aggression 
in the presence of strong, aggressive cues 
(internal or external). Geen and Berkowitz 
(1967) have argued specifically that frus- 
tration produces a readiness to respond 
aggressively, a predisposition which, with 
Appropriate cues, leads to aggression. 

Does this readiness manifest itself in in- 
“teased tension or physiological activation? 


^ ‘This research was supported by a Canada Council 
rant to the first author, Thanks are extended to 
aid Rehill for collecting the data. 
* Requests for reprints should be sent to Lynn 
tewart Hewitt, Department of Psychology, Univer- 
Sty of Alberta, Edmonton 7, Canada. 


ces to their peer than did moderately thwarted subjects was interpreted in 
of attribution theory. The need to specify conditions under which the 
‘vation is to injure the other rather than to help the other learn (via 


Several significant cardiac responses are 


d as supporting Elliott’s notion that heart rate reflects activation or 


Hokanson and his colleagues (Hokanson & 
Burgess, 1962; Hokanson & Shetler, 1961) 
have demonstrated that a condition involving 
harassment and failure increased physio- 
logical activation, but they did not examine 
the effects of frustration alone on activation. 
Furthermore, they did not demonstrate 
changes in aggressive responding concomitant 
with physiological changes (Hokanson & 
Burgess, 1962). 

Epstein and Taylor (1967) found that a 
partner's perceived aggressive intent, but not 
defeat alone, in a game intensified aggressive 
and physiological responses. Because the no- 
tion that frustration predisposes a person 
toward aggression is a part of current theory 
(Berkowitz, 1969), the possibility of con- 
comitant physiological activation demands 
continued exploration. The purpose of this 
study was to examine the effects of three 
thwarting conditions, two involving frustra- 
tion and one involving frustration plus insult, 
on cardiac response and subsequent aggres- 
sion. It was expected that insult would en- 
hance both overt aggressive and physiological 
responses, whereas frustration would intensify 
only physiological activity. 
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METHOD 
Subjects 


The subjects were 90 men, who participated for 
$1.50 or one experimental credit toward an intro- 
ductory psychology course requirement. Seven other 
subjects also participated, but their data were dis- 
carded because of their suspicion. 


Procedure 


The subject was taken to the experimental room 
by a male experimenter and seated at a table. The 
subject was led to believe that another subject, who 
was presumably seated in the adjoining room, would 
be his partner. This other subject was fictitious. His 
remarks and responses were tape-recorded and played 
back by the experimenter via a silent foot pedal at 
the appropriate times. 

The experimenter explained to the subject that 
his heart rate would be recorded throughout the 
experimental session. Two active and one ground 
electrode were then placed on the subject's chest, 
and a 2-minute recording of the subject's heart rate 
was taken on a Sanborn 51 Viso-cardiette. Although 
Lacey, Kagan, Lacey, and Moss (1963) have argued 
for the inclusion of galvanic skin response as a 
measure of arousal because of directional fractiona- 
tion, especially when the subject is relaxed, heart 
rate was used for several reasons. Heart rate is less 
subject to artifacts than galvanic skin response and is 
more reliable than surface blood pressure measures. 
It has reflected stress and arousal effects (Blatt, 
1961; Corfield, 1969; Malmo, 1965; Schnore, 1959) 
and, more relevant to this study, activation effects 
(Elliott, 1969). 

After returning to the experimental room, the 
experimenter moved behind a curtain which shielded 
him from the subject’s view. From this position, the 
experimenter read the initial instructions and op- 
erated the tape recorder and Viso-cardiette. 

The subjects were told that the experimenter was 
interested in the effects of different kinds of rein- 
forcement upon learning, and in order to control the 
influence of extraneous variables, such as gestures or 
facial expressions, they had been placed in separate 
rooms. The subjects were told that they could com- 
municate with each other by means of a loudspeaker 
system connecting the two rooms, To justify the 
recording of heart rate, the subjects were informed 
that the experimenter was also interested in the phys- 
iological responses which occur during learning. 

The subjects were instructed that in the first part 
of the experiment, the critical subject would serve as 
the learner, while the other subject would serve as 
teacher; in the second part of the experiment, the 
subjects were to exchange roles. The experimental 
task involved either teaching or learning a list of 10 
nonsense syllables, The teacher was to read the list 
to the learner, who would then repeat as many of the 

syllables as he could remember. (The teacher was then 
to reinforce the learner by giving feedback, In the 
first part of the experiment, the teacher was to give 
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the critical subject verbal feedback, ing to the 
subject how well he had done. Sub. ‘s were told 
that the method of reinforcement he second 
part of the experiment would be des later. The 
subjects were instructed to contir learning 
trials until they were signaled by the imenter to 
stop. 


Following the instructions, the thwa nanipula- 
tion was introduced. One-third o subjects 
learned a list of easy nonsense selected 
from Glaze's (1928) lists having hi ssociation 
values (low thwarted), while anot hird at- 
tempted to learn a list of difficul bles, se- 
lected from Glaze’s lists having lo sociation 
values (moderately thwarted). All o: subjects 
received neutral comments (e.g, “We O.K— 


formance 
f the sub- 
ist, but 
ormance 
mments, 


let's do it again") as feedback on th: 
from their partner. The remaining th 
jects attempted to learn the same di 
received very critical comments on thei 
from their partner (high thwarted). ‘I 
delivered after each trial, were as fol 


here for 
nd time. 


[Trial 1] O.K. I think we're going t 
a long time. But let's try it for the : 


[Trial 2] You still haven't got thc right— 
I'll read the list again, and this time und con- 
centrate a little more. It's really no y hard. 
[Trial 3] God! At the rate you'r g, we're 
never going to get finished! I reall vt think 
there's much point in continuing with : do yo 
Try harder this time so that we ca t out. 
here!! All right ... I'm going to read ‘hem or 
more and this time, Pm going to r them 
slow that even you can get them. E 


Considerable pilot work was done to obtain the 
desired affective tone for these comments and to 
ensure credibility. Ni. 

For all subjects, the experimenter then moved à 
previously concealed shock box, a modified version of 
Buss's (1961) aggression machine, to the table at 
which the subject was seated. 

The experimenter explained to the subject that in 
the next part of the experiment, he would assume 
the role of teacher and administer physical feedback 
(reinforcement) to his partner, who would assume 
the role of learner. The subject was also told that he 
could vary the intensity, number, and duration 0! 
shocks received by his partner. Intensity could be 
varied by adjusting the large center dial to the 
desired level (0-330 volts) and by then pressing 
the shock-delivery button. Eight descriptive labels 
indicated to the subject the approximate severity of 
shock associated with the varying voltage levels: 
slight shock = 0-55 volts; moderate shock = 56-110 


volts; strong shock = 111-170 volts; very strong 
shock = 171-225 volts; intense shock = 226-280 
volts; extreme intensity shock = 281-335 volts; 


danger = 336-400 volts; severe shock = 401-450 volts. 
To decrease the likelihood of suspicion, a governor 
was attached to the shock box which prevented the 


from turning the dial beyond the 330-volt 


| Subjects were instructed that a maximum of 
volts was availabl: for their use. The number of 

is could be varied by pressing the shock-delivery 
iton more than on he duration of shocks could 

e varied by depr the button for varying 


hs of time. The « ry button was connected to 

shunt-clock ti: one visible to the subject 

d the other visible he experimenter behind the 

in. The shock-b ial was connected to a meter, 

le to the experi er, which indicated the in- 
levels set by ubject. 

— The subject was : a sample shock of 45 volts 

to increase the reali the situation. The experi- 

then gave a list of 10 syllables to the subject 

leit the room, pr nably to wire the partner to 

shock apparatus. The list differed from the one 

usly learned |) > subject, and no indication 

its difficulty level v ;en by the experimenter. 


nenter returned, and the 
The subject administered 
ving each of three learning 
s. In all conditions, the partner reported 3 sylla- 
on the first trial, 5 syllables on the second trial, 
8 syllables on the third trial, for a total of 16 


r 2 minutes, the 
task comm: 


terminated the task at this point 
t Lo rate his partner on a series 
lar scales and to complete a 
tionnaire about ihe experiment. The questions 
checks on the effectiveness of the thwarting 
ation, the subject's perception of the pain- 
ss of shocks received by his partner, and the 

ness, Following completion of the 
questionnaires, the subject remained quietly seated 
While 2 additional minutes of heart rate were re- 
‘corded. The experimenter then removed the electrodes 
“and thoroughly debriefed the subject. 


thirteen 7-point 


RESULTS 
- Summary of Experimental Design 


Ninety subjects were run in this 3 X 3 
- repeated-measures design, with 30 subjects in 
[. fach cell. Dependent measures were (a) in- 
tensity, number, and duration of shocks; (5) 
- changes in heart rate during the experiment; 
and (c) questionnaire ratings of the partner 
- at the end of the experiment. 

To assess the difficulty of the learning task, 
an analysis of variance was carried out on the 
‘subjects’ actual learning of the syllables in 
` pe first half of the experiment. A significant 
2. main effect for thwarting emerged (F = 1239, 
a df = 2/87, p < 005). Low-thwarted subjects 

Tepeated more syllables than both high- 
- thwarted subjects (p < .001, Duncan's multi- 
- Dle-range test) and moderately thwarted sub- 
jects (p< .005, Duncan's multiple-range 
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test), thus providing a successful check on 
the manipulation of task difficulty. Mod- 
erately thwarted subjects tended to repeat 
more syllables than high-thwarted subjects 
(p < .10, Duncan’s multiple-range test). The 
mean number of syllables (summed over 
trials) given by the low-, moderate-, and high- 
thwarted subjects was 20.40, 18.17, and 16.87, 
respectively. 

The major dependent aggressive responses 
were the intensity, number, and duration of 
shocks, each of which was analyzed sepa- 
rately. 


Intensity 


Only the trials effect was significant (F — 
3.32, dj = 2/174, p < .05) in this analysis. 
More intense shocks were given on the second 
shock trial than on the first trial (5 < .05, 
Duncan's multiple-range test), Mean shock 
intensity on the third trial did not differ sig- 
nificantly from the other means. The means 
for the first, second, and third trials were 
84.72, 101.89, and 95.39, respectively. 


Number 


The thwarting main effect was significant 
(F = 447, df = 2/81, p < .025). Moderately 
thwarted subjects gave fewer shocks than 
both low-thwarted subjects and high-thwarted 
subjects (p < .05, for both comparisons, Dun- 
can’s multiple-range test). Subjects in the 
low- and high-thwarted groups did not differ 
from each other in the number of shocks 
given. The mean number of shocks given by 
low-, moderately, and high-thwarted subjects 
was 1.70, 1.17, and 1.64, respectively. 

The Thwarting X Trials interaction also 
reached significance (F — 2.56, df — 4/174, p 
< .05). Low- and high-thwarted subjects in- 
creased the number of shocks delivered from 
Trial 1 to Trial 2 and then leveled off on 
Trial 3, while moderately thwarted subjects 
tended to decrease the number of shocks de- 
livered over all three trials. Figure 1 shows 
the significant Thwarting X Trials interaction. 

The absolute number of shocks adminis- 
tered by subjects in this experiment was quite 
low. Although the procedures (no feedback 
from victim and absence of the experimenter) 
were designed to facilitate display of aggres- 
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Fic. 1. Mean number of shocks given by low-, mod- 
erately, and high-thwarted subjects over trials. 


sion, the instructions may have inadvertently 
inhibited use of shock. Or, perhaps the insti- 
gation was relatively mild. 


Duration 


None of the effects in this analysis yielded 
significant effects. 


Questionnaire Data 


Subjects rated their partners on a series of 
thirteen 7-point bipolar scales, with higher 
scores indicating more favorable ratings. An 
analysis of variance showed that the main 
effect for thwarting was highly significant (F 
= 17.97, df = 2/87, p < .005). High-thwarted 
subjects gave less favorable ratings to their 
partners than did moderately thwarted sub- 
jects or low-thwarted subjects (p < .001, for 
both comparisons, Duncan’s multiple-range 
test). Ratings by moderately and low- 
thwarted subjects were not significantly dif- 
ferent from each other. The mean ratings for 
high-, moderate-, and low-thwarted subjects 
were 4.26, 5.38, and 5.09 points, respectively. 

Subjects responded to three postexperi- 
mental questions concerning anxiety, annoy- 
ance, and estimated painfulness of their part- 
ner’s shock on 5-point scales. Analysis of the 
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degree of annoyance felt duriny ‘he experi- 
ment yielded a significant m ‘ifect for 
thwarting (F = 7.66, df = 2 005). 
High-thwarted subjects indicat ater an- 
noyance (mean rating of 2.67 did low- 
and moderately thwarted subje ean rat- 
ings of 1.63 and 1.65, respectiv < 005, 


extension of Duncan's multiple test to 


unequal replications; Kramer, ). The 
other two ratings did not yie nificant 
effects. 
Physiological Data 

For statistical treatment, t! -second 
time interval just preceding the riment- 
er's reading of the instructions was used:as an 


estimate of base-line heart rate. The degree of 
change occurring during various events in the 
experiment was calculated by subtracting the 
average of the subject's base le om his 
average reading for the event flected 
change in beats per 5-second val. In 
order to provide a uniform coun record, 
conducive to subsequent explorator: analyses 
as well as for comparison with res: 
studies, heart rate samples were based on 
5-second intervals of time (Davis, Buchwald, 
& Frankmann, 1955). These saimpics were 
consistently taken from the first, last, and 
middle (if possible) portion of each «vent, so 
that no biased selection of intervals occurred. 

Analysis of variance yielded no significant 
differences in base levels for the three subject 


heart rate due to the independent 
were then tested by analysis of variance. 
Thwarting manipulation. Cardiac response 
during this period was measured by counting 
and averaging the first three, middle three, and 
last three 5-second intervals of heart rate 
during the syllable presentation, along with 
the first two and last two 5-second intervals 
during the verbal reinforcement period, for 
each of the three learning trials. The thwart- 
ing manipulation (partner teaching and rem: 
forcing the subject) resulted in nearly signifi- 
cant heart rate differences among the three 
thwarted groups for the thwarting period ( 
= 2.52, df = 2/87, p < .10). High-thwarted 
subjects showed greatest increases, but did not 
differ significantly from the two oth 


.10, Duncan's multiple- 

comparisons). The means, 

rom base levels, were 

vr high-, moderate-, and 
, respectively. 


thwarted grou 
range test, for | 
reflecting incre 
1.16, .75, and 
low-thwarted s 


Analysis of varting event resulted in 
a highly signi trials effect (F = 38.08, 
Of = 2/174, | |j). Duncan's multiple- 


that heart rate increases 
reater (p < .001) on the 
!) than on the second (X 


range test sh 
were significa: 
first trial (X 


z 77) or th il (X = 74), thereby re- 
flecting adapt: effects. 
Shock instr: i. Cardiac response during 


this period v 
averaging the 
two 5-second 
the instruction p< 
main effect emer 
instruction ev: 


easured by counting and 
t two, middle two, and last 
‘vals of heart rate during 
iod. A significant thwarting 
d in analysis of the shock- 
(F = 691, df = 2/87, p< 


005). High-thwarted subjects showed signifi- 
cantly greater cases in heart rate for the 
shock-instructi: period than did low- 


thwarted (5 - 
test) and moderat 


/5, Duncan's multiple-range 
ly thwarted subjects (p < 


01, Duncan’s rnultiple-range test), The 

- Means, representing change from base levels, 

| Were .88, 35, and .23 for high-, moderate-, 
and low-thwa ubjects, respectively. 

Presenting syllables, Cardiac response was 


Measured by counting and averaging the first 
three, middle three, and last three 5-second 
intervals of heart rate for each of the three 
teaching trials. Analysis of the teaching event 

yielded a significant thwarting main effect (F 

= 6.18, dí = 2/87, p < .005). High-thwarted 
3 Sübjects showed significantly greater increases 
In heart rate than both moderately thwarted 
(^ < .005, Duncan's multiple-range test) and 
low-thwarted subjects (p< .01, Duncan's 
Multiple-range test). The means for high-, 
Moderate-, and low-thwarted subjects were 
63, 08, and .16, respectively. 

This same analysis revealed a significant 
trials effect (F= 5271, df— 2/174, p 
|001). Heart rate increases from base levels 
Showed significant differences between the 
fist (£ = 49), second (X = .27), and third 
(X = Il) syllable presentations (p< .001, 


a all comparisons, Duncan’s multiple-range 
est), 
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TABLE 1 


Mean Heart RATE CHANGE (SYLLABLE PRE- 
SENTATION Less BASE LEVEL SCORES) FOR 
THWARTING X TRIALS [INTERACTION 


= = 
Trial! | Tral2 | Trial 3 


Group 
Low thwarted | 33 43 | 03 
Moderately thwarted 27 O1 —.05 
High thwarted .86 67 35 


In addition, a significant Thwarting X 
Trials interaction emerged (F = 2.88, dj = 
4/174, p < .025). Table 1 presents the means 
for this interaction. These means show heart 
rate adaptation effects, with slightly differing 
slopes for the three thwarting conditions. 

Administering shock. Cardiac response was 
measured by counting the two 5-second in- 
tervals of heart rate during the shock-delivery 
period for each of the three shock trials. 
Analysis of heart rate change scores for the 
shock event yielded a significant thwarting 
main effect (F = 3.93, df = 2/87, p < .025). 
High-thwarted subjects showed greater in- 
crease from base levels than did moderately 
thwarted subjects and low-thwarted subjects 
(p < .05, for both comparisons, Duncan's 
multiple-range test). The respective means 
for high-, moderate-, and low-thwarted sub- 
jects were .67, .12, and .26. This same analy- 
sis yielded a significant trials effect (F — 
4.45, df = 2/174, p < .025). Heart rate in- 
creases were significantly greater on the first 
shock trial (X = .60) than on the second and 
third shock trials (Xs — .23 and .22, respec- 
tively, p < .05, for both comparisons, Dun- 
can’s multiple-range test). 

Postexperimental period. Analysis of heart 
rate during the postexperimental question- 
naire period revealed no significant effects, nor 
did multiple-range tests reveal significant dif- 
ferences. Also, analysis of variance and multi- 
ple-range tests on heart rate during the final 
2-minute recovery period yielded no signifi- 
cant effects. 

Figure 2 depicts heart rate change from 
base line for each of the six events. Although 
adaptation effects are apparent after Event 2, 
the significant elevation of heart rate for the 
high-thwarted group, relative to the other 
two groups, is evident on Events 3, 4, and 5. 
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Fic. 2. Mean heart rate change from base 
level over six events. 


DISCUSSION 


The three thwarted groups did not differ in 
cardiac rate increases during the thwarting 
period. However, when made aware of the 
opportunity to aggress, the thwarted groups 
showed differential increases, with the high- 
thwarted group displaying greater elevation 
in heart rate than the other thwarted groups. 
This effect confirms Elliott’s (1969) conten- 
tion that heart rate might not reflect emo- 
tional tension level, except when responses 
can be initiated to cope with the arousal. The 
insulted subjects did not show significant in- 
creases over the other thwarted groups until 
they were told they could shock the other 
person, that is, that they could retaliate. 

Differences among conditions were elimi- 
nated following shock delivery, thereby sug- 
gesting that cardiac recovery occurs only after 
the aggressive sequence has ended. While in- 
consistent with Holmes's (1966) findings that 
frustrated subjects maintain an elevated level 
of physiological arousal after aggressing, these 
results support Hokanson and Shetler's 
(1961) proposition that tension reduction 
takes place when a subject makes a response 
which is appropriate to the situation, as in 
overt aggression toward a frustrator. This 
study provides the additional information, 
however, that during the actual aggressive se- 
quence, activation is high for insulted per- 
sons, as indicated by elevation of cardiac re- 
sponse throughout teaching and shocking the 
other person. As suggested by Elliott (1969), 
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cardiac response seems especially sensitive to 
action instigation. An additional implication 
of these findings is that after terminating a 
series of aggressive responses, subjects who 
are then provided with an unex)ec'ed further 
opportunity to aggress may no: be likely to 
express further aggression. Since their physio- 
logical tension has been reduce, strong ag- 
gressive cues or renewed drive arousal should 
be necessary to elicit aggressive responses. 
Particularly relevant to the qucstion posed 


in this study is the fact tha 

thwarted subjects, frustrated (op 
by their failure on a task, did no 
low-thwarted subjects in activatio: 
out the experiment. If these subjects were in 
a state of readiness, as propose! by Berko- 
witz (1965), it was not evident from their 


oderately 
rationally) 
iffer from 
through- 


cardiac responses. What may be necessary for 
failure to serve as an activator is (7) a spe 
cific threat to self-esteem, as provided in the 
Geen and Berkowitz (1967) study, where sub- 
jects failed on a task which a peer had solved; 
or (b) the perception that the peer is the 
source of frustration (Dollard, Doob, Miller, 
Mowrer, & Sears, 1939; Lanzetta & Hannah, ] 
1969). | 
In terms of aggressive responses, the low- 
and high-thwarted subjects delivered more 
shocks to their peer than did the moderately 
thwarted subjects. For the insulted subjects, 
the level of aggression was concomitant with 
increased heart rate and greater reported 


annoyance, so that their aggression seemed to 
reflect anger, However, the aggressive behav- 
ior of the low- and moderately thwarted per 
sons must be understood in other terms. AP- 
parently, their past performance on a similar 
task served as a comparison which led the 
subjects who succeeded (low thwarted) t0 
expect that their peer could do well, an 
therefore shocked him to improve his pel 
formance, whereas those who failed (moder- 
ately thwarted) did not expect their peer f? 
do well, and hence did not so reinforce him. 
'This assumption is consistent with Lanzetta 
and Hannah’s (1969) finding that attribution 
of ability to do a task because of competence 
led to increased shocks in a training 185* 
These results suggest that although UY 
amount of shock delivered in this situatio" 


- cient definitio: 


- Berkowrrz, L, Th 


- aggressive respo 
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‘on, in the sense of harming 
iso reflects a tendency to 
einforcement for learning. 


may reflect aggre: 
another person, 
use the shock 


Thus, experime ich are designed to mea- 
sure aggressio? ms of physical harm to 
another must :^c'sde controls which will 
eliminate the < ‘ion of attribution effects. 
Furthermore, important to clarify the 
motivational | tor the delivery of shock 


in a situation, 
rather than t 
biguously in! 
amount of pai: 


that the intent to injure, 
, the person can be unam- 
If one considers the 
livered by shock as a suffi- 
:weression, then the results 
of studies so far demonstrate that insult, some 
types of task f+ilure, and attribution increase 
;. However, if one wishes 
the intent to physically in- 
to the intent to teach or 
"ments which unconfound 
igned. 


to clarify whe! 
jure is prepo! 
train, then ex 
these must be 
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SOME RESTRICTIVE CONDITIONS FOR THE OCCURRENCE 
OF PSYCHOLOGICAL REACTANCE * 


GISLA GRABITZ-GNIECH * 


University of Mannheim, West Germany 


The present study attempted to relate two areas of research in social influenc: 
thus far not integrated. These areas concern “psychological reactance” an: 
“conformity.” By confronting Brehm's theory of psychological reactance with 
Green’s theory of fear of emotional alienation, two hypotheses were derived 
specifying the condition under which psychological reactance may be expecte 
to occur: (a) Given the presence of a group norm threatening personal freedom 
in a social influence situation, occurrence of a reactance effect will decrease 
(b) with high feelings of inadequacy of the responding individual, occurrence 
of a reactance effect will decrease. In a before-aíter design, reactance behavior 
of 60 male and female students was examined as a function of social influenc: 
with, versus without, norm and feelings of inadequacy, low versus high. Addi- 
tionally, two control conditions with respect to the social influence variable 
with 30 subjects each were run, Reactance was measured as pre-post difference 
of attractiveness ratings regarding a painting which had been eliminated from 
the original set of (four) choice alternatives. The results lend support strongly 
to the first, marginally to the second, experimental hypotheses stated. It is 
concluded that specifications of the concept of psychological reactance are neces- 
sary with respect to the social situation and personality factors. Some prob- 


lems about measure of reactance are discussed. 


According to the triangular model suggested 
by Willis (1963), one can expect three pos- 
sible kinds of reaction in a social influence 
situation. The model provides two dimensions, 
one which represents conformity /anticonform- 
ity and a second which represents depen- 
dence/independence. Independence refers to 
behavior which lacks consistency in relation 
to social influence, while conformity and anti- 
conformity refer to behavior which is highly 
dependent on the source of social influence. 
When there is a positive response to the in- 
tended social influence, one speaks of con- 
formity; when the response is negative to the 
expectations of the social influence, it is called 
anticonformity, or resistance. 

The starting point of the present study was 
the theory of psychological reactance formu- 
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lated by Brehm (1966). The theory states 
that when a person’s possibility to act freely 
is restricted, he will become motivationally 


aroused and show some sort of opposition. 
This arousal is called psychological »sactance, 
and it is designed to reestablish the lost or 
threatened freedom. For example, if a person 
were free to engage in Behaviors A, B, C, 
or D, and his freedom to engage in one of 


them, say B, were eliminated or threatened 
with reduction, then he could reestablish his 
freedom by engaging in Behavior B. Any 
attempt of others to influence an individual’s 
freedom of choice results in a motivational 
state called “psychological reactance”; this 
state creates a boomerang effect against the 
attempted control, and the outward appear- 
ance is an increased amount of attractiveness 
of the object when the freedom of choice is 
restricted. (For a newer explanation of the 
boomerang effect in the field of attitude 
change, see von Cranach, Irle, & Vetter, 
1965.) In terms of the triangular model oí 
Willis (1963), psychological reactance is siml- 
lar to anticonformity, since it is dependent on 
the source of social influence that has ? 
negative effect. 


other theoretical point of view with 
ard to personal freedom is held by Green 
). Green argued that the individual ex- 
fear of emotional alienation when 
he is confronted with situations in which he 
bas to come to decisions. As a result of that 
fear, he will reject the freedom to choose. 
Green pointed out that the problem of reject- 
ing freedom has been investigated in the 
social sciences, especially in the field of con- 
"formity research. The individual wants to 
— keep the advantages of group membership 
and, therefore, limits “his full use of the op- 
portunities of self-expression [p. 3]." Green's 
‘interest in this problem goes back to 
Fromm (1941), who in his book, Escape 
- from Freedom, discussed the phenomena of 
personal freedom and social isolation. In a 
related experiment, Green was able to find a 
close relationship between the motivation of 
“fear of emotional alienation” (tested with 
- the Thematic Apperception Test) and con- 
formity behavior (validated in an Asch 
Situation). 

In short, two theoretical standpoints exist 
which are both related to the problem of 
individual freedom and motivation, but which 
- arrive at contradicting predictions and re- 
sults. Both psychological reactance (Brehm, 
1966) and fear of emotional alienation 
(Green, 1966) arouse types of behavior in 
Social influence situations which are related 
to the individual’s freedom of choice, but 
Come to contradictory theoretical predictions 
Concerning the motivational basis of behavior. 
Brehm’s theory states that there exists a 
Motivation to reestablish a threatened or 
eliminated freedom aiming to strengthen the 
individual position, whereas Green's theory 
centers around a motivation to reject freedom 
Caused by fear of emotional alienation from 
Others. These motivations call for kinds of 
behavior that can, in Willis’s terms, be 
referred to as anticonformity and conformity. 

The specific purpose of the present study 
Was to show some conditions under which 
the Occurrence of psychological reactance as 
4 motivational state and thus as behavioral 
effect is restricted, In the social influence 
feld, internal (or individual) as well as 
external (or situational) factors guide behav- 
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ior. Internal factors may be summarized as 
personality variables (e.g., social dominance, 
inner-directedness), and external factors are 
group variables (such as attraction, norm 
expectations) and stimulus variables (com- 
plexity, difficulty, etc.). With regard to the 
situational factors, it follows from conformity 
research that the stronger the (relevant) 
normative expectations in a given situation, 
the more compliance will be shown in this 
situation. Brehm (1966, p. 7) himself stated 
that the extent to which a social influence 
attempt is seen as justified and legitimate may 
be powerful variables with respect to the 
magnitude of reactance, but he held them 
constant for testing his hypotheses. As to the 
internal variables, it has been shown, for 
example, by Klein (1967), that dependency 
(ie. social anxiety, low self-esteem, other- 
directedness) leads to compliance. Brehm 
(p. 127) assumed that there may be indi- 
vidual differences in showing psychological 
reactance, but so far he has not directly in- 
vestigated the role of personality factors. A 
study reported by Wicklund and Brehm 
(1968) gives some support to the assumption 
that low self-esteem or high feelings of in- 
adequacy lower the resistance to appeals for 
compliant behavior. The authors showed that 
importance of the relevant freedom, that is, 
the person's competence on the issue, is a 
striking variable for the occurrence of a 
boomerang attitude change. 

As was stated by Brehm and Sensenig 
(1966), “social influence attempts may fre- 
quently produce two opposing forces: that 
toward compliance, and that against compli- 
ance and toward the reestablishment of 
freedom [p. 703]." The occurrence of psycho- 
logical reactance in social influence situations 
might be restricted in two ways: (a) The 
pressure might be so strong that one is moti- 
vated to comply; (5) the freedom might be 
rejected. 

Thus, for the purposes of this investigation, 
two predictions can be advanced. First, if a 
freedom-restricting social influence is of 
norm-establishing character, the tendency to 
arouse psychological reactance will be dimin- 
ished. Second, if a person shows high feelings 
of inadequacy, the probability of experiencing 
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psychological reactance will be lowered. Both 
hypotheses concern behavior that is guided 
rather by motivation to give us a special 
freedom than by a motivational state to re- 
establish an eliminated or threatened freedom. 


METHOD 


In a before-after design, the dependent variable 
was the amount of psychological reactance, mea- 
sured—as in many of Brehm’s experiments—by the 
change in attractiveness scores of a choice alterna- 
tive. The first independent variable was the active 
experimental manipulation of the social influence 
situation; in one condition it was exercised without 
norm; in the other, with norm. Two control groups 
were additionally run, one experimental base line 
without treatment and one replication of Brehm's 
original condition, The second independent variable 
was a personality trait, namely, the subjects feeling 
of inadequacy as measured by the scale developed 
by Janis and Field (1959) and manipulated through 
an ex-post-median split. 


Subjects 


Subjects were male and female volunteers, mostly 
freshmen, from introductory psychology and sociol- 
ogy courses at the University of Mannheim, West 
Germany. Subjects received course credit and were 
run in groups of five in the first experimental session. 
They were randomly assigned to the various experi- 
mental conditions in the second session. 


Procedure 


In the first experimental session, the subject was 
asked to give judgments about paintings. The paint- 
ings were 4 colored reprints that had been selected 
from a series of 13 pictures by the criterion of nearly 
equal attractiveness scores and qualitative differences 
in a pilot test. 

It was explained to the subject that he would see 
the paintings in order to indicate how much he 
individually liked each one, The experimenter an- 
nounced that each subject would have the opportu- 
nity to select one of the paintings as a gift for 
himself at a later date. Immediately after viewing 
each painting, the subject gave his rating on a scale 
of favorability labeled 0-100. The instructions were 
worded as follows: 


Here are four paintings, please indicate their 
attractiveness. Each person who participates in this 
experiment will get one of them later as a gift. 
However, the shipment of the paintings has not ar- 
rived yet but is expected soon. Then you may 
select one of them to keep. Now, I shall show 
you all the paintings briefly so that you can get 
an impression. Would you then please judge the 
paintings on a scale from 0 to 100, indicating how 
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much you personally like each pict 
dred means on the scale you lik: 


One hun- 
extremely 


well, and 0 means you do not like i ill. Please, 

make your mark in the appropria! e on the 

scale. I will tell you which pictur rate ata 
given time. 

The subjects then answered the px ty ques- 
tionnaire, for which they received p instruc- 
tions. Finally, the experimenter ari a date 
(about 4 days later) with each subje: i second 
meeting, ostensibly in order to let ! ose one 
of the paintings. 

In the second session, the subject formed 
that he would be shown the four | again 
and that he should rate them once n find out 
whether his judgments would be affect looking 
at the pictures a second time. Beior ubjects 
made their second rating in the exper | condi- 
tions, one of the paintings (the one subject 
had initially rated third most attract is elimi- 
nated from the set of choice alternati Phen all 
four paintings were seen and rated b subject. 
Measures 

The dependent variable was measured by the 
change in attractiveness scores of the object rated as 
the third most attractive (X;) by the suoiect in the 
first session and then eliminated in the rimental 
conditions from among the choice ilic ves. “It 
was expected that the eliminated altern: ‘ve would 
tend to increase in attractiveness . . . [P:cim, 1966, 
p. 22]," because the freedom to cho: was re- 
stricted. One speaks of a reactance effect when 
Object Xs gains the greatest positive (or lowest 
negative) change score in the attractivencss judg- 
ments, 

Besides the differences in attractiveness scores On 
the rating scale, a further measure, nomely, the 


frequencies of changes in rating Object X», that is, 
frequencies of increased, unchanged, and decreased 
attractiveness ratings, as used by Brehm (1966, P. 
23), was computed. 

It seemed to be difficult to establish a clear-cut 
index for the dependent variable (reactance), since 
the judgment of the critical object (X+) was not 
independent of the judgments of the other three. If 
there was a variation in the judgments concerning 
the other pictures in the second measurement, then 
this presumably influenced the judgment of the 
critical picture, an effect not necessarily attributable 
to reactance, In some studies preliminary to the 
experiment discussed here, a central tendency (re 
gression effect) from the first to the second set 0 
ratings had been found. The values of the second 
attractiveness judgment of the four pictures were 
moved to the center of the distribution, so that the 
distribution of mean attractiveness scores O the 
four pictures exhibited a smaller dispersion for the 
second judgment, Concerning the central tendency: 
Johnson (1952) stated in general: 


. is manifested . . . as an 
mulus magnitudes below the 


The centra! 
overestimat 


center of ti s series and an underestima- 

tion of stir nitudes above the center of 

the stimu! when stimulus ratings are 

averaged | 

The diffic erning the regression effect 
sare connected he experimental procedure it- 
"self. Besides t that repeated measurements ac- 
"tually result i "sion to the mean, the use of 
"B even num four objects (alternatives), to- 
gether wit! oretical necessity to eliminate 
from among ternatives the one rated third 


sults. Reactance theory 

ittractiveness of Object Xa in 

as a consequence of eliminat- 
"vever, the same positive shift 
consequence of a regression to 
tance effect, caused by a social 
ion, could not be separated, then, 
an effect ed by regression to the mean. The 
t way to | the effect due to regression is 
‘to compare t ita, that is, changes in rating of 


Xa, of the « tion conditions, with those of a 
- mo-eliminatio: dition that qualifies as an un- 
_ biased base lir £ there is no possibility to com- 
“pare experim onditions with a no-treatment 
“condition, one has to use an unbiased index for the 


dependent vari 
"transformed in 
‘Tanks, As der 
- ferences of 


; for example, the data have to be 
‘andardized scores (z scores) or in 
lent measures, then, pre-post dif- 
res or pre-post changes of the rank 


OÍ the critical alternative X, could be used. In the 

present study an experimental base line was used. 
The data of the treatment conditions were compared 
with those of the no-treatment control condition. 


Independent Variables 


Manipulation of the social influence. Two experi- 
mental conditions were run, one without norm and 
one with norm, In the condition without norm, Ob- 
ject X. was eliminated from choice—as in the ex- 
Periment of Brehm, Stires, Sensenig, and Shaban 
(1966)—on the grounds of impersonal events. The 
Experimenter declared that the shipment of pictures 

had arrived, but that it had, for some unknown 

Teasons, failed to include copies of one picture (this 
, Picture was the one that the subject had rated as the 
third most attractive in the first session). The differ- 

ence in the experimental condition to Brehm's experi- 
. ment lies in the fact that the subjects were not 
tested individually but four at a time (with the 
. Same X;), so that they could be compared with the 
Second experimental group in which a norm was 
Introduced by a group (togetherness situation). 

Tn the condition with norm, three stooges elimi- 
‘Rated Object Xa and thus provided a norm. The 
.Téason given for the elimination was the stylistic 
incompatibility of one of the four pictures. It is 
Worth noting that this criterion for elimination did 
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not involve any positive or negative evaluation of 
the pictures. The stylistic discrepancy to the series 
of objects was intended to be free from value con- 
siderations; that is, the variables “stylistic fitting” 
and “attractiveness” were considered as independent. 
The experimenter asked the four persons (first, two 
stooges; then, the naive subject; and at last, a third 
stooge) to eliminate that picture of the series which 
was stylistically incompatible. (The argument was 
that in prior sessions, the subjects frequently had 
made remarks that one of the pictures did not 
stylistically fit in the series of pictures. The subjects 
were told that this led to the intention to confine 
the series of four pictures to one of three pictures 
in the future. On grounds of the group elimination, 
Object Xa was no longer eligible, but on account of 
comparability with the first measure, all four pic- 
tures had to be judged once more.) The social in- 
fluence lowered the freedom to choose by means of 
a pretended establishment of a group norm in the 
presence of the naive subject. The norm was pro- 
vided by members of the subject’s peer group, 
rather than by the experimenter, in order to pre- 
clude the possibility that the subject would become 
suspicious of any direct manipulation of the norm 
by the experimenter. This was also the reason why 
the subject was assigned the third turn in announcing 
his decision. 

Additional control conditions, In order to guar- 
antee an unequivocal interpretation of results and 
also with respect to a possible comparison of the 
results of the present study with those of Brehm's 
experiments, two further conditions were included. 
One control condition was a pure replication of the 
experiment of Brehm et al. (1966); that is, the pro- 
cedure was like that in the without-norm condition, 
but the social influence was given only to one 
subject at a time (alone situation-Brehm replica- 
tion). This replication provided, in addition to a 
direct comparison with Brehm’s results, the possi- 
bility to discern whether a possibly lacking experi- 
mental effect in the without norm, togetherness sit- 
uation refers to the togetherness situation or to the 
reactance phenomenon itself. The second control 
condition was an experimental base line without 
elimination or mention of Xs, with four subjects 
per session (no treatment). This condition was run 
in order to give an absolute interpretation of the 
results, since in this case no reactance at all was 
expected. 

Personality trait. As a partition of the single 
samples into two experimental conditions concerning 
individual differences, a personality questionnaire 
was used. Since a German version of an analyzed 
scale to measure social inferiority was not avail- 
able, the Janis and Field (1959) Feelings of Inade- 
quacy Questionnaire was translated and roughly 
analyzed (interitem consistency, fy = 1984). The 
final questionnaire mixed with items of other ques- 
tionnaires was given to all subjects. The total scores 
of the subjects in the two experimental groups then 
were divided by median split into those with high 


TABLE 1 


FREQUENCIES OP INCREASED, DECREASED, AND 
UNCHANGED ATTRACTIVENESS RATINGS OF 


SOCIAL INFLUE 


No treat- 


Without norm | NE |: went 
Attractiveness I | 4 Ss to- 
change 1S alone : gether 
(Brehm | 4 Ss to- | 4 Ss to- | (experi- 
repli- gether | gether | mental 
cation) | base line) 
Increase (+) | 16 | 18 10 6 
No change (0) 6^4] > 4 12 
Decrease (—) 8 | 7 | 13 12 


feelings of inadequacy (FI+) and those with low 
feelings of inadequacy (FI—). 


RESULTS 


Social Influence Manipulations 


In line with those findings reported by 
Brehm (1966), the Brehm replication has to 
be compared with the experimental base line 
in order to find out about the success of the 
general experiment. The respective data ap- 
pear in columns 1 and 4 of Table 1. The dif- 
ference between these conditions is statisti- 
cally significant (y? = 5.99, df=2, p< 
105). 

Due to the alone or togetherness situation 
(columns 1 and 2 of Table 1), no statistically 
significant difference—as expected — was 
found; that is, the group reactance effect 
(without norm) was like that of the Brehm 
replication condition. 

Although the difference between the Brehm 
replication and with norm condition (columns 
1 and 3 of Table 1), as well as the difference 
between the togetherness situations without 
and with norm (columns 2 and 3 of Table 
1), follows the expected pattern, no acceptable 
level of statistical significance was ap- 
proached. Nevertheless, the data from Table 
2 (lowest row) show that mean change in 
attractiveness ratings of the with-norm situ- 
ation—as expected—is significantly smaller 

than mean changes either in the Brehm repli- 


3 Chi-square tests according to Guilford (1956, p. 
232). All chi-square tests of significance are two- 
tailed. 
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TABLE 2 


MEAN CHANGE IN ATTRACTIVE) TINGS 
Without norm No prei 

Group 4 Sa 
1 5 alone 4 Ss together 
mim) together oa 
ase line 
FI— +3.533 | --6.066 | - — 2,800 
FI+ +4.266 | +3.266 + +1.600 

Overall | +3.900 | +4.666 | - — .600 

Note,—Positive change indicates increa tiveness. 
cation (4 — 1.63, df = 58, p < .0t ir in the 
without norm, togetherness cons ion (f= 
2.13, df = 58, p< .025). Wher data of 
both without-norm conditions ombined 
(X = 4.283) and compared with 1a of the 
with-norm group (X = —3.766) statisti- 
cally significant greater mean in the frst con- 
dition can be shown (£ — 2.095, —88, p 
« .025). 

To test whether the reactance ct in the 
with-norm condition (column 3 le 1) is 
vanished or only lowered, the di» of this 
condition have to be compared w^ those of 
the no-treatment condition (coin 4 of 
Table 1). No statistically significan: difference 
could be found between these conditions. 


With respect to both experimental variables— 
social influence and personality trait—all 
data except those of the Brehm replication 
condition (see Table 2) were analyzed in a 
2X3 analysis of variance (Winer, 
The analysis of the data indicated that the 
social influence manipulation (without norm, 
with norm, no treatment) was effective ac 
cording to the occurrence of a reactance 
effect at the .10 level (social influence: eG 
2.455, df = 2/84, p < .10; personality trait: 
F = .00005, df = 2/84, ns). A second analy- 
sis of variance was computed with the 2 ex- 
perimental social influence conditions only 
(togetherness groups without and with 
norm) and 2 levels of the personality trait. 
The results of this 2 x 2 design strongly 
lend support to Hypothesis 1, since the € 
influence manipulation was significant ? 


51 statistic according to Winer (1962, pp. 31-9); 
Since the hypotheses are directed, one-tailed t€ 
have been used. 


ignificance with respect to 
psychological reactance 

F = 4.399, dj = 1/56, p< 

l ait: F = 3085, df = 1/56, 


ns). 
In brief, with respect to the social 
n tions, there is evidence on 

\ reactance effect appears 

; in Brehm's experiments) 
situations; (b) the reactance 
the freedom-eliminating so- 
itions without any norm, as 
he experimental base line; 
ution calls for less reactance 
orm condition, a comparison 
"ment group, showing that 
ance effect at all in the norm 


| scores in the Feelings of 
‘vestionnaire range from 0 to 
the two experimental groups 
j: tions: without norm and with 
orm), the edian of the subjects’ scores 
Was nearly ai the same point, exactly at 11.50 
and 11.66. Up to and including 11 points, the 
‘subjects were classified in the group FI (—), 
that is, low ngs of inadequacy; from 12 
Points and more, the subjects were attached 
to the group FI (+), that is, high feelings 
9f inadequacy. 

Those persons of the two experimental 
groups who showed reactance (indicated by 
positive rating shifts of the Object X4) had 
à significantly lower mean on the FI scale 
i those persons who showed no reactance 
(10.78-12.40; t= 1.47, df=58, p< 0). 
In other words, persons who showed reactance 
- had lesser feelings of inadequacy than those 
Showing no reactance. This finding confirms 
Hypothesis 2, which stated that persons with 
‘Aigh feelings of inadequacy scarcely experi- 
.fhCe reactance. The data suggest the inter- 
‘Pretation of a weak effect on the personality 
trait with regard to the occurrence of psycho- 
logical reactance in the direction predicted by 
Hypothesis 2. 
Examination of the data (see Table 2) by 
Analysis of variance, based on attractiveness 
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change scores of Xa, revealed neither a suffi- 
cient F ratio for the personality trait per se 
nor a significant interaction effect. 

Concerning the feeling of inadequacy varia- 
ble, the data provide only some evidence 
with respect to the following point: Persons 
who show reactance exhibit low feelings of 
inadequacy. 


Discussion 


The principal findings concern the occur- 
rence of psychological reactance, first formu- 
lated by Brehm (1966), under two restrictive 
conditions. The present investigation sup- 
ported the proposition that psychological 
reactance is not to be expected in all freedom- 
eliminating social influence situations. The 
specifications of the reactance theory concern 
a situational variation and one personality 
variable, 

Before discussing the results under theo- 
retical aspects, some critical notes concern- 
ing Brehm’s evaluation of his data should be 
expressed. As mentioned earlier, in a before- 
after design, the occurrence of a regression 
phenomenon might be expected, that is, a 
decrease of the variance in the data of the 
second measure. To rigorously control the 
effect of the experimental manipulation, a 
comparison of the data with those of an 
experimental base line (with no treatment at 
all) has to be carried out. 

Especially the first two experiments in 
Brehm’s book (Brehm et al., 1966) gave rise 
to the following methodical remarks. In the 
first of the two experiments, a control group 
without treatment was used, but the differ- 
ence in the original score values of the de- 
pendent variable between experimental and 
control group was not significant. Brehm sup- 
posed the reason for this insignificance in the 
small W of the control sample. Therefore, he 
tested the critical object (X out of four) 
against a zero change and thus was able to 
show a significant increase in attractiveness 
of the eliminated object. This procedure is 
not flawless, for part of the increase might be 
caused by the central tendency effect. Brehm 
(1966) did not interpret the "significant" 
increase as reactance, since he supposed that 
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not all objects (records) had the same initial 
attractiveness score: 

Because the attractiveness manipulation failed, the 
increased attractiveness of the eliminated record is 
more easily attributable to factors other than re- 
actance [p. 24]. 


In the second experiment the difference be- 
tween experimental and control group also 
failed to yield an acceptable level of signifi- 
cance. Only when the authors combined the 
data of this experiment with those of the 
first one (the results of which had not been 
interpreted by Brehm as psychological reac- 
tance) was there a significant difference (p 
= (09) between experimental and control 
group regarding the differences in scale values 
of Object X3. 

Besides the differences of the scale values, 
Brehm used a frequency measure. This mea- 
sure consisted of the number of persons who 
showed a positive change of the critical Ob- 
ject Xa on the scale from 0 to 100, and those 
who showed no or negative changes. (Since 
these frequencies are shifts on the original 
scale, again the objection can be made that 
positive changes of X might be favored by 
the central tendency phenomenon.) The com- 
parison of experimental and control groups 
of the first experiment reach no sufficient 
level of significance, although, unjustifiably 
even if few, the no changers were omitted.” 
A significant result was obtained only by 
questionable comparison of the experimental 
group and a zero change (rectangular dis- 
tribution). Concerning the second experiment, 
there was a significant difference between 
experimental and control groups only after 
combining these data with those of the first 
experiment and after omitting the freqencies 
for no changers (Brehm et al., 1966, pp. 
306, 311). 

In brief, then, the results offered by Brehm 
(1966) from these experiments are only par- 

5 The following way of data presentation might 
be more incontestable: The frequency data of no 
changes (which means that the rating of Xa was the 
same in the postmeasure as in the premeasure) and 
decreases (which means that the rating of Xs was 
worse in the postmeasure) might be pooled—since 
the experimental hypothesis stated is directional— 
and then compared with data of increases (which 


means that the rating of Xa was better in the post- 
measure, a change interpreted as reactance), 
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tially able to give support for theory of 
psychological reactance, sinc: lata which 
are not compared with a no-t: ^nt control 
group cannot be interpreted juivocally, 
just as those findings for whi wt of the 
data had been omitted. 

In the present study the ince effect 
was demonstrable against treatment 
control group as intended in ! 's (1966) 
experiments. There is some ev : from the 
data of the present experim: aat differ- 
ences in strength of reactam behavioral 
consequence of a motivation ousal are 
demonstrable by comparison b: n various 
experimental groups. 

The theory of psychological reactance is of 
fairly recent origin, and | himself 
stated that not all problems « ned with 
the theory had been solved. ! ling from 
the idea of some necessary spi tions, 
present study investigated the : f external 


influence 
;ncerning 


variations of the situation an: 
of personality variables. Fact 
the personality aspect have bees taken into 
account by the theory of psychsgical reac 
tance in two recent studies. Wicklund and 


Brehm (1968) have related jc’ competence 
of a person in regard to the i ind to psy- 
chological reactance. This varie! has been 
found to be relevant for determination of à 
reactance effect. Jones (1970) included in his 
study in addition to other experimental varia- | 
bles some personality measures (among them 
the Feelings of Inadequacy subscale from 


Janis & Field, 1959) in order “to explore 
individual differences in reactions to reat 
tance arousal [Jones, 1970, p. 127]." The 
results of the personality measures, in 8e 
eral, gave no information. According to the 
Janis and Field subscales, there have been 
found inconsistent and weak correlations wil 
the dependent variable, that is, reactance be 
havior. 

With respect to the internal variables of ay 
person, the data of our study give evidence— 
although only weak—that psychological m 
actance is predominantly shown by persone 
with low feelings of inadequacy, that is; sell- 
reliant individuals. By contrast, insecure pt 
sons (with high feeling of inadequacy 
showed no psychological reactance. The test 


ble is an indication for the 
g tendency to behave. 
s an impression of the 
which an individual has 
Green, 1966—wants to 

z to the social field. The 

e in special behavior is an 
dition for the occurrence of 
tance; whether and to what 
m really exists, for example, 
dom potential, may possibly 
> derivable from personality 
the question raised by the 
»rsonality variables in this 
imarily whether—in Jones’s 
words—differences in behav- 
es to motivational arousal 
er a freedom is present and 


{or—in lir 
hold) in : 


variables. 
investigati: 


ni -actance behavior when elimi- 
mated, or w r it is not existent or refused 
so that—í: ieoretical points of view—no 
reactance c' vt all is to be expected. 
With respect to the external variables, the 
results show ‘hat reactance still is effective 


e (han one person is exposed to the 
‘ating or threatening influence. 

ed that a pure togetherness 
ested—gives no opportunity to 
; to interact with each other. 
ce of the reactance effect in a 

> several persons are together 
to an application of the theory 
; of advertising, mass communica- 
lion, education, penal institutions, etc. 

The most interesting result was that there 
exists a condition (with norm) under which a 
person would obviously give up a freedom 
rather than show reactance. (This effect of 
the norm-establishing situation can be main- 
tained only under the assumption that the 
group elimination per se does not lower the 
.Téactance effect, a control treatment for 
Which experimental data are not now avail- 
able.) The existence of psychological reac- 
tance, together with efforts to establish and 
Consolidate a social group with respect to the 
Same issue, is logically not consistent from a 
theoretical standpoint. Group-setting behavior 
I$ norm oriented; norms call for conformity, 
that is, giving up personal freedoms. Giving 
Up one’s freedom to engage in a special be- 
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havior must be distinguished from the sort of 
compliant behavior that is caused by strong 
group pressures or social power. Giving up 
one’s freedom might be seen, for example, in 
line with the techniques of ingratiation de- 
scribed by Jones (1964), Psychological reac- 
tance is, as Brehm (1966) formulized, 
“antithetical to civilization [p. 128],” it is 
an unsocial motivational state to protect the 
individual’s freedom, Therefore, group prop- 
erties such as affiliation and cooperation ex- 
clude the occurrence of reactance (since 
probably the freedom is given up); proper- 
ties such as individual achievement and com- 
petition might be seen in line with the reac- 
tance motive and the arousal of resistance 
against the pressure. 

From the findings with regard to the situa- 
tion, it can be concluded that the two theo- 
retical positions held by Brehm (1966) and 
Green (1966), which both concern themselves 
with individual freedom but come to opposite 
statements, have different starting points. 
Green obviously refers to a more socially ori- 
ented motivation, concerning the existence of 
a well-functioning society (e.g, a society 
whose members enjoy a free and easy corpo- 
rate life). Brehm, in contrast, presumably 
imagines a more antisocial, egoistic motiva- 
tion, which “must inevitably clash with the 
pressure of social life [p. 128],” in a differ- 
entiated and highly developed (and therefore 
perhaps confining) culture. Spontaneously, one 
might suppose that the two theories arose in 
different cultures. Since both have now been 
formulated and tested in the United States, 
one has to conclude that situational and indi- 
vidual differences favor the one or the other 
motive. 

Theoretically, then, the area of conformity 
research gave fruitful hints on the possible 
limitations of the occurrence of some special 
motivational arousal called psychological re- 
actance. Needless to say, considerable re- 
search must still be done to obtain a clearer 
picture about the restricting conditions. The 
first ideas about a possible connection of 
psychological reactance with nonconformity 
came from Kiesler and Kiesler (1969), who 
mentioned the reactance phenomenon in 
their recently published book, Conformity. 
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In their highly 
in the Classroon 
(1968) reported t! 
experimentally cre: 


publicized book, Pygmalion 
Rosenthal and Jacobson 
e astounding finding that 
ed teacher expectations 
resulted in changed performance on the part 
of students. At the beginning of the school 
year, teachers were told to expect intellectual 
growth from certuin students, Even though 
these potential “bloomers” had been randomly 
selected from the class, they nevertheless ful- 
filled the prophecy, showing greater IQ gains 
over the course of a year than did a group of 
control students 

Such intriguing results do not easily go un- 
noticed. Predictably enough, they also do not 
80 unchallenged. Thus, numerous studies and 
reviews have reconsidered the “Pygmalion 
phenomenon” with mixed results and varied 
conclusions (Claiborne, 1969; Meichenbaum, 
Bowers, & Ross, 1969; Minor, 1970; Snow, 
1969; Thorndike, 1968, 1969). But whether 
9r not the Pygmalion phenomenon is real or 
illusory, it is of importance to determine 
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TOWAR AN EXPLANATION OF THE ROSENTHAL-JACOBSON 
FINDINGS * 
PAMELA C. RUBOVITS anp MARTIN L. MAEHR ? 
University of Illinois, Urbana-Champaign 
Thi ly is a follow-up to work by Rosenthal and others which has indicated 
that cacher (and experimenter) expectancies are fulfilled in student (and sub- 
ject) performance. The study was specifically concerned with the kind of 
teacl ehavior which might eventuate following the experimental manipula- 
tion of an expectancy and which could affect the performance of students. 


is found for the so-called interaction quality hypothesis. Teachers 
er in the amount of attention given to allegedly "gifted" and *non- 
lents in a microteaching situation. However, the pattern of atten- 
iise administered to the two groups did differ, An attempt to relate 
| teacher behavior to the personality trait of dogmatism was un- 


whether and/or how expectancies affect 
teacher behavior. Most certainly, if it is fi- 
nally determined that teachers’ expectations 
do indeed eventuate in reliable changes in stu- 
dent performance, one would have to inquire 
more specifically into the intervening teacher— 
student interactions which directly affect stu- 
dent behavior. Such was the task of the 
present study. More specifically, the goal was 
to test one hypothesis regarding the effects of 
teacher expectations on teacher-student inter- 
action. This hypothesis is drawn from Rosen- 
thal's work and can be referred to as the 
“interaction quality” hypothesis. 

Rosenthal and Jacobson only speculated as 
to “how teachers brought about intellectual 
competence simply by expecting it [Rosenthal 
& Jacobson, 1968, p. 180].” Their data indi- 
cated that there was no difference in the 
amount of time teachers spent with the stu- 
dents in the two groups, thus suggesting that 
“it may have been more a matter of the type 
of interaction which took place between the 
teachers and their pupils which served as the 
determinant of the expected intellectual gains 
[p. 159].” 

This interaction quality hypothesis has ap- 
peared before as the preferred explanation in 
the research on the effect of experimenter 
bias (Rosenthal, 1966). In some of these ex- 
perimenter bias studies, experimenters ac- 
tually admitted that they behaved or felt 
differently toward their “bright” and “less 
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than bright” subjects (Rosenthal, 1966; Ros- 
enthal & Fode, 1963; Rosenthal & Lawson, 
1964). 

The interaction quality hypothesis, how- 
ever, is still only speculative. The studies on 
experimenter bias did not identify specific 
experimenter-subject interaction. The more 
recent classroom study (Rosenthal & Jacob- 
son, 1968) also did not investigate exactly 
what teacher behaviors could be influencing 
student performance. In an unsuccessful at- 
tempt at replication of this study, Claiborne 
(1969) apparently made an attempt to con- 
sider teacher-student interactions, but he 
provided no data on this variable. Meichen- 
baum et al. (1969), in a study which con- 
firmed the Rosenthal-Jacobson findings, spe- 
cifically and systematically considered 
teacher-student interactions. Employing the 
very global categories of “positive,” “nega- 
tive,” and “neutral,” they found some basis 
for concluding that expectations influenced 
the quality of teacher-student interaction, but 
in a complex and not easily interpretable man- 
ner. Two teachers increased their positive in- 
teraction with "special" students; one teacher 
decreased her negative interactions with spe- 
cial students. Thus, expectations differentially 
affected the quality of the teachers’ interac- 
tion with the expectancy students. Since the 
Meichenbaum et al. study involved only four 
teachers, their mixed results from very gen- 
eral categories cannot be taken as definitive 
support for the interaction quality hypothe- 
sis. What is needed is more detailed and ex- 
tended consideration of teacher-student inter- 
action following the manipulation of teacher 
expectations, 

One goal of the present research, then, was 
to find out whether or not varying teacher ex- 
pectations eventuated in differential teacher— 
student interactions. Another question was 
also considered. Assuming that giving teach- 
ers an expectancy affects their behavior, it is 
doubtful that all teachers are similarly af- 
fected. Two decades of research on authori- 
tarianism (Kirscht & Dillehay, 1967) would 
suggest that the degree of authoritarianism 
may well mediate an individual's conformity 
to an imposed set or expectancy. Accordingly, 
it was hypothesized that individuals high in 
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this trait would be more susceptible to an 
expectancy influence than those au- 
thoritarianism. 
METHOD 
Subjects 
Two different groups participate tudy. 
One group, referred to as teachers, d of 
26 female undergraduates enrolled ir lop- 
ment course. All teachers expressed ir each- 
ing as a career, but none had yet d dent 
teaching. Although the teachers eers, 
their instructor did allow them to > in 
this study in lieu of a term project em- 
phasized that the teachers knew n the 
experimental manipulations; they thou; were 
participating in a pilot microteaching being 
xperi- 


conducted to give them additional 
ence. 

The other group participating in the study was 
referred to as students. It included 104 sixth and 
seventh graders from a middle schoo! in a small 
midwestern city. These students were : 
lected and were given no instructions 8 
they were to behave. 


Measurement Procedures 


A major problem of the present stuc s to find 
a procedure for effectively analyzing the quality of 
teacher-student interaction relative to the purposes 
of the research. Most measures oi {co~ «student 
interaction (Flanders, 1960; Perkins, 1° or inter- 


) are not 
s, More- 


personal behavior in groups (Weick, 1% 
readily adaptable to microteaching situs 


over, the rather general analysis which these rating 
systems typically provide would not reveal the va- 
lidity of the interaction quality hypothesis. A special 
instrument was therefore developed. This instrument 


requires a trained observer to record the incidence of 
six different teacher behaviors. These six indexes are 
(a) teacher attention to students’ statements, sub- 
divided into attention to requested statements and 
attention to spontaneous student statements; (b) 
teacher encouragement of students’ statements; (c) 
teacher elaboration of students’ statements; (8) 
teacher ignoring of students’ statements; (e) teacher 
praise of students’ statements; and (f) teacher criti- 
cism of students’ statements. A detailed description 
of this observational procedure may be found im 
Rubovits (1970). It should be added that this im- 
strument appears to provide sufficiently reliable as- 
sessment of teacher behavior. In a preliminary che 
before proceeding with the study, dual raters wel 
employed, and interrater agreement was found to be 
90% or above in all categories. 1 

In addition to assessing transactions in the micro- 
teaching situation, personality and questionnaire 
data were obtained. An attempt was made to assess 
the teachers’ authoritarianism with the Rokeach 
(1960) Dogmatism Scale, Form E. A questionnaire 
was used to check on the credibility of the exper- 


mental manipulations and to obtain information 
about the teachers’ perci ptions of the students’ be- 


bavior. 


Experimental Procedure 


k prior to teaching, each 
sson plan which outlined the 
which suggested points that 
topic of television was se- 
;uld readily elicit the partici- 
nth graders and would allow 
unt of teacher-student inter- 
: was designed to maximize 
; allow the teachers considerable 
nd omitting points as well as 
vples. The teachers were re- 
; in mind the two objectives 
gather general information 
` viewing habits and (b) to get 
suggestions about possible class projects for a unit on 
television. Along v the lesson plan, the teachers 
were also given a brief description of the students 
that they would be meeting. Principally, they were 
told that they would be teaching a group hetero- 
geneous in terms of ability, since this “most closely 
approximates the typical classroom situation.” 
For each teact session there was a different 
group of four stu The students were brought to 
an unused class ‘here the study was conducted 
er was waiting. Before the stu- 
dents arrived the teacher was given a seating chart 
which not only contained each student’s first name 
but also an IQ score and a label indicating whether 
or not he had becn selected from the school's gifted 
Program or from a “regular” track. The IQ scores 
and labels had been randomly assigned to the stu- 
dents and bore no relation to the students’ actual 
abilities, It was hypothesized that the teachers formed 
different sets or expectations for those students with 
high IQs and the label “gifted” than they did for 
those students with lower IQs and the label “non- 
gifted,” 
E given the seating chart, the teacher was 
to familiarize herself with the names and to 
examine closely the scores and labels above each 
Dime. When the students arrived, the teacher asked 
ced student to sit in the seat designated on the 
ES This procedure allowed the teacher to identify 
the eh by name. As she seated each student, 
TA eacher was told to look directly at each student 
nS to read again to herself the IQ score and the 
el for that particular student, in order to become 
3 thoroughly acquainted with the child's competence 
pl possible, The teacher then introduced herself, ex- 
aining that she had come from the University of 
E to try out some new teaching materials with 
Bion an observer seated herself two rows 
ftach X e students and began categorizing the 
he sie behavior as soon as the teacher had seated 
m students and introduced herself. The observer 
egorized behavior for 40 minutes, although the 


Approximately 1 
teacher was gi 
topic to be tau 
could be covere 
lected as one whi 
pation of sixth ar 
for a considerab 
action, The less 
discussion and al 
freedom in addir 
in creating nev 
minded, however, to ke 
of the lesson: 
about these student 
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teaching session itself lasted for 43-60 minutes. It 
should be stressed that the observer was not aware 
of which two subjects had been labeled gifted and 
which two, nongifted. 

After the teaching session, the observer and the 
teacher discussed what had taken place, with the 
observer being careful not to reveal the random 
assignment of IQ scores and labels. Each teacher 
was then administered the postexperimental question- 
naire as well as the Dogmatism Scale. A month later, 
the teachers were given a thorough explanation of 
the study; the nature of the experimental manipula- 
tions was described, and the possible implications of 
the results were discussed. 


RESULTS 


Interaction Analysis 


The observation of the teachers was con- 
ceptualized as a measurement operation. The 
frequency counts collected on each teacher 
were considered to be the same as test scores, 
thus allowing for the combining of scores. 
Each teacher met with two different kinds of 
students, gifted and nongifted; each kind of 
student was viewed as a test. Every teacher 
made a score on each test; that is, for every 
teacher a certain number of observations were 
recorded for her interaction with gifted stu- 
dents and a certain number for her interac- 
tion with nongifted students. These scores 
were treated as repeated measures of the same 
jndividual. The individual observations can 
properly be considered as ordinal data, just 
as responses to individual (dichotomously 
scored) test items are ordinal measures. In 
the same way that we treat the sums of indi- 
vidual test item responses (total scores) as if 
they were interval data, we can proceed with 
the frequency observations of the interactions 
as if they were interval data. 

In the present case, two-way analyses of 
variance with repeated measures (Winer, 
1962) were employed.* Dogmatism, high and 


3A seemingly more straightforward approach 
might involve the use of difference scores as the 
dependent measure. However, as Manning and Du- 
Bois (1962) pointed out, a major problem is in- 
herent in the use of such scores, namely, that the 
reliability of difference scores tends to decrease as 
the correlation between the scores increases. There- 
fore, it sems preferable where possible to avoid the 
use of such composite measures, and thus, in the 
present case, a repeated-measures design employing 
a gifted and nongifted score separately seems pref- 
erable. 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS Or INTERACTIONS WITH GIFTED AND Noncirr: 
Srvupents ny TEACHERS Hich AND Low IN DOGMATISM 


Pameta C. Rusovits AND Martin L. MaEHR 


Gifted Nongifted Gifted Nongifted Gifted Nongifted 
ipia Total atiention Aptenon urguaoiictted Attention t: requested 
High d ti | 4 
NDA 5124 B41 38.77 38.31 12.46 5.00 
SD 17.20 20.17 13.73 17.16 7.14 3.28 
Low d ti 
u pipe 50.00 47,92 37.69 40.92 12.31 00 
SD 17.02 16.44 18.37 12.04 9.57 95 
Combined x 
M 51.00 45.00 38.23 39.62 12.38 00 
SD 17.20 18.24 16.76 14.90 8.43 ».66 
Encouragement Elaboration Ignoring 
High dogmatism À 
M 11.77 13.38 6.38 5.38 11.54 | 12,92. 
SD 7.54 8.04 8.66 3.38 7.94 | 111 
Low dogmatism : 
M 12.46 11.85 7.54 8.08 11.77 11.38 
SD 7.10 6.20 4.14 3.67 8.64 8.17 
Combined 
M 12.12 13.62 6.60 6.73 11.27 1245 
SD 1.20 748 4.99 3.85 8.30 9.78 
Praise Criticism 
High dogmatism | 
M 9.85 5.54 1.46 2.08 
SD 9.91 5.81 1.55 3.63 
Low dogmatism 
10.00 7.08 1.69 147 
SD 8.86 5.25 1.27 1.72 
Combined 
M 9.89 6.31 1.54 1.92 
SD 10.17 5.44 1.44 2.85 | 


low on the basis of a median split, was the 
nonreplicated main effect; gifted-nongifted 
was the replicated main effect. A separate 
analysis was conducted for each dependent 
measure of teacher interaction with gifted and 
nongifted students. 

Table 1 presents the mean number of 
teacher responses recorded in each category 
with gifted and nongifted students. Table 2 
presents the results of the separate analyses of 
variance. 

First let us consider the differences in 
teacher interaction with gifted and nongifted 
students, or the differences due to label. Only 
for two of the categories are there statistically 
significant differences in teacher interaction 
with gifted and nongifted students. 


Across all teachers, significantly more state- 
ments were requested of gifted students than 
of nongifted students (Category 1B). Also, 
across all teachers, the statements of gifted 
students were praised significantly more times 
than were the statements of nongifted stu- 
dents (Category 5). Significant differences for 
these two categories had been predicted. It 
also had been expected that there would be "2 
significant difference in the total amount o 
attention paid to gifted and nongifted stu 
dents. Implicit in the interaction quality hy- 
pothesis is the assumption that the amour 
of interaction remains constant across SU" 
dents. It is, presumably, the variation in the 
quality of that interaction that is critica 
Since the difference due to labeling for Cate- 
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TABLE 2 
ANALYSIS OF VARIANCE OF LABELS AND DOGMATISM 


df | MS F 


| 


Source df 


1 I ANOVA of labels and dogmatism in 


(E) ANOVA of labels and dogmatism 
in Category 3 (elaboration) 


Category 1 (total attention) 


1 78.77 -16 
24 476.81 

1 232.69 1.13 

1 60.31 30 
24 205.29 


Between 
Dogmatism (D) 1 5.56 
Ss within groups 24 121.31 

Within 
Label (L) 1 3.25 06 
DXL 1 10.17 48 
L X Sswithin groups | 24 66.34 


OVA of labels and dogmatism in Category 1A 
{attention to unsolicited statements) 


1 7.69 .02 
24 318.04 

1 24.92 12 

1 44.31 21 
24 204.99 


(F) ANOVA of labels and dogmatism 
in Category 4 (ignoring) 


Between 
Dogmatism (D) 1 15.85 ESI 
Ss within groups 24 33.16 

Within 
Label (L) 1 .69 07 
DXL 1 14.16 144 
LXSswithin groups | 24 9,84 


‘A of labels and dogmatism in Category 1B 
- (attention to requested statements) 


1 11.08 A2 
24 93.67 

1 529.93 17.33* 

1 15.17 49 
24 30.58 


(G) ANOVA of labels and dogmatism 
in Category 5 (praise) 


Between 


Dogmatism (D) 1 11.17 74 
Ss within groups 24 15.20 
Within 
Label (L) 1 167.33 5.40* 
XL 1 4.61 15 
L X Ss within groups | 24 30.94 


— (D) ANOVA of labels and dogmatism in 
Category 2 (encouragement) 


(H) ANOVA of labels and dogmatism 
in Category 6 (Criticism) 


1 16.17 .07 
24 98.12 

1 3.25 A3 

ho. 1 12.33 49 
Ss within groups | 24 24.99 


Between 


Dogmatism (D) 1 .08 01 
Ss itin ped 24 6.67 

Within 
Label (L) 1 1.92 03 
DXL 1 69 10 
L X Ss within groups | 24 6.97 


(total attention) was not significant, it 
that in accord with the interaction 
y hypothesis, the amount of teacher- 
t interaction was not markedly affected 
€xpectancy conditions. What was pre- 
Xl, and substantiated, was the hypothesis 
‘there would be significant differences in 
‘mount of teacher-initiated interaction 
Sifted and nongifted students. Con- 
» it was expected that there would be 
: nificant differences in Category 1A 


(attention to unsolicited statements). In effect 
this category provided a measure of the 
amount of student-initiated interaction. No 
significant differences in this category allow 
for the inference that there was little differ- 
ence in the spontaneity or verbosity of stu- 
dents. Gifted students were not, therefore, 
called upon more because they volunteered 
less than nongifted students. The differential 
recognition noted in Category 1B apparently 
is attributable to the expectancy effect. 


a 
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For the other four categories, significant 
differences due to labeling had been predicted, 
but none of these hypotheses was supported. 
The teachers showed no difference in the 
amount of criticism of gifted and nongifted 
students. They did not ignore the statements 
of one type of student more than the state- 
ments of the other type. Finally, teachers did 
not differentially encourage or elaborate upon 
the statements of one type of student. 

Consider now the effect of level of dogma- 
tism on teacher interaction. There were no 
significant differences in the behavior of 
teachers high in dogmatism and teachers low 
in dogmatism, as measured by these cate- 
gories. It had been predicted that there would 
be a significant interaction between the effect 
of labeling and the level of dogmatism. This 
prediction was not supported for any of the 
categories, It therefore must be concluded that 
the behavior of these 26 teachers did not indi- 
cate in any way that highly dogmatic teachers 
are more affected by the labeling than are 
teachers lower in dogmatism. 


Credibility of the Experimental Situation 


Through a postexperimental interview and 
questionnaire, an extensive check was made 
on whether or not the teachers accepted the 
experimental situation as it was presented to 
them. Although they were given ample op- 
portunity to do so, no teacher expressed any 
suspicion of the hypothesis being tested. 
Furthermore, when the teachers were asked to 
give their own personal evaluations of the 
students, they expressed marked agreement 
with the randomly assigned labels. Fifty-two 
students had been labeled gifted and 52 non- 
gifted, but only in the case of 8 of the gifted 
students and 3 of the nongifted students did 
the teachers express any reservations or dis- 
agreement with the labels. These data, as well 
as Clinical observations obtained in the post- 
experimental interview, suggest overwhelm- 
ingly that the teachers not only accepted the 
situation as planned but also interpreted the 
behavior of the students through the lens of a 
label. 


Discussion 


This study was conducted not to replicate 
the expectancy effect, but to make a start at 
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explaining it. In this regard the results 
support for the interaction quality hypothesis 
which was suggested by Rosenthal and Jacob- 
son (1968) and supported to some extent by 
Meichenbaum et al. (1969). 

In the present study, gifted students were 
called upon more and were praised more than 


nongifted students. It is, of co: possible 
to speculate that teachers in this study, being 
inexperienced, may have been s of bright 
students and so were oversolicitous of them, 
seeking out their opinions and being sure to 
praise them often. Whether or not the same 
results would be obtained with experienced 


teachers can only be answered by attempting 
to replicate these results with experienced 
teachers. However, Meichenbaum et al. 
(1969) included one experienced teacher who 
responded to expectancy students by increas- 
ing her positive interaction with them. 

It is, of course, interesting to speculate 
how calling upon and praising expectancy 
students more could lead to improved aca- 
demic performance by these students, if in 
fact teachers in other studies that found an 
expectancy effect (Meichenbaum et al., 1969; 
Rosenthal & Jacobson, 1968) behaved as 
teachers in this study did. Being given more 
of an opportunity to participate in the class 
could cause expectancy students to clarify 
their thoughts more through dialogue with the 
teacher and to demonstrate their proficiency 
more frequently. Receiving more praise has 
far-reaching implications for improving the 
students’ motivation and learning. From the 
present study, however, nothing can be con- 
cluded about student performance, given these 
differences in teacher performance. More re- 
search is needed to replicate this study’s find- 
ings and to investigate the effects of differ- 
ences in teacher behavior on student perform- 
ance. 

It had been expected that differences be- 
tween interaction with gifted and nongifted 
students would also appear in the categories 
of ignoring, elaboration, encouragement, an 
criticism. That these differences did not occu! 
may be an experimental artifact. If the lesson 
had allowed for less student discussion 3? 
demanded more clear-cut exhibitions of co™ 
petence, such differences might have bet" 


ibly, the teacher behaviors 
ating, and encouraging de- 
ation in different teach- 
the category of teacher 
well be that with the exis- 

tural proscriptions regard- 
tment of students, varia- 
vanifest themselves in the 
of gifted and nongifted 


found. Quite pos 
of ignoring, elabo: 
serve further cons 
ing situations. As 
criticism, it ma; 
tence of strong 
ing negative tr 
tions will not 
‘amount of criticism 
students. 

That dogmatizn 
‘effect on teachers’ | 
but not altoge:! 
all, the N was 
relatively homo: 
did not yield e: 
The matter, however, 
further. 

All in all, th 
tant evidence t 
tations do affec: 
it appears tha. 
such a way that 
performance wouid 
mer reported |) 
(1968). Howey 
order, inasmuc 
differences did 


lid not have the predicted 
chavior is a disappointing, 
surprising, finding. After 
vall (26), the sample was 
eous, and the median split 
remes of any real magnitude. 
ought to be pursued 


study has provided impor- 
effect that teacher expec- 
teacher behavior. Moreover, 
acher behavior is affected in 
: not unlikely that student 
be influenced in the man- 
Rosenthal and Jacobson 
a word of caution is in 
the originally predicted 
not occur in four out of six 
Cases. As indicated previously, there seem to 
be valid reasons in each case why these ex- 
pectations were not confirmed. Nevertheless, 
one might wonder whether the two confirmed 
predictions may in the overall scheme of 
things be best attributed to chance, in spite 
of the high significance levels found in each 
instance. Certainly statistical theory does not 
lead to that conclusion. The probability of 
obtaining two significant differences out of a 
total of six predictions is less than .05, assum- 
Ing the truth of a null hypothesis, Be that as 
ìt may, further work on the problem is indi- 
fated and, indeed, is already in progress 
(Rubovits & Maehr, 1970) to lend further 
Support to the hypothesis that expectations 


Influence the quality of teacher-student inter- 
action, 
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EFFECTS OF AGREEMENT AND CORRECTNESS O^ 
RELATIVE COMPETENCE AND CONFORMITY 


RONALD F. ETTINGER,? C. J. MARINO, NORMAN S. ENDLER, 
SHELDON H. GELLER, anp TARAS NATZIUK 


York University 


An individual’s estimate of his own competence relative to that of a grou 
was hypothesized to be (a) an effect of his prior experiences of correctness 
the group and of the group's agreement with him and (b) closely related : 
subsequent conformity to the judgments of that group on an unambiguo: 
task, A Crutchfield-type apparatus was used to manipulate prior experience 
of the subject’s correctness and group agreement as well as to measure subse 
quent conformity. The results indicate that the subjects’ perceived competen: 
relative to the group was closely related to later conformity. Subjects who 
perceived themselves as more competent than the group conformed significantly 
less than subjects who perceived themselves as less competent ($ < .05). It 
was concluded that relative competence is an important factor mediating the 


effect of prior experiences on conformity. 


An individual’s experiences in a group prior 
to social pressure situations are highly rele- 
vant to his susceptibility to those pressures. 
In particular, an individual’s tendency to con- 
form to the erroneous judgments of a group 
has been found to be related to the extent of 
the group’s prior agreement with him (Hol- 
lander, Julian, & Haaland, 1965; Julian, 
Regula, & Hollander, 1968; Julian, Ryckman, 
& Hollander, 1969). Similarly, if an indi- 
vidual receives information regarding his own 
accuracy or correctness on a task, his subse- 
quent tendency to conform will be influenced 
(Crutchfield, 1955; Endler, 1965, 1966; 
Jones, Wells, & Torrey, 1958). Experiences of 
correctness and group agreement may affect 
the confidence an individual has in his own 
ability. Self-confidence may, in turn, affect 
susceptibility to social influence; however, evi- 
dence concerning the relationship between 
confidence and susceptibility is somewhat 
inconsistent. 

A number of researchers have provided 
evidence that a negative relationship exists 
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of Canada. The authors wish to thank Corinne Wilks, 
Marilyn Trotter, and Ruthie Rosset for their assist- 
ance in the collection and analysis of the data, 

2 Requests for reprints should be sent to Ronald 
F. Ettinger, who is now at the Department of 
Psychology, Albion College, Albion, Michigan 49224. 


between confidence and susceptibility to dif- 
ferent types of social pressure (DiVesta, 
1959; Endler, 1965, 1966; Hochbaum, 1954; 
Kanareff & Lanzetta, 1960; Mausner, 1954). 
In contrast, Savell and Healey (1969) have 


reported a positive relationship between the 
two variables. Julian et al. (1968) found a 
positive relationship between the extent of 


prior agreement and task confidence; how- 
ever, prior agreement and subsequent sus- 
ceptibility to group pressure did not appear 


to be systematically related. This conflict may 
be partially resolved by considering another 
factor that may be relevant. In addition to 
making inferences about his own competence 
based on a knowledge of group agreement 
and/or correctness, an individua! may make 
inferences about the competence of the group. 
It is logical to assume that an individual 
in a group will consider both his own compe 
tence and that of the group prior to making 
a judgment. This is especially true when there 


is a unanimous group judgment that is M | 


conflict with his own perception. 

The present study was designed to inte 
grate previous research on the effects of prio" 
levels of agreement with that on the effects 
of feedback concerning correctness on subse- 
quent conformity. Specifically, relative com- 
petence was postulated as an important € 
planatory concept underlying the relationship 
between prior experience in a group 2" 
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mity. An individual’s infer- 
ve competence are based on 

both agreement and cor- 
competence may be de- 
repancy between an indi- 
e (self-confidence) and his 
mpetence of the group. In 
discrepancy favors the 
ere he perceives himself as 
an the group), his subse- 
vill be less than in situa- 
liscrepancy favors the compe- 
yup. The concept of relative 
des am explanatory basis for 
jus research on the effects of 
agreement and correctness on subse- 
It conforn In a study of the effects 
:ement on conformity, Julian et al. 

take into account the pos- 
bjects made inferences about 

of the group. Although “indi- 
idence” varied directly with 
'sreement, it was not possible 
he relationship between the 
ats of the competence of the 
{į prior agreement. Conform- 
ied directly with the dis- 
a a subject’s estimate of his 
and inferences he made about 
Competence of the group. Similarly, the 
ncept of relative competence may provide 
à tentative explanation of the results con- 
ming the effects of feedback concerning 
trectness on conformity. For example, 
hdler (1965, 1966) found that subjects who 
told that their responses were correct 
never they agreed with the group and 
correct whenever they disagreed conformed 
Ore than subjects who were told the op- 
osite. In other words, when the discrepancy 
tween an individual's competence and that 
the group was structured to favor the 
oup, individuals conformed to a greater ex- 
than when they perceived themselves as 
nore competent than the group. Assuming 
mat the subjects were motivated to be correct 
| both studies, they may have simply chosen 
E alternative that they perceived to have 
E e greatest probability of being correct. This 
ubjective probability of correctness may have 


sequent conf 
es about rel: 
mation about 
s. Relative 
as the 

"s comp: 

e of the 


ting pres 
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been based on the frequency of correctness 


(subject and group) on previous trials. 
These general formulations can be cast in 
the broader theoretical framework of Rotter’s 
(1954) social learning theory. The most 
simplistic and general formulation of this 
theory states that the probability of a given 
behavior is a joint function of the expected 
consequences (expectancy, which is expressed 
as a subjective probability) of that behavior 
and the reinforcement value associated with 
those consequences. Assuming that being cor- 
rect has a greater reinforcement value than 
being incorrect for most individuals, it can 
be stated that conforming or independent 
behavior is a function of the probability that 
such behavior will result in being correct. 
The present experiment employed a task 
similar to that used by Julian et al. (1968). 
However, in addition to manipulating levels 
of prior agreement, correctness was also ma- 
nipulated. It was hypothesized that prior 
experiences of agreement (group support) 
and correctness interact to affect an indi- 
vidual’s perception of his competence relative 
to that of the group. Information about 
group agreement alone may lead a subject to 
make inferences regarding his own compe- 
tence and that of the other group members. 
Inferences about competence are not neces- 
sary if the subjects are directly informed 
about the correctness of their responses. The 
subjects may then use group agreement in 
conjunction with information about correct- 
ness to directly determine their competence 
relative to that of the group. Furthermore, 
it was hypothesized that an individual’s per- 
ception of his relative competence is directly 
related to his tendency to yield to group 
pressure. Individuals who perceive themselves 
as less competent than the group will con- 
form more than individuals who perceive their 
competence as greater than that of the group. 
Finally, in the absence of a discrepancy be- 
tween self- and group-competence, correct- 
ness per se operates as the effective variable. 
Tt was hypothesized that individuals who per- 
ceive themselves and the group as equally 
competent, on the basis of purported cor- 
rectness, will conform less than individuals 
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who perceive themselves and the group as 
equally incompetent (incorrect). This study 
also assessed the effectiveness of the proce- 
dures used for manipulating agreement and 
correctness and evaluated the subjects’ sus- 
picion of the deception involved. Since it was 
hypothesized that the effects of correctness 
and agreement on conforming behavior are 
mediated by relative competence, it was neces- 
sary to explicitly determine whether or not 
relative competence had been affected by the 
experimental manipulations of prior group ex- 
periences. It was also necessary to assess the 
subjects’ suspicion of the deceptions em- 
ployed (Allen, 1966; Stricker, Messick, & 
Jackson, 1967, 1969). 


METHOD 
Apparatus 


A Crutchfield-type social communication apparatus 
(Endler, 1966) was used to record each subject’s 
response and to communicate both contrived re- 
sponses of the other group members and the “correct” 
response to each subject. Subjects were led to believe 
that they were communicating directly with each 
other, but, in fact, all communications were con- 
trolled by the experimenter. 

A light-discrimination apparatus, similar to that 
employed by Julian et al. (1968), consisted of three 
blue stimulus lights arranged in a horizontal plane 
and placed 10 centimeters apart. For each trial, one 
stimulus light was extinguished first, and then the 
remaining two were extinguished simultaneously. 
The sequence of extinguishing was randomly deter- 
mined. On manipulation and critical trials, one light 
was extinguished .045 second before the other two 
lights. Previous pilot work indicated that this par- 
ticular time interval enabled the subjects to correctly 
identify the order of extinguishing 95% of the time. 
On buffer trials, the time interval ranged from 
.010 second to 1 second. The time between the 
extinguishing of the first stimulus light and the 
remaining two lights was controlled to within .001 
second by means of a variable solid-state timer. 


Questionnaire 


A competence questionnaire required the subjects 
to make three independent evaluations of their own 
competence and that of the group by (a) evaluating 
ability on a 6-point Likert-type scale, (b) indicating 
the expected number of correct judgments in the 
next set of 10 trials, and (c) estimating the per- 
centage of prior judgments that were correct. An- 
other questionnaire designed to measure suspicion of 
deception consisted of two open-ended questions 
concerning knowledge of the purpose and method of 


"axe" T7 AM 


ErrixGER, Marino, ENDLER, GELLER, AND NATZIUK 


the experiment and was administered at the end 


of the experiment. 


Subjects and Procedure 


Forty male undergraduate subjects we: 


to participate in a study that d to test 
visual discrimination, Only subje 10 had not 
taken any psychology courses were ruited. The 
task required the subjects to judge on each trial 
which of three blue stimulus lights wa tinguished 


first. Subjects were tested in groups e, and 
their judgments were communicated io the experi- 
menter who transmitted contrived res; s to the 
subjects. 

Five subjects were randomly assigned io each cell 
of a2 X 2 X2 factorial design, with correct-incorrect, 
agreement-disagreement, and the tim: question- 
naire administration (intervening versus postexperi- 
mental) defining each of the factors, pectively. 

All subjects in the eight experimenta! conditions 
were subjected to social pressure and were required 
to make 36 judgments. The 36 trials were conducted 
as follows: 

1. Trials 1-6 consisted of buffer trials »dard for 
all groups. Three very easy trials (1-:econd time 
lapse) and three very difficult trials (00 second) 
were presented, Subjects responded second. On the 
easy trials they were informed that the ip agreed 
find was correct. On the difficult trial members 
of the group disagreed, and neither the subject nor 
the group was correct. 

2. Trials 7-12 comprised the manipulation trials, 
during which two of the three independent va riables 
were manipulated (agreement and correctness). All 
subjects responded first. During each o! these trials 
in the correct condition, the subjects were informed 
that they were correct; in the incorrect condition 
subjects were informed that they were incorrect (and 
that the group consensus was correct). In the agree- 
ment condition, the contrived responses were mM 
agreement with the subjects response; in the dis- 
agreement condition, the contrived responses were In 
uniform disagreement with the subject response. 
The competence questionnaire was administered either 

(ie, after 


at the end of the manipulation trial 
Trial 12) or after all 36 trials. 
3. Trials 13-32 contained 4 buffer trials and 16 
critical trials. On the critical trials, subjects respon 
in Positions 4 or 3 after secing unanimous incorrect 
judgments of the four other group members. It was 
during these 16 critical trials that social pressure was 
induced and conformity was measured Fa 
4. Trials 33-36 were included to preserve credibil- 
ity. The group unanimously made correct judgments: 
Subjects responded third. 


Statistical Analyses 


; . 5X 
A three-way analysis of variance (Correctness 


Agreement X Time of Questionnaire Administration 


was computed for each of the three estimates of 
the subject’s own competence and the three estimates 
of the group’s competence. Another three-way 
analysis was computed using conformity as the 
dependent variable. 

A post hoc anal 


is of conformity, with subjects 
stratified into three levels of relative competence, 
was also computed. The stratifications were based 
on differences between the subject’s estimate of his 
own competence and the subject’s estimate of the 
groups competence. They were divided into three 
different levels of relative competence: (a) subjects 
who perceived themselves as being more competent 
than the group average (self > group); (b) subjects 
who perceived their competence as being equivalent 
to the group av e (self = group); and (c) sub- 
jects who perceived themselves as being less com- 
petent than the up average (self < group). Clas- 
sifications were 1 ndependently for each measure 
of competence: y evaluation, expectancy, and 


estimate of percentage correct. 
RESULTS 
Competence Manipulation 


The eífectiveness of the experimental ma- 
nipulations was assessed by means of three- 
way analyses of variance (Correctness X 
Agreement x Time of Questionnaire Admin- 
istration) of three self-report items measuring 


TABLE 1 

MEANS AND STANDARD Deviations OF THREE MEA- 

SURES OF SUBJECTS’ PERCEIVED COMPETENCE OF 

SELF AND OF THE GROUP 
Measures of subjects’ perceived competence 

Experimental À eid iat we 
conditions Ability Expectancy | % correct 
EX. Self | Group | Self | Group Self | Group 
Correct- ie p 

agree 

d 49 | 43 |78 | 55. |70.0 | 71-1 
SD ‘57 | 125 | 1.10] 1.35 | 14.91| 9.61 
Correct- 

disagree 

a 49 |31 [7.5 | 5.7 [76.5 |518 
nod 57 | 148 | 126 | 1.49 |10.81| 14.91 
Neorrect- 

agree 

M 31 |28 |51 |48 [43.5 | 44.0 
SD | 1.50 | 1.03 |262| 1.14 | 25.28) 14.10 
Theorrect- | 

disagree 

x [25 | 46 |54 | 68. |415 |65.5 
SD | 1.27 | 1.26 | 2.42) 2.57 23.24) 23.03 
Note,—y Jaa r% n 


gros ectnem refers to the subject, and agreement refers to the 
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TABLE 2 


THREE-WAY CLASSIFICATION ANALYSIS OF VARIANCE 
or TOTAL CONFORMITY SCORES 


Source df MS F 
Correctness (A) 1 193.59 | 7.70* 
Agreement (B) 1 16.89 67 
Time of questionnaire (C)| 1 14.39 OE 
AXB 1 4.89 19 
AXC 1 6.40 25 
BXC 1 62.49 2.48 
AXBXC 1 | 20249 | 8.05* 

Within cells (error) 32 25.13 
*p «.01. 


the subjects’ estimated self-competence (abil- 
ity evaluation, expectancy, percentage cor- 
rect). Similar analyses were performed on the 
subject’s estimate of the group’s average 
competence (see Table 1 for the means and 
standard deviations). The main effect of cor- - 
rectness on the subjects estimated self- 
competence was significant in all three cases 
(p < .001). Neither agreement nor the Cor- 
rectness X Agreement interaction had a 
significant effect on the subject's estimated 
seli-competence. The interaction between 
agreement and correctness for the subject’s 
estimate of the group’s competence was sig- 
nificant for all three items (p < .001). In the 
subject-correct condition, group agreement 
produced a high estimate of group compe- 
tence, and group disagreement produced a low 
estimate of group competence. The reverse 
was true in the subject-incorrect condition. 
The correlations among the three different 
measures of competence (ability evaluation, 
expectancy, and percentage correct) ranged 
from .77 to .85. 

Since the above results provide evidence 
that the experimental manipulation was ef- 
fective, an analysis of the conformity results 
was computed. 


Conformity 


The three-way analysis of variance (Cor- 
rectness X Agreement X Time of Question- 
naire Administration) of total conformity 
scores is summarized in Table 2. Table 3 
presents the relevant means and standard de- 
viations. The main effect of correctness was 
significant. Subjects in the subject-correct 
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TABLE 3 


MEANS AND STANDARD Deviations OF Tora CON- 
FORMITY SCORES AS A FUNCTION OF SUBJECT COR- 
RECTNESS, GROUP AGREEMENT, AND TIME OF AD- 
MINISTRATION OF COMPETENCE QUESTIONNAIRE 


Experimental condition 


Time of adminis- 
tration of compe- 
tence question- 
naire 


Subject correct Subject incorrect 


Group Group Group | Group 
agrees | disagrees} agrees | disagrees 
Intervening 
M 3.20 4.60 12.00 | 3.00 
SD 4.38 5.26 4.74 | 453 
Postexperimental 
M 3.20 60 4.60 | 9.60 
SD 7.16 55 416 | 6.50 


Note.—n = 5 subjects per group. 


condition conformed significantly less than 
those in the subject-incorrect condition. The 
triple interaction was also significant. 

Since the Correctness X Agreement inter- 
action was not significant, these results only 
provide partial confirmation of the hypothe- 
sis that subjects who see themselves as more 
competent than the group (subject-correct, 
group-disagrees condition) conform more than 
subjects who perceive themselves as less com- 
petent than the group  (subject-incorrect, 
group-disagrees condition). The significant 
three-way interaction may be interpreted by 
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Fic. 1. The effects (interaction) of subject cor- 

rectness and group agreement on conformity for two 

levels (times) of administration of a competence 
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comparing the simple Correctness X Agree- 
ment interactions at the different levels 
(times) of the administration of the com- 
petence questionnaire (intervening versus 
postexperimental). When the competence 
questionnaire was administered at the end of 
the experiment (postexperimental), the hy- 
pothesis received support. When it was ad- 
ministered between the manipulation and the 
social pressure situation, the hypothesis was 
not supported. The simple two-way inter- 
actions (Correctness X Agreement) for inter- 
vening and postexperimental questionnaire 
administration, respectively, are presented in 
Figure 1. 


Post Hoc Analysis of Relative Competence 


A simple one-way analysis of vari: ance : 
performed on the conformity scores for the 
three levels of relative competence (self < 
group, self = group, self > group) based on 
each of the three measures of competence 
(ability, expectancy, and percentage correct). 
The effect of relative competence based on 
both ability evaluation (F = 3.47, df = 2/37, 
p<.05) and estimated percentage correct 
(F = 5.94, dj = 2/37, p < .01) was signifi- 
cant, thereby supporting the hypothesis re- 
lating conformity to relative competence. The 
effect of relative competence based on expect- 


TABLE 4 

MEANS AND STANDARD Deviations Or Tora CON- 
FORMITY SCORES FOR SUBJECTS STRATIFIED INTO THREE 

LEVELS or RELATIVE COMPETENCE AS ASSESSED 

BY THREE INDEXES OF COMPETENCE 
(Post Hoc Cr. ASSIFICATION) 
Conformity scores for levels of 
Index of relative competence — 
competence — — 
Self > group | Self = group | Self, < group 

Ability 

n 15 11 14 

M 2.40 5.45 7.71 

SD 3.93 7.86 9.37 
Expectancy 

n 15 11 14 

M 3.33 4.09 7.79 

SD 5.55 6.73 9.19 
% correct 

n 13 11 16 

M 1.00 7.55 6.63 

SD 88 9.56 8.38 


ncy, although in the predicted direction, was 
Mot significant. The relevant means and stan- 
d deviations appear in Table 4. Subjects 
ho perceived themselves as more competent 
the group consistently conformed less 
an those who perceived the group as more 
‘competent. 


Suspicion of Deception 


_ A one-way analysis of variance was also 
with subjects divided into two 


). Suspicious subjects also rated 
es higher than the unsuspicious 
bjects on the self-competence measures 
Meg, for ability evaluation, F = 4.33, df= 
1/38, p< .05) but not on the estimates of 
group competence (see Table 5). Table 6 il- 
trates a comparison of mean conformity 
Scores for suspicious and unsuspicious sub- 
Jects in the three levels of perceived relative 
‘Competence. Suspicious subjects were equally 
“distributed among the experimental conditions. 


DISCUSSION 


The experimental manipulation of group 
agreement and subject correctness effectively 
induced the expected levels of perceived indi- 
Vidual and group competence in most of the 
Subjects. The discrepancy between the sub- 
jects’ evaluations of their own competence 


TABLE 5 
Ratios FOR Suspicious AND UNSUS- 


CTS ON PERCEIVED SELF- AND GROUP- 
ENCE AND TOTAL CONFORMITY 


Dependent variable psy eid E 
Ability 4.46 348 | 434* 
PIpectancy 7.47 $83 | 6.50* 

0) 

Group rrect 65.00 53.60 2.09 
Ability 3.53 3.83 <1 
xpectancy 6.33 6.12 <1 

% correct 59.00 57.00 «1 
Totalconformity| — 2.67 6.56 4.64" 


*« os, 
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ious (F = 4,64,..df —1/38;— 


TABLE 6 


Means or TOTAL CONFORMITY Scores FOR SUSPICIOUS 
AND UxsusPiCIOUS SUBJECTS STRATIFIED INTO ' 
THREE LEVELS OF RELATIVE COMPETENCE 


Suspicious Unsuspicious 
Level of relative 
competence 
n M n M 
Self > group 8 1.12 7 4.14 
Self = group 4 4.75 7 5.86 
Self < group 3 4.67 11 8.54 


and their evaluations of the group’s com- 1 
petence was very small in both group-agree 
conditions. In the group-disagree conditions, 
the discrepancy was in the direction of the - 
correctness manipulation. E 

Given the success of the experimental ma- 
nipulations of competence, it was expecte 
that conditions producing low competence 
discrepancies (subject-correct, group-agree 
and subject-incorrect, group-agree) would not 
differ greatly in the amount of conformity — - 
elicited. The  high-discrepancy conditions 
(subject-correct, group-disagree and subject- 
incorrect, group-disagree), according to the 
hypotheses, should have elicited significantly 
different amounts of conformity. The results 
clearly illustrate that this only occurred when 
the competence questionnaire was adminis- 
tered postexperimentally (see Figure 1). In 
the postexperimental questionnaire condition, — 
there was greater conformity when the com- 
petence discrepancy favored the group than, . 
when the discrepancy was in the subject's 
favor. The low-competence-discrepancy con- | 
dition produced intermediate levels of con- 
formity as hypothesized. The intervening — | 
questionnaire may have produced a reaction — 
on the part of the subjects. In spite of these 
apparently inconsistent results in the two  - 
questionnaire-administration conditions, the | 
predicted main effect of correctness was sig- -— 
nificant, That is, subjects in the incorrect — 
conditions conformed more than subjects in 
the correct conditions. 

The post hoc analysis of relative compe- 
tence based on subjects’ responses to the 
questionnaire lends further support to the 
hypotheses. Subjects who perceived them- 
selves as less competent than the group con- 
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sistently conformed more than those who per- 
ceived themselves as more competent than the 
group. The apparent discrepancy between the 
results based on the experimental manipula- 
tions and those based on the subjects’ re- 
sponses to the competence questionnaire can 
be partially explained by the fact that the 
manipulation was not successful for all sub- 
jects. Nevertheless, the results based on the 
subjects’ perception lend considerable support 
to the relative competence hypothesis. 

The results also provided indirect evidence 
that group agreement, when manipulated ex- 
clusively, may have been used by subjects in 
prior studies to evaluate relative competence. 
This conclusion was reached on the basis of 
the present findings that when information 
regarding correctness was available, agree- 
ment failed to produce a main effect. A sig- 
nificant correctness main effect and a Correct- 
ness X Agreement (X Time of Questionnaire 
Administration) interaction was produced. 

Mean conformity scores obtained in the 
present post hoc analysis closely parallel those 
reported by Endler (1966) and suggest that 
an interpretation of his results in terms of 
relative competence is quite appropriate. 
The Julian et al. (1968, 1969) studies, 
like the present one, required the subjects to 
make judgments about unambiguous stimulus 
events. In both Julian et al. studies, the mean 
number of conforming responses for subjects 
who experienced 100% agreement on all pre- 
trials was very close to the subject-correct, 
group-agree condition in the present study. In 
both cases there was no basis for subjects to 
assess discrepancies between their own compe- 
tence and that of the group. Conversely, in the 
75% agreement condition, Julian et al. (1968) 
found relatively little conformity. This could 
be explained by the fact that in the 75% 
condition, the subject was inevitably in the 
majority, while each other group member was 
in a minority one-fourth of the time. If the 
subjects inferred correctness from majority 
agreement, then in the 75% condition, the 
subject was the only group member who was 
never incorrect, and thus was relatively more 
competent than the group. A potential weak- 
ness in this argument lies in the finding that 
subjects who experienced 50% agreement con- 
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formed more than those in the % agree- 
ment condition, in spite of the íact that they 
were also in the majority all the time. Perhaps 
being part of a 4 to 1 majority leads to 


greater certainty of correctness than does 
being part of a 3 to 2 majority. 
In a study of self-blame and conformity 


using children as subjects, Costanzo (1970) 
found evidence that also parallels the present 
results. Conformity was found to be related 
lf-blame and 
other-blame. A sample item from Costanzo’s 
measure of self-blame and other-blame asks 
the children how much they deserve to be 
blamed for an imagined act of in pee 
and also how much blame a friend deserv 
for the same act. The discrepancy between 
self-blame and other-blame may also be con- 
ceptualized as indicative of a generalized level 
of relative competence. 

The results of this experiment are not con- 
sistent with those reported by Savell and 
Healey (1969), who found that confident 
subjects conformed to a greater extent than 
did those with little confidence. This may 
have been largely a function of methodo- 
logical differences. The present study required 


judgments of unambiguous events, whereas 
the Savell and Healey experiment required 
subjects to differentiate between stimuli that 
were actually identical (i.e., ambiguous stim- 
uli). Confidence in one's ability to make very 
difficult or impossible discriminations must 
logically be quite different from confidence 1n 


one's ability to judge unambiguous events. 
'The feeling of confidence reported by sub- 
jects in the high-agreement condition in the 
Savell and Healey experiment may have been 
a false confidence. In an effort to maintain 
a high level of accuracy, subjects in the 
Savell and Healey study may have become 
dependent on their partners as the only 
potential source of information about Cor- 
rectness. In the other experimental conditions, 
the subjects may have been more aware of 
the impossible nature of the task and thus 
less motivated toward correctness. ` 

The present results can be interpreted in 
terms of social learning theory (Rottet, 
1954). For example, Lanzetta and Kanare 
(1959) concluded that imitation was a func 
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| probability of success, anx- 


tion of perceive 
with disapproval, and the 


jety associatec 


value of a monetary reward. Similar elements 
were incorpor: into the social learning 
interpretation of the present experiment. 
Within the present experiment, the subject’s 


e was restricted to three 
ts of a stimulus event. 


behavioral rej 
possible judgr: 


By focusing on expected consequences re- 
lated to socia! approval, the social learning 
model can be further restricted to one which 
states that the probability of each of the three 
judgments is function of the expected 
pproval-relaie consequences for each choice 

id the reinforcement value associated with 


ose consequences, In the social pressure 
ituation, the subjects may have indicated 


the judgment ‘hat they expected to lead to 
the greatest positive and/or least negative 
evaluations from the experimenter and the 
other group members. In most cases, the sub- 


incorrect responses with 
negative sanctions may have made the re- 
sponse that they expected to have the greatest 
probability of being correct. This interpreta- 
tion assumes that subjects are motivated to 
be correct. provision of a monetary 
Teward for correct responses might have pro- 
vided a means of manipulating this variable. 


jects associating 


Two other aspects of the results warrant 
further discussion. First, subjects who were 
Even the competence questionnaire before 
being subjected to group pressure did not 
respond to that pressure as hypothesized 


(producing a lack of interaction between cor- 
rectness and agreement); and second, 38% 
Of the subjects evidenced some degree of sus- 
Picion and conformed significantly less than 
Subjects who evidenced no suspicion. A tenta- 
tive explanation, consistent with previous re- 
Marks concerning approval motivation, is 
Possible. The intervening questionnaire may 
have sensitized subjects to other approval- 
related contingencies in the situation. Ap- 
Proval may not have been perceived as de- 
pendent on “correct” responses. Similarly, 
resisting purported social influence when one 
'S suspicious of deception may be a strategy 
for avoiding the ridicule or disapproval of the 
deceiver and others aware of the deception. 
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The introduction of the intervening question- 
naire may have produced variations in the 
normative expectations of the group members. 
As Hollander and Willis (1967) suggested, 
conformity to group norms may not be defined 
alike for all members of the group. Further 
research would be necessary to determine the 
empirical validity of these propositions. The 
experimental manipulations of correctness did 
not have the hypothesized effect on the com- 
petence of the suspicious subjects. However, 
the post hoc analysis of the responses of 
suspicious subjects to the competence ques- 
tionnaire indicated that their low mean con- 
formity may have been due to their consistent 
estimates of themselves as more competent 
than the group. 

Tn a situation requiring judgments of physi- 
cal reality in the presence of a group, gaining 
approval and/or avoiding disapproval may be 
perceived by subjects as dependent on being 
correct. Conformity to a group consensus in 
such a situation may be dependent on the 
subjective probability that such a consensus 
is correct, If there is a conflict between the 
subject's initial judgment and that of the 
group consensus, the subject may make 
judgments on the basis of relative competence. 
Relative competence as affected by prior 
experiences of agreement and correct- 
ness appears to be an important mediator of 
conforming behavior. 
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tion with anticipated outcomes 


at different points in the pe 


Several cognitive theories generate predic- 
lions concerninz an individual's satisfaction 
with anticipated or experienced outcomes. 
Some theories deal with satisfaction in a pre- 
performance state in which outcomes are 


expected but have not yet been received, while 
others predict satisfaction in a postperform- 
ance state in which outcomes have been, or 
are being, received. 

One set of preperformance predictions may 
be derived fror cognitive theories of decision- 
making behavior. Although several different 
labels have been attached to the constructs 
with Which these theories deal, the underlying 
logic is quite uniform. The attractiveness of 
à given option is seen as a multiplicative func- 
tion of the valences of the outcomes associated 
With the option and the subjective probabil- 
ity (expectation) of receiving those outcomes. 
term representing the negative consequences 
9r costs involved in obtaining an outcome is 
often subtracted from the Valence X Subjec- 
tive Probability products to yield a more ac- 
curate representation of the desirability of the 
option. The most desirable of several available 
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tive Sciences, Yale University 


rt jective probability of doing well on a task and the reported success 
I z performance of the task were manipulated within a factorial design. 
The ence of doing well was measured but not manipulated. Preperformance 


was found to follow a pattern suggested 


y while postperformance satisfaction was consistent with Morse's 
he These two models are highly similar, though designed to predict satis- 


rformance sequence. Predictions derived 


mple multiplicative and subtractive models were not supported. 


options (the option with the greatest expected 
gain) will be chosen (Atkinson, 1957; Ed- 
wards, 1954; Lewin, Dembo, Festinger, & 
Sears, 1944; Rotter, 1954; Tolman, 1959). 

This paradigm suggests that the individual’s 
satisfaction with any specific outcome he may 
obtain by pursuing a given option is a func- 
tion of the anticipated gain (or loss) he per- 
ceives that outcome to entail. Thus, his pre- 
performance satisfaction with an outcome is 
determined by its valence, its probability of 
being obtained, and the costs that must be 
incurred: 


Satisfaction with Specific Outcome = 
Valence X Subjective Probability — Costs. 


By summing the Valence x Subjective 
Probability terms of the several outcomes that 
may be obtained by pursuing an option, one 
derives an index of preperformance satisfac- 
tion with that option: 


Satisfaction with Option = 2 (Valence 
Subjective Probability) — Costs. 


According to this model, when subjective 
probability is held constant, an increase in 
valence produces an increase in satisfaction. 
Likewise, when valence is held constant, an 
increase in subjective probability produces an 
increase in satisfaction. Highest satisfaction 
is predicted when both valence and subjective 
probability are high, and the lowest satisfac- 
tion is predicted when both valence and sub- 
jective probability are low. 

A different model of preperformance satis- 
faction has been proposed by Steiner (1970). 
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He contends that satisfaction may be 
expressed as the proportion of desired gain 
which an individual expects to receive. The 
numerator of this ratio represents expected 
gain as defined above, while the denominator 
represents total desired gain and is expressed 
simply as the sum of the valences of all de- 
sired outcomes. Thus, 


Satisfaction = 
(Valence X Subjective Probability) — Costs 
ZValences E 


Steiner's model assumes that any situation 
may be characterized as involving several 
outcomes, each of which has some valence and 
some subjective probability of attainment. If 
a specific outcome carries a high valence, its 
contribution to the satisfaction ratio will be 
strongly influenced by its subjective probabil- 
ity level. But if the valence is low, subjective 
probability will have little effect on satisfac- 
tion. According to this formulation, if costs 
are constant, preperformance satisfaction will 
be highest when both valence and subjective 
probability are high, and lowest when valence 
is high but subjective probability is low. Thus, 
this model predicts an interaction effect of 
valence and subjective probability. 

Ross and Zander (1957) have presented a 
simple subtractive model of satisfaction in 
postperformance situations. According to this 
model, outcomes are related to specific needs 
of varying strength. In a given situation these 
needs may be partially or totally satisfied. 
Postperformance satisfaction is seen as an 
inverse function of the degree to which these 
needs have been unfulfilled. Thus, 


ADAC 1 
Satisfaction — SN, ENS 
where XN, refers to the sum of the strengths 
of all salient needs, and XN, refers to the sum 
of the degree to which these needs have been 
fulfilled. When the degree to which needs have 
been fulfilled is held constant, any increase in 
the strength of needs (analogous to the val- 
ence of the outcomes) produces a decrease in 
satisfaction. On the other hand, when the 
strength of needs is held constant, any increase 
in the degree to which they have been fulfilled 
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produces an increase in satisi 
model predicts that the highest 
will occur when needs (of wha 
are completely fulfilled, and the 
faction will occur when needs of h 
are only slightly fulfilled. 

A more complex version of the su 
model has been presented by M 


gh strength 


tractive 
1953). 


She contends that intraindivid ision is a 
positive function of need strength. Whereas 
Ross and Zander consider satisfaction to be an 
inverse function of the amount of tension re- 


maining in the system, Morse subtracts the 


amount of tension remaining from the amount 
of tension which has been relieved. ‘Thus, the 
absolute magnitude of the original degree of 
tension is taken into consideration. Here, the 
greatest satisfaction is produced when high 


tension is substantially reduced, the low- 
est satisfaction is produced when high tension 
is only slightly reduced. Morse has presented 


the following algebraic representation of this 
model: 


Satisfaction = ER — (T, — ER) 
2ER — T; 


ll 


where ER refers to the amount of environ- 
mental return, and T, refers to the original 
amount of tension. It should be noted that 
Morse’s predictions are analogous to those of 
the Steiner model. This is not surprising since 
both view satisfaction as a function of the 
amount of outcome received (or expected) ™ 
relation to the amount desired. On the other 
hand, the multiplicative model views satisfac- 
tion as a direct function of the amount of out- 
come received, and the simple subtractive 
model conceives satisfaction to be an inverse 
function of the degree to which desired out- 
comes have not been received. y 
Two additional theories make opposing 
predictions concerning the effects of prepet- 
formance subjective probabilities on post- 
performance satisfaction. Equity and disso- 
nance theorists have contended that discon- 
firmed expectancies (in any direction) Pf 
duce negative affect (Adams, 1963; Aronson 
& Carlsmith, 1962; Carlsmith & Aronson, 
1963; Patchen, 1961). Thus, unexpected suc 
cess should produce dissonance and should not 
be as satisfying as expected success, and Ur 


expected failure should be more dissatisfying 
than expected failure. On the other hand, it 
may be argued that when success or failure is 
unexpected, greater affect in the appropriate 
direction is produced. The elation-depression 
hypothesis predicts, as does dissonance theory, 
that an individual who unexpectedly fails 
should be especially depressed. However, in 
opposition to the dissonance hypothesis, an 
individual who unexpectedly succeeds should 
be more elated than an individual whose suc- 
cess was anticipated (Feather, 1969). 

The present research compares predictions 
based on the four satisfaction models dis- 
cussed above. Both preperformance and post- 
performance satisfaction are examined. The 
aim was to test hypotheses concerning satis- 
faction with different combinations of cogni- 
tive states, ignoring the relative frequency 
"with which such states occur in nature, and 
the dynamics which tend to produce them. 


METHOD 
- General Overview and Strategy of Procedures 


In everyday life subjective probabilities are likely 
lo reflect past experience with the task that is to be 
Performed, or with similar tasks. In the present 
Study it was desirable to manipulate subjective proba- 
bilities, so a comparatively unfamiliar task was em- 
Ployed. Subjects were given practice trials and told 
that their performance indicated that they were 
likely or unlikely to do well on the “real” task which 
Would follow, Throughout this study the expression 
Pteperformance” refers to the subject’s state after 
completing the practice trials but before performing 
the real task, “Postperformance” refers to his state 
after performing the real task. 
E following strategy was adopted in order to 
En h the preperformance and postperformance 
ES inations of valence, subjective probability, and 
the ronmental return. First, subjects were told that 
TH ud perform both a practice and a real task 
a at performance on the real task was known 
che related to certain intellectual abilities. It was 
E that the subject's practice score would 
Subje is prospects for doing well on the real task. 
: ied then (1) scaled the valence of doing well on 
: Ras task, (2) performed the practice task, (3) 
subi tandomly assigned to receive the high or low 
1 uum probability treatment, (4) performed the 
Mor and (S) within each of the previously 
Bou subjective probability conditions, were 
s T assigned to receive high or low perform- 
BS (Percentile) feedback. The eight cells of this 
Sekt. design are presented in Table 1. 
kv, eno was measured after valence and sub- 
Ve probability had been established (preperform- 
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TABLE 1 
EXPERIMENTAL DESIGN 


Postperformance 
Preperformance fee 
S's valence probability 
treatment 
High | Low 
High (above median) | — High E 
Low (below median) ns $ E 


ance state) and after the subject's outcome had be- 
come known (postperformance state). In addition, by 
crossing and balancing preperformance expectancies 
and postperformance feedback, the elation-depression 
effect and dissonance predictions were compared. 


Subjects 


One hundred and fifty-three students (including an 
equal number of males and females), enrolled in the 
introductory psychology course at the University of 
Illinois, served as experimental subjects. All received 
course credit for their participation. No attempt was 
made to treat the sex of the subjects as an indepen- 
dent variable or otherwise to determine the effect of 
sex on reactions to the experimental treatments. 


Procedure 


Subjects were run in groups which ranged in size 
from 5 to 10. Modal group size was 8. Each subject 
received a booklet which contained all the materials 
used in the experiment. Subjects were told that they 
were participating in the experiment as individuals 
and that several individuals were being run at the 
same time, solely for reasons of economy, Thus, they 
were asked to behave as though they were being 
tested alone; that is, they were not to communicate 
with the other subjects in any way. After answering 
any questions on this point, the experimenter read 
the following introductory statement: 


The purpose of this research is to study your 
performance on a task of mental agility. First, I 
will give you a practice task as a warm-up exer- 
cise. The practice task will not be the same as 
the real task, although similar abilities are in- 
volved, This fact will insure us that performance 
on the practice task does not interfere with or 
facilitate performance on the real task. Since the 
practice and real tasks have been given together in 
many psychological experiments, your practice 
score will allow me to make a pretty good guess 
as to how well you will do on the real task. I 
will transmit this information to you after you 
perform the practice task so that you may know 
what your prospects are. 
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In past experiments, we have established that 
performance on the real task is moderately related 
to several academically relevant criteria. For ex- 
ample, we have found moderate correlations be- 
tween scores on the real task and tests of ab- 
stract and creative thinking ability, grade point 
average, and a personality adjustment inventory. 


Following these introductory remarks, valence 
was assessed by asking the subjects to indicate their 
answer to the following question on a 10-point bi- 
polar scale: How important is it for you personally 
to do well on the real task? (1= not at all impor- 
tant, 10 — very important). The experimenter then 
asked the subjects to turn to the page on which the 
practice task was located and to read the instruc- 
tions as the experimenter read them aloud. The in- 
structions told the subjects to form as many words 
as possible in the time allowed írom the letters of 
the word "ESTABLISHMENT" which appeared in 
the middle of the page. After asking for any ques- 
tions, the experimenter allowed the subjects 90 sec- 
onds to work on the task and then asked them to 
count the number of correct words they thought 
they had produced, and to place that number at the 
top of the page. 

The subjects were then asked to turn to the next 
page of the booklet. This page contained a large 
table entitled “Table of Norm Probabilities.” The 
experimenter provided the following explanation of 
the table: 


I indicated in my introductory remarks that your 
score on the practice task would provide some in- 
formation about how likely you are to do well on 
the real task, and that this information would be 
transmitted to you. Information about the pre- 
dictive powers of practice tasks is usually given 
through a table of norm probabilities. To show 
you how such a table is read, you have been sup- 
plied with one that was developed several years 
ago at the university. Of course, the practice and 
real tasks used at that time were considerably dií- 
ferent from the ones we are using today. 


The left-hand column indicates the subject's score 
on the practice task. The topmost row contains sev- 
eral percentile levels for the real task. The numbers 
in the body of the table indicate the probability of 
a person with a given practice score scoring at or 
above the percentiles in the top row on the real task. 

After answering any questions concerning the table, 
the experimenter proceeded to collect the subjects" 
booklets and, at the same time, made the following 
statement: 


For the purpose of this experiment, and to main- 
tain experimental control over the stimuli at- 
tended to, it will only be necessary to show you 
the line of probabilities which corresponds to the 
practice score you achieved. Therefore, my as- 
sistant will now carefully check your scores and 
insert, on the page following the table you just 
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looked at, the line which corresponds to your 
practice score. 


After handing the booklets to his assistant, the 
experimenter explained to the subject since it 
would take some time to check their papers, the 


experimenter had volunteered this time to a friend 


of his in the Sociology Department who was in the 
process of developing a questionnaire. The experi- 
menter then passed out copies of the questionnaire 
(actually the Rotter Internal-External, I-E, scale) 
and asked the subjects to fill them When all 
subjects had completed the sociology questionnaire, 


i asked the 
table of 


the experimenter returned the book! 
subjects to turn to the page followir 


norm probabilities. That page contained headings 
that were equivalent to those on the i-ble of norm 
probabilities; the word “score” appearec in the left- 


hand corner, and nine equally spaced deciles (10-90) 
followed across the page. A strip of paper, nsibly 
cut from a larger table, was taped to this skeleton 
form. It reported the subject's score and a series of 
nine probabilities. Each probability v cated di- 
rectly under one of the deciles. The «ts were 
asked to read their score and their ability of 
falling at or abo Lot the deciles on the real 
Hali of the subjects v Te randomly assigned to 
receive low probabilities, while the other half re- 
ceived high probabilities. The strip of low probabili- 
ties read as follows from 10th to 90th percentiles: 
71, 62, 51, 43, 36, 28, .14, .08, .03. The high- 
probability strip read as follows: 1.0, .98, .95, .94, 92, 
.90, .86, .81, .78. 

After examining their probability strips, the sub- 
jects were asked to answer the questions on the 
following page of the booklet. Subjects’ satisfaction 
with anticipated real task performance was assessed 
through the following questions: (a) Considering 
the importance of doing well and your probability of 
doing well, how satisfied are you with the present 
state of affairs? (1 = not satisfied at all, 10 = very 
satisfied). (b) Considering the importance of doing 
well and your probability of doing well, how pleased 
are you with the performance you think you 
achieve on the real task? (1 = not pleased, 10 = very 
pleased). (c) Considering the importance of doing 
well and your probability of doing well, how happy 
are you with the score you expect to attain on the 
real task? (1— not happy at all, 10 = very happy): 

The same procedures were followed in administer- 
ing the real task; that is, the experimenter read the 
instructions aloud while the subjects followed in 
their booklets, The instructions indicated that the 
subjects were to unscramble as many of 30 gra 
of four or five letters as they could in the allotte 


time. Their score was to be the total nuni 
items solved correctly. After asking for questio" 


nt to the 
r an 
for 


*' The I-E scale results are not releva: 
hypotheses under consideration in this pape 
will not be considered further (see Sobel, 1970, 
details). 
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‘experimenter allowed the subjects 90 seconds to 
Ik on the task and then asked them to count the 
of words they had unscrambled and place 
number at the ¢ the page. The experimenter 
collected all the booklets and gave them to his 
tant, explaining that his assistant would carefully 
ck their scores and insert, on the page following 
rected score and the percentile 
he real task was titled “Re- 
on Experimental Task” and 
ore" and “tile” listed side by 
inder these words were the 
b and the percentile numbers 
S or 93. Half of the subjects in each of the pre- 
tormance probability conditions were randomly 
ed to receive each percentile. On the following 
» subjects were asked to scale their postper- 
Atisfaction by answering the following 
a) How satisfied are you with your per- 
Mermance on the rez! task? (1 — not satisfied at all, 
= very satisfied). How satisfied are you with 
percentile you fell at on the real task? (1— not 
Müshed at all, 10 = very satisfied). 
The last two pages of the booklet contained sev- 
@al questions which required written answers. These 
ims sought to determine whether the subjects sus- 
that preperformance and postperformance 
fedback were independent of performance and that 
Me purpose of the experiment was to measure satis- 
tion based on this contrived information. 
: Total time for the experiment varied from 45 to 
aà minutes, depending upon the size of the group. 
all subjects were run in the same classroom, by the 
Sime experimenter and assistant. 
The entire experimental procedure contained six 
asic steps which may be summarized as follows: 
) The subject received a brief description of the 
tal task and indicated how important it was to him 
do well on it; (b) the subject worked on the 
tice task and received information concerning 
likelihood that he would do well on the real 
SR; (c) the subject indicated his satisfaction with 
3 anticipated performance on the real task (pre- 
Performance satisfaction) ; (d) the subject performed 
Teal task and received feedback concerning his 
ormance; (e) the subject indicated his satisfac- 
With his performance on the real task (post- 
ormance satisfaction) ; (f) the subject responded 
€ postexperimental questionnaire. 


stained the wor 
Gide under the title 
^s real task 


4 
tion 


subjects were eliminated from the analysis 
Mo they had performed exceedingly well or very 
3 ES the experimental tasks (scores higher than 
Raving ct than 4 on the practice and real tasks). 
Person Performed in such an extreme fashion, such 
fon’ Were thought unlikely to believe the bogus 
Fa mation they received from the experimenter. 
do but 14 of the remaining subjects were ran- 
m, assigned to the four treatment conditions 
pu by manipulating preperformance and post- 
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performance feedback. In order to equalize the num- 
ber of high- and low-valence subjects assigned to 
each feedback condition, the experimenter noted the 
valence scores of the last 14 subjects and allocated 
each to the treatment category with the smallest s. 
After following this procedure, it was necessary to 
randomly eliminate only 2 subjects to achieve equal 
Cell frequencies. Eighteen subjects fell into each of 
the eight cells reported in Table 1. 


RESULTS 
Preperformance Analyses 


Both the preperformance multiplicative and 
ratio models predict that with valence held 
constant, an increase in subjective probability 
produces an increase in Satisfaction. The 
multiplicative model also pe ces 
effect due to valence; as valence increases, - 
satisfaction is presumed to increase. On the — 
other hand, Steiner's ratio model predicts a — 
Valence X Subjective Probability interaction 
in which high valence and high subjective 
probability produce the highest satisfaction, 
while high valence and low subjective proba- 
bility produce the least satisfaction. In terms- 
of a factorial analysis, then, under conditions 
of high subjective probability, both models 
predict that high valence will produce greater 
satisfaction than low valence. However, under 
conditions of low subjective probability, the 
models make opposite predictions. Here, 
Steiner's model predicts that high valence will 
produce the least satisfaction, while the multi- 
plicative model predicts that low valence will 
produce the least satisfaction. 

Table 2 shows that when the sum of the 
scores of the three items designed to measure 
satisfaction with the preperformance state 
was the dependent variable, the expected main 
effect due to subjective probability was ob- 
tained; subjects who received the high-prob- 
ability treatment were more satisfied with 
their anticipated success on the real task than 
were subjects who received the low-probabil- 
ity treatment. While the main effect due to 
valence was not obtained, the Valence x Sub- 
jective Probability interaction predicted by 
the Steiner model did appear. For this, and 
for all subsequent factorial analyses, valence 
was dichotomized at the median scaled value. 
Low valence includes the scale numbers 1—6, 
and high valence includes the numbers 7-10. 


d 
E 


TABLE 2 


Cert Meaws AND ANALYMIS OF VARIANCE 
Scwwaxy Taste FOR THE SUM OF tHe 
Perresroewaxce SATISFACTION [reus 


Cell mesne 
Subjective probability 
Valence - ——— -— 
High | Low 
> High $ 20.3 12.1 
Low 187 15.0 
Tr Analysis 9€ variance Ce ay 
Source dj MS F 
Ee (probabit (B) lit 290.00 47.43* 
Subjective ility ,2901 4 
AXB 1| 18450| 6.78* 
140 27.20 
"p<. 


A post hoc analysis (Scheffé, 1959) of the 
components of the interaction revealed that 
under conditions of low subjective probabil- 
ity, high valence produced significantly less 
satisfaction than low valence (p < .05). This 
relationship supports Steiner’s predictions and 
clearly opposes the predictions of the multi- 
plicative model. Under conditions of high sub- 
jective probability, high valence produced 
more satisfaction than low valence, although 
this difference was not statistically reliable. 


Postperjormance Analyses 


The postperformance subtractive models of 
Ross and Zander (1957) and Morse (1953) 
both predict that when the strength of needs 
or level of tension within the cognitive system 
is held constant, an increase in environmental 
return produces an increase in satisfaction. 
Ross and Zander's simple subtractive model 
also predicts a main effect due to need 
strength; as need strength increases, satisfac- 
tion is presumed to decrease. On the other 
hand, Morse's model predicts a Tension Level 
X Environmental Return interaction in which 
high tension and high environmental return 
produce the highest satisfaction, while high 
tension and low environmental return pro- 
duce the least satisfaction, "Therefore, under 
conditions of low environmental return, both 
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subtractive model» predict that high need 


REED UN VEM lem satisfaction than lag 
need strength. However, unde: conditions of 
high environmental return, the models make 
opposite predictions. Here, the simple sub. 
tractive model predicts that low need strength 
will produce the most satisfaction, while 
Morse’s model predicts that high need 
strength will produce the highest satisfaction, 
For the postperformance analyses, the ini- 
tial measurement of the subjective importance 
of doing well on the real task is assumed to 
reflect the strength of needs to do well and the 
corresponding level of cognitive : 
3 shows that the predicted mai: 
environmental return was produced wh 
item which measured satisfaction wi 
task performance was used as the ( 
variable; subjects who received high pel 
tile feedback were more satisfied with 
performance than subjects who received low 
percentile feedback. While the nain effect due 
to need strength predicted by th= simple sub- 
tractive model was not obtained, the Tension 
Level X Environmental Return interaction 
TABLE 3 


CELL MEANS AND ANALYSIS OF VARIANCE 
TABLE FOR "SATISFACTION 
REAL TASK PERFORMANCE" 


Cell means 


Subjective probability 


High Low 


Valence 
Degree of environmental return 
nn 
High Low High Low 3 
NES 
High 7.7 3.3 79 $1 | 
w 7.7 3.8 7.0 39 | 
Analysis of variance y 
Source df MS E 
Meine (A) 1 P 
ubjective probability (B) 1 38 | 
Environmental return (C) 1| 596.17 | 167.39% 
AXB al .56 i 
AXC 1 11.67 
BXC 1 AT 
AXBXC 1 3.67 
Error 136 3.56 


rely failed to reach the 
statistical significance; 
sh environmental return 
satisfaction, while high 
ivironmental return pro- 
isfaction. 
vs that the previously es- 
ibjective probability did 
rmance satisfaction, There- 
lissonance theory nor ela- 
potheses received support. 
; that all the predictions of 
ere supported by the item 
atisfaction with the percen- 
1e subject was reported to have 
real task. A huge main effect 
mental return was produced, 
ed Tension Level X Environ- 
nteraction also reached a 
enificance. Furthermore, a 
revealed that when the sub- 
lity was held constant, all pair- 
ng means were significant 
\s was predicted by both sub- 
inder conditions of low en- 
ronmental return, a high level of tension 
sfaction than a low tension 
tions of high environmental 
, high tension produced significantly 
Bore satisfaction than low tension. This re- 
tionship supports Morse’s predictions and 
tarly opposes the predictions of the simple 
active model. 
4 Once again, the previously established level 
0 subjective probability had no effect on 
Wstperformance satisfaction. 


level of 
xt hoc anal 


Fostexperimental Questions 


An analysis of the responses to the ques- 
l5 asked at the conclusion of the experiment 
aled that only 10% of the subjects, when 
Honed about the possibility of deception, 
Hentioned the possibility of having been de- 
E *d with respect to preperformance infor- 

maton, while only 8% cited postperformance 
"dback. No subject indicated that the pur- 
of the experiment was to manipulate all 
edback and to measure subsequent satisfac- 
On. Although the subjects who expected to 
"or who did fail tended to be somewhat 
fe suspicious when questioned, negative 
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Tastx 4 


Cet, Means AND ANALYSIS Of VaxiANCE SOWMARY 
Taste ror "SATISFACTION. WITM 
FALLEN AT ON Tux ReaL Task” 


Cell means 
| Subjective probability 
I 
iode msi Sa = 
| High | Low 
Valence |———— _ — 
Degree of environmental retum 
High | Low High Low 
High 53^ [28 88 29 
Low 7.3 | 4.2 65 40 
Analysis of variance 
Source | a | m- oe 
Valence (A) 1 1.78 
Subjective probability (B) 1 ll 
Environmental return (C) 1 | 65025 | 207.21* 
AXB 1 6. 
AXC 1 75.11 23.93* 
BXC pv M 
AXBXC | 1 235 
Error | 136 3.14 
*p «01. 


responses were fairly equally distributed over 
the experimental conditions. These results dis- 
count the possibility of demand characteristics 
as an explanation of the data. 


Discussion 


The results clearly demonstrate the superi- 
ority of models which consider satisfaction to 
be a function of the amount of outcome ex- 
pected or received in relation to the absolute 
magnitude of the individual’s desires or needs. 
The fact that high valence coupled with low 
subjective probability produced less satisfac- 
tion than low valence coupled with low subjec- 
tive probability supports the predictions de- 
rived from Steiner’s model, rather than those 
derived from the simple multiplicative model. 
In the postperformance situation, the fact that 
high tension coupled with high environmental 
return produced more satisfaction than low 
tension coupled with high environmental re- 
turn supports the predictions derived from 
Morse’s model, rather than those derived from 
the simple subtractive model. 
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Since the general principles underlying 
Steiner’s and Morse’s models are quite simi- 
lar, it is possible to generalize each model to 
include the situations covered by the other. 
Steiner’s model may be generalized to in- 
clude postperformance situations by replacing 
the term which represents subjective proba- 
bility with one which represents the degree to 
which an outcome has been obtained (i.e., 
environmental return). The ratio model 
would then appear as follows: 


Satisfaction = 
E Valence X Environmental Return — Costs 
XValences 


This model asserts that satisfaction is a func- 
tion of the proportion of desired gain which 
an individual actually receives. Likewise, 
Morse's model may be generalized to include 
preperformance situations by replacing the 
term which represents environmental return 
by one which represents subjective probabil- 
ity. The complex subtractive model would 
then appear as follows: 


Satisfaction = 
Subjective Probability 
— (T; — Subjective Probability) = 
2 Subjective Probability — Tı. 


This model contends that satisfaction is a 
function of the amount of tension expected to 
be remaining minus the amount of tension 
expected to be reduced. The generalized ver- 
sion of Steiner’s model would make the same 
relative predictions as the Morse model, while 
the generalized version of Morse’s model 
would make the same relative predictions as 
the Steiner model. However, the exact nu- 
merical predictions of the two models for any 
given cognitive state will be different. The 
present study tested predictions only in terms 
of relative satisfaction among treatment con- 
ditions; the data do not permit tests of the 
comparative accuracies of the two models. 
On a theoretical level, Steiner’s model may 
have one important advantage over Morse’s. 
Morse (1953) has noted that it is quite diffi- 
cult for her model to predict accurately the 
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satisfaction of low-need individusls. She sug- 
gested that satisfaction for such persons may 
be more a function of their “general psycho- 
logical condition [p. 38]” than of the needs 
under consideration. To the « t that this 
general psychological conditio ludes situ- 
ational outcomes other than the ones under 


specific consideration, Steiner's model has a 


definite advantage. The ratio model explicitly 
considers all outcomes involved any situa- 
tion; the valences and subjective probabilities 
of all outcomes are summed in numerator, 
and the valences of all outcon e summed 
in the denominator. Except when the valence 
of a single outcome dominates the ratio, all 


aspects of the situation contribute to the 
total satisíaction, the effect of each outcome 
being weighted by the size of its valence term. 

It is clear that the amount oí variance 


accounted for by the Valence Subjective 
Probability interactions, and by the Tension 
x Environmental Return interactions, was 
quite small, both in absolute magnitude and 
in relation to that accounted for by the 
main effects of subjective probability and 
environmental return. The variance controlled 
by these interactions ranged fro .02 to .04, 
while the main effects accounted ‘or from .17 
to .55 of the variance. There are several pos- 


sible reasons for these differences. First, the 
experimental situation did not include any 
objective source of reward for success on the 
real task. Therefore, the importance of doing 
well was based upon purely subjective Con- 
siderations, The range of true valence in this 
laboratory setting was probably not very 
great. Indeed, in any given situation, there 
may be a norm which sets acceptable bound- 
aries for the valence dimension. Thus, al- 
though valence may vary widely from situa- 
tion to situation, its range may be attenuate 
in any one setting. On the other hand, sub- 
jective probability and environmental reug 
may be less subject to such norms. In a given 
situation, some individuals may expect an 
receive a high proportion of what they want 
while others expect and receive Very Tug 

'The results of the present study failed t0 
uncover any relationship between preperfor™ 
ance subjective probabilities and satisfaction 
with postperformance environmental retum: 
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predictions of disssonance theory 


oí the elation-depression hypothe- 
not be compared. One difference 
the procedures followed in this ex- 
t and ‘hose of previous studies which 


» these issues is that subjects 
y were asked to commit them- 
ence at the very outset of the 
sion. Thus, it may be assumed 

id not vary freely during the 
experiment, as it might have 
such commitment been made. It 
hat the relationship between pre- 
mance states and postperformance affect 


n either dissonance theory or the elation- 
iion 5;pothesis has led us to suspect. 
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A study conducted in 1963 demonstrated that 10 common color names possessc 


highly similar connotations among Caucasian and Negro college students, 


white viewed as “good,” “weak,” and 


itr 


“active,” and black viewed as relative! 


“bad,” “strong,” and “passive.” The present study was conducted in 1969 t 


determine whether the development o 


middle and late 1960s was paralleled by changes in the connotative meaning o 


f the “black identity" movement in the 


black and white. It was found that for Negro subjects, black had become more 
positive (good) and more active, while white had become less positive and less 


active, with these changes being most 


pronounced among those students mos! 


strongly committed to black separatism. Among the Caucasian subjects tested, 
no changes were found in the meanings of black, white, or any other color nam: 
across the 6-year period. The differences between the two racial groups ari 


interpreted in terms of the differential 
the two groups tested. 


Color names have long been used in Ameri- 
can culture to convey connotative or affective 
meanings. In particular, the color name white 
has been used in religion, literature, and the 
mass media to symbolize “goodness,” while 
black has been used to symbolize “badness” 
(Isaacs, 1963; Jordan, 1968). The general 
effect of this practice was documented in the 
findings of a study conducted in 1963 (Wil- 
liams, 1964), in which college students rated 
10 color names, presented in a nonracial con- 
text, on the semantic differential (Osgood, 
Suci, & Tannenbaum, 1957). The results 
of the study indicated highly similar findings 
for both Caucasian and Negro students, with 
the color name white being rated as good, 


1 This paper is based on a cooperative study by 
the first and second authors which was supported in 
part by Grant HD-2821 from the National Institute 
of Child Health and Human Development. The 
study served as the basis of the second author's 
master's thesis at the University of North Carolina, 
Greensboro, which was directed by the third author. 
The authors wish to express appreciation to Alfonso 
E. Gore, Donald L. Jones, Clarence Smith, and Allen 
McMurtry, for their assistance in data collection. 

2 Requests for reprints should be sent to John E. 
Williams, Department of Psychology, Winston Hall, 
Wake Forest University, Winston-Salem, North 
Carolina 27109. 


effect of the black identity movement o: 


weak, and active, while the color name black 
was rated as relatively bad, strony, and pas- 
sive. It was concluded that as of the time of 
the study, members of both racial groups 
were responding to the general cultural mean- 
ings of these words and that participation in 
the Negro subculture had little demonstrable 
effect upon the responses of the Negro stu- 
dents. 

The similarity of the findings just de- | 
scribed was somewhat surprising in view of 
the long-standing custom of using color names 
to refer to racial groups, with Caucasians 
called “white” and Negroes called “black.” 
How could the Negro subjects give a rela- 
tively negative rating to the color name by 
which they were often designated? A possible 
explanation was that as of 1963, Negro per- 
sons did not accept the “color code” and did 
not think of themselves as black persons. 
This idea receives some support from a study 
conducted in 1964 (Williams, 1966), in which 
Negro subjects rated the concept black person 
much more negatively than the concept 
Negro. In terms of semantic distance, the 
concept Negro was much closer to the concept 
brown person than to black person. Thus, it 
seems reasonable to conclude that the Negt? 
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263 study did not think of 
ick, and, hence, their re- 
sponses to the word black reflected only the 
general cultural meaning of the term. 

If the foregoing inferences are correct, the 
developments of the mid- and late 1960s re- 
flect a rather d tic change in the willing- 
ness of Negro Americans to be designated as 
black. Under impetus of a heightened 


subjects in the 1 
themselves a: 


need for a sense of racial identity, and fol- 
lowing the lead of the Black Muslim move- 
ment of the early 1960s (Lincoln, 1961), a 


new racial identity movement developed, with 
the designation black used as a rallying point. 
This choice of the term black in preference 


to others (e.g., Afro-American) seems due, at 


least in part, to its being opposite to the name 
(white) by which the majority group was 
designated, thus making it particularly effec- 
tive as an identity term. Serving to counter- 


act the "bad" and “passive” connotations of 
black were such slogans as “Black is Beauti- 
ful” and “Black Power" (Carmichael & Ham- 
ilton, 1967). In this way the term black has 
become acceptable to many Negro Americans 
and is often preferred to the term Negro, 
which is viewed as old fashioned, at best, and 
insulting, at worst. In addition to the accep- 
tance of black, the rhetoric of the new iden- 
tity movement involved a rejection of “white- 
- hess,” which was associated with the general 
culture and with the oppression of Negro per- 
Sons. In the imagery of Fanon (1967), the 
Negro was urged to throw off his “white 
Mask,” and to assert his “true blackness.” 
The acceptance of black and the rejection 
of white provides a somewhat novel oppor- 
tunity to study the effects of a deliberate 
effort to modify general cultural symbols by 
associating them with subcultural identity. To 
What extent has this effort altered the conno- 
tative meanings of black and white to Negro 
persons and to Caucasian persons? The pres- 
ent study was designed to provide evidence 
bearing on these questions by a 1969 replica- 
tion of the 1963 study (Williams, 1964). In 
addition to the replication proper, a pro- 
cedure was added to assess the relation of 
Connotative meanings of color names for Ne- 
STO subjects to their ideological commitment 
to black separatism. Based on the reasoning 
Outlined above, the following predictions were 
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made. Among Negro subjects: (a) the color 
name black would be rated more positive 
(good) and more active than in 1963, while 
maintaining its connotation of strength; (b) 
the color name white would be rated less posi- 
tive and perhaps less active, while retaining 
its connotation of weakness; (c) the pre- 
dicted changes would be most pronounced in 
students who identify most strongly with the 
black separatist movement. Among Caucasian 
subjects, it was predicted that the conno- 
tative meanings of color names would be 
found to be essentially the same as in 1963. 


METHOD 
Subjects 


Negro subjects were 239 students (135 male, 104 
female) at North Carolina Agricultural and Tech- 
nical State University, from whom data were col- 
lected during the spring term of 1969 in the form of 
an “opinion survey,” conducted in conjunction with 
humanities classes, required of all nonfreshman stu- 
dents. 

Caucasian subjects were 99 students (50 male, 49 
female) at Wake Forest University, who participated 
during the fall term of 1968 in order to meet re- 
search participation requirements in introductory 
psychology. All subjects were in their second, third, 
or fourth year of college work. Although these sub- 
jects participated during late 1968, they are desig- 
nated below as a “1969” group to facilitate com- 
munication of the findings. 


Instruments 


Semantic diferential. The semantic differential 
employed with both Negro and Caucasian subjects 
was identical to that used in the 1963 study (Wil- 
liams, 1964). Subjects rated each of 10 color names 
on 12 scales. The following color names were pre- 
sented, in random order: black, white, brown, yel- 
low, red, blue, green, purple, orange, and gray. Each 
color name was printed in black ink at the top ofa 
separate page of white paper, with the 12 rating 
scales mimeographed on the page below. The twelve 
7-point scales were defined by the following pairs of 
adjectives: (a) unpleasant—pleasant; (b) strong- 
weak; (c) good-bad; (d) slow-fast; (e) foul- 
fragrant; (f) small-large; (g) clean-dirty; (4) dull- 
sharp; () nice-awful; (j) active-passive; (k) 
sacred-profane; (1) heavy-light. Scales a, c, €, 8, i, 
and k represented the evaluation factor; Pairs b, i 
and I represented the potency factor; and Scales d, 
h, and j represented the activity factor. 

Each subject’s semantic differential ratings of the 
10 color names were scored in standard fashion 
(Williams, 1964) by assigning the digits 1-7 to the 
7 positions on each rating scale, summing the rat- 
ings on the appropriate scales for the factors of 
evaluation, potency, and activity, and then dividing 
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the factor sum by the number of scales to return 
the data to the standard 1-7-point range. Low 
scores represented the good end of the evaluation 
dimension, the “weak” end of the potency dimen- 
sion, and the "passive" end of the activity dimen- 
sion. The data for analysis consisted of each sub- 
ject's evaluation, potency, and activity scores for 
each of the 10 color names. 

Black Separatism scale. This scale, devised spe- 
cifically for this study, was aimed at assessing the 
degree of ideological commitment to black separa- 
tism among the Negro subjects by asking them to 
which of six national civil rights organizations they 
would like to belong. In the development of the 
scale, a group of 10 Negro students and faculty had 
served as judges and had showed 10076 agreement 
in ranking the six organizations, from most separa- 
tist to least separatist, as follows: Black Panther 
Party (BPP), Student Non-Violent Coordinating 
Committee (SNCC), Congress of Racial Equality 
(CORE), Southern Christian Leadership Conference 
(SCLC), National Association for the Advancement 
of Colored People (NAACP), and Urban League 
(UL). The scale presented the Negro subjects with 
an alphabetical list of the six organizations, with the 
request that they consider to which organizations 
they would like to belong and indicate their first, 
and if they wished, second and third choices. 

The following procedure was used in determining 
à black separatism score for each Negro subject, 
based on his first and second choices of civil rights 
organizations to which he would like to belong. The 
six organizations were assigned scores at equal in- 
tervals through the range 1-21, as follows: UL, 1; 
NAACP, 5; SCLC, 9; CORE, 13; SNCC, 17; BPP, 
21, The score for a particular subject was determined 
by first assigning him the score belonging to the 
organization which he had ranked in the first posi- 
tion. The second choice of organization was then 
considered: if this choice was a more separatist or- 
ganization, 1 point was added to the subject's score ; 
if it was less separatist, 1 point was subtracted, Sub- 
jects who indicated only a first choice were given 
the score of that organization; subjects who indi- 
cated two or more organizations without indicating 
their rank were assigning the mean score of the 
organizations indicated. 


Procedure 


Negro subjects. Subjects were tested in their 
regular humanities class sections. After an intro- 
duction to the opinion survey by a Negro professor 
from the Education Department, the procedure was 
conducted by one of two Negro experimenters, both 
of whom were advanced social science students at 
the same institution. 

To reduce the probability of a racial set toward 
the color ratings, the semantic differential procedure 
was presented first, employing the same instructions 
as were used in the earlier study (Williams, 1964), 
with the added feature that the general instructions 
were repeated using the color name turquoise as an 
example. Any reference to color as it relates to race 
was carefully avoided. 
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Following the semantic differential, the subjects 
were administered a modified version of the Stenfors 
and Woodmansee (1968) Black Power Scale, which 
asks the subject to respond "yes" or “no” to 26 
statements dealing with Negro attitudes toward inte- 
gration and civil rights. (These data were collected 


for use in a separate study and are not reported 
here. The procedure is noted, however, because of 
its possible sensitizing effects on the block separatism 
data which were collected next.) The third phase of 


nstructions 
which the 
zations as 


the procedure involved a set of writter 
for the Black Separatism scale, follo 
subject indicated his choices of orga 
indicated above. 


Subjects were not asked to indica‘e (heir names, 
but were asked to mark the three of test ma- 
terials with a code number. Subjects were requested 
not to discuss the study with other students who 
were yet to be tested. Due to the mass testing 
schedule, there was little opportunity ‘or students to 
communicate; however, all sections of students after 
the first were asked to mark their papers if they 
had prior knowledge concerning the study. The five 
papers so marked were eliminated. 

Caucasian subjects. Subjects were tesicd in groups 
by a psychology professor and his graduate as- 
sistant, both of whom were Caucasian. The semantic 


differential was administered following the standard 
instructions which had been employed in the earlier 
study (Williams, 1964). Any reference to color as it 
relates to race was carefully avoided. Subjects noted 


their sex on the test booklets, but e not asked 
to identify their test booklets by ne. They were 
asked to refrain from discussing | tudy with 


other students. 


RESULTS * 


The principal method of data analysis was 
Lindquist’s (1953) Type I analysis of vari- 
ance, a two-factor mixed design, with one 
within-subjects variable and one between- 
subjects variable. Separate analyses of vari- 
ance were conducted for evaluation, potency, 
and activity scores. Separate analyses were 
also conducted for the five “race-related” 
color names (black, white, red, brown, yel- 
low) and for the five control color names 
(blue, gray, green, orange, purple). In a par- 
ticular analysis, the within-subjects variables 
consisted of the five individual color names, 
while the between-subjects variable consist 
of the two groups being compared (Caucasian 
versus Negro; 1969 subjects versus 1963 sub- 
jects, etc.). In this design, a differential rè 
sponse by the two groups to certain color 


' Due to space limitations, only the major int 
ings of the study are reported here Readers, p^ 
terested in more detailed analyses are referre 


Tucker (1969). 
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TABLE 1 
MEAN SEMANTIC DIFFERENTIAL SCORES FOR 10 COLOR NAMES ON THREE FACTORS BY 
CASIANS IN 1963 (n = 116), CaucasrANs In 1969 (m = 99), NEGROES IN 
1963 (n = 110), AND NEGROES IN 1969 (n = 208) 
| Evaluation Potency Activity 
Color 
| Cau- Cau- Negro | Negro Cau- Cau- Ne Cau- | Cau- 
lan asi si: i gro | Negro j 3 N N 
| L cigog | 1963 | 1959 | S963 | “ggg | 1993 | 1969 | foes | Fogg" 1965 | 1969 
Race-related | 
Black 5.36 | 411 | 3.38 | 5.98 | 6.05 | 5.70 | 5.84 | 3.31 | 3.38 | 3.63 | 4.37 
White | 1.83 | 2.05 | 2.71 | 3.60 | 3.19 | 3.52 | 3.11 | 4.75 | 4.56 | 5.10 | 4.23 
Red 3.27 3.08 | 3.52 5.58 9:63.11:5:19 0115/25 6.23 | 6.24 | 5.77 | 5.89 
Brown 4.72 | 3.82 | 348 | 4.95 | 5.10 | 492 | 4.08 | 2.74 | 2.78 | 3.51 | 3.49 
Yellow | 2 2.58 | 2.52 | 247 | 3.21 | 3.07 | 324 | 3.13 | 4.99 | 496 | 5.00 | 4.71 
Non-race-related| 
(control) 
Blue ^12 | 248 | 2.28 | 2.22 | 4.05 | 420 | 433 | 446 | 4.50 | 466 | 476 | 4.86 
Gray | 450 | 480 | 423 | 3.99 | 3.68 | 4.11 | 3.90 | 3.51 | 243 | 2.63 | 3.00 | 2.78 
Green 2.833 | 2.40 | 2.78 | 2.60 | 4.56 | 445 | 439 | 442 | 5.00 | 4.82 | 4.62 | 4.87 
Orange | $29 | 3.08 | 3.05 | 291 | 451 | 472 | 411 | 4.37 | 5.16 | 5.60 | 4.79 | 5.35 
Purple 3.17 | 3.44 | 4.00 | 3.58 | 4.95 | 5.22 | 4.87 | 4.80 | 3.97 | 425 | 3.43 | 420 
=> I E 
Note.—Data for 1963 Caucasians and 1963 Negroesarefrom Williams (1964). 


names (but not to all) would be reflected in 
a significant interaction effect. When signifi- 
cant effects were found, ¢ tests, computed by 
Lindquist’s (1933, p. 93) critical difference 
method, were used to determine the source(s) 
of the difíerence(s). Unless otherwise indi- 
cated, the following analyses were of the type 
- Just described, with statistical effects not be- 
ing considered significant unless the .01 level 
was reached, 


Connotations of Color Names: Present Study 
- and 1963 Study 


Data from four groups of subjects were 
analyzed: Caucasian (x — 99) and Negro 
(n= 208) subjects from the present study 
(31 Negro males were randomly eliminated 
to equalize sex representation) and Cauca- 
San (n= 186) and Negro (n= 110) sub- 
Jects from the 1963 study. Each group's mean 
evaluation, potency, and activity scores are 
Presented in Table 1. 

Caucasian data; 1969 versus 1963. Sepa- 
Tate analyses of the evaluation, potency, and 
activity scores for the five race-related color 
names between the two Caucasian groups re- 
Vealed highly significant main effects of color 
names, with no significant groups or inter- 
action effects. The same results were found 
a the three analyses of the control color 
E There was, thus, no evidence of any 

ange in the connotative meanings of race- 


related or control color names among Cauca- 
sian subjects across the time interval studied. 

Negro data: 1969 versus 1963. 1. Race- 
related color names. The analysis of the 
evaluation scores in the two Negro groups 
revealed a significant interaction effect which 
was traced to the fact that the 1969 group 
gave white a significantly higher rating 
(less good) and black a significantly lower 
rating (more good) than did the 1963 group. 
In spite of these relative shifts, however, the 
1969 Negro subjects still rated white signifi- 
cantly lower (more good) than black. Thus, 
while significant changes were found in the 
evaluative meanings of white and black across 
the time interval studied, their position, 
relative to one another, remained the same. 

The analysis of the potency scores for the 
two Negro groups revealed a significant main 
effect of color names, with nonsignificant 
groups and interaction effects. Thus, there 
was no evidence of change in the potency 
connotations of the race-related color names. 

When the activity scores for the two Negro 
groups were analyzed, a significant interaction 
was found. This effect was attributable to a 
significant drop in the 1969 group in the 
activity connotation of white and a significant 
increase in the activity connotations of black, 
with the result that the 1969 group did not 
rate the two color concepts differently. 


2. Non-race-related color names. The 
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analysis of the mean evaluation scores for 
the five colors revealed significant main ef- 
fects of color names and groups, but no sig- 
nificant interaction effect. The groups effect 
was due to a slight, but consistent, shift 
toward lower (more good) evaluation scores 
for all five color names, with the rank order 
of color names in the 1969 group remaining 
the same as in the 1963 group. The potency 
score analysis revealed the usual significant 
main effect of color names, but no significant 
groups or interaction effects. The activity 
score analysis revealed a significant inter- 
action which was traced to the more active 
rating of orange and purple by the 1969 


group. 
Present study data: Caucasian versus 
Negro. 1. Race-related color names. The 


evaluation score analysis for the 1969 Cauca- 
sian and Negro subjects revealed a signifi- 
cant interaction effect which was found to 
be due to the Negro subject’s lower (more 
good) rating of the color names brown and 
black and higher (less good) rating of the 
color name white. Another way to describe 
the findings is to note that in the Caucasian 
group, all differences between adjacent means 
in the rank order white, yellow, red, brown, 
and black were significant, while in the Negro 
group, white and yellow formed one cluster, 
while black, brown, and red formed a second 
cluster, with significant differences between 
the two groupings, but nonsignificant differ- 
ences within each grouping. 

The analysis of the potency scores in the 
1969 Negro and Caucasian groups revealed 
the usual significant main effect for color 
names, with the groups and interaction effects 
found to be nonsignificant. 

When the activity scores of the two groups 
were analyzed, a significant interaction effect 
was found. This effect was traced to the 
higher activity scores of black and the lower 
activity scores of white among Negro subjects 
as compared to Caucasian subjects. Another 
way to describe this effect is to note that 
while Caucasian subjects rated white as more 
active than black, the Negro subjects did 
not rate the two concepts differently. 

2. Non-race-related color names. The 
analysis of the mean evaluation scores for the 
1969 Caucasian and Negro subjects revealed 
a significant interaction which was traced to 
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the significantly more positive rating given 
to gray by the Negro subjects. However, the 
overall similarity in the ratings of the two 
groups was quite high, with the means from 
both groups falling in the same rank order 


ind between 


and with significant difference 


successive means. A significa nteraction 
was found in the analysis of the mean 
potency scores which was du the Negro 
subjects rating gray significantly weaker than 
did the Caucasian subject: > activity 
score analysis revealed the expected signifi- 
cant main effect of color but no 
significant groups or interaction effects. 
Relationship of Color Meanings to Black 
Separatism 

On the basis of their black separatism 
scores, Negro subjects were med to one 
of three groups: Group 1, scores = 1-6, pri- 


marily NAACP oriented; Group 2, scores = 
7-12, primarily SCLC oriented; and Group 3, 
scores = 13-21, primarily SNCC oriented. 
After the scores of all 239 subjects were 
grouped as indicated, 23 males were ran- 
domly eliminated from Group 3 in order to 
more evenly balance the sex distribution in 
each group, leaving a total group of 216 sub- 
jects, with 92 falling in Group 1, 77 in 
Group 2, and 47 in Group 3. ; 
An inspection of the mean evaluation, 
potency, and activity color names scores for 
the three separatism groups indicated that 
the color ratings in Groups 1 and 2 were 
quite similar, This impression was confirmed 
by analyses which compared Groups 1 and 2 
and revealed no significant groups or inter- 
action effects for evaluation, potency, 4n 
activity scores for either race-related or non- 
race-related colors. Groups 1 and 2 were, 
thus, combined into a single group (# = 169); 
designated “majority group,” for comparison 
with Group 3 (n= 47), designated “high | 
separatist group.” The mean evaluation 
potency, and activity scores for the high- 
separatist and majority groups are present 
in Table 2. : 
Race-related color names. The evaluation 
score analysis revealed a significant inter- 
action, which was found to be attributable 
primarily to the two groups’ differential 1e- 
sponse to white, which the high separatis? 
rated significantly higher (less good) tha? 
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TABLE 2 
SEMANTIC DIFFERENTIAL Scores FOR 10 Coror Naes AMONG 1969 NEGRO 
Susjects DIVIDED INTO THE Majorrry Group (n = 169) AND 
Hicu-Seraratist Group (n = 47) 
Evaluation Potency Activity 
Majority High- Majority High- + ite High- 
llic 40 do e 
Race-relat ec | | | | 
Black 3.47 317 | | S8L .| . 609 4.13 5.14 
White 2.50 341. | 320 2.68 4.30 3.64 
Red 3.53 3.74 5.30 5.26 5.83 6.00 
Brown 3.64 3.34 4.93 5.35 3.39 3.71 
Yellow 2.38 2.84 3.22 2.87 4.83 441 
Non-race-r« ntrob) | 
Blue 2.21 2.23 4.45 449 4.91 4.78 
Gray 3.94 4.19 3.56 3.87 2.76 3.00 
Green 2.65 2.71 444 441 4.89 4.60 
l Orange 2.87 3.23 4.45 4.29 5.34 5.24 
Purple 3.67 3.39 4.80 481 4.04 451 
did the majority group. The lower (more Color Meanings in 1969 Negro Majority 
good) ratii lack by the high-separatist Group versus 1963 Total Negro Group 
pu was not statistically significant from The finding that the 1969 high-separatism 
E A group ume When aoe and majority groups were responding differ- 
E. nd p j; mee and black within e t E ently to white and black suggests that the 
Bs d Eroups we compared, it was found overall differences in the responses of the 
| E ; 18) OU eee rated white aw 1963 and 1969 Negro groups might be at- 
y lower (more good) than they did tributable solely to the ratings of the minor- 


black, while the high-separatist subjects did 
Not rate the two concepts differently. 

The analysis of the potency scores for high- 
Separatist and majority subjects revealed a 
Significant interaction. While the means in 
Table 2 suggest that the groups were re- 
Sponding differently to four of the five colors 
(white and yellow appear weaker, and brown 
and black appear stronger in the high- 

Separatist group), the only difference to reach 
statistical significance was that for white. 
, The activity score analysis also revealed a 
Significant interaction, with the high sepa- 
Hüsts rating white as significantly more 
Passive and black as significantly more active 
than did the majority subjects, who did not 
Tate the two concepts differently. 

| Non-race-related color names. The analy- 
5 9f evaluation, potency, and activity scores 
E the high-separatist and majority groups 
3 ealed the expected significant main effects 

Color names, with monsignificant groups 
and interaction effects. Thus, the non-race- 
"dated color names were responded to in the 


Same 
Bic ner by both groups of Negro 


ity (approximately one-fifth) of the total 
1969 group which were classified as high 
separatist. It was considered of interest to 
determine whether the meanings of white and 
black had been altered among the more 
typical Negro college students who composed 
the 1969 majority group. This question was 
explored by comparing the white and black 
ratings of the majority group (N = 169) with 
the comparable ratings for the total 1963 
Negro group (N = 110). The results indi- 
cated that the 1969 majority group rated 
black as more positive (good), and white 
as less active, than did the total 1963 group. 
Thus, at least some of the 1963-1969 changes 
in the meanings of white and black seem 
not to be attributable solely to the more 
separatist Negro students, but, rather, seem 
pervasive in the total Negro college student 
group. 
DISCUSSION 

The results of the study indicate that as 
predicted, the connotative meanings of the 
color names black and white among Negro 
college students were altered during the years 
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1963-1969: black became more positive 
(good) and more active, with its connota- 
tions of strength remaining unchanged; 
white became less positive and less active, 
with no change in its connotations of weak- 
ness. The counterpart changes in the evalu- 
ative and activity meanings of black and 
white suggest that both concepts are impor- 
tantly involved in the dynamics of the new 
black identity movement. This interpretation 
is strengthened by the finding that the most 
dramatic differences in the meanings of black 
and white were found among those Negro 
students who were most strongly committed 
to the black separatist movement. 

The fact that no significant changes were 
found in the Negro students’ responses to 
the other three race-related color names 
(yellow, red, and brown) provides evidence 
of the selective effect of the black identity 
movement upon the color names black and 
white. The general finding of little change 
in the meanings of the non-race-related con- 
trol colors among Negro students provides 
additional support for this interpretation. 

The results for the Caucasian students in- 
dicate that as predicted, there had been no 
changes in the connotative meanings of black, 
white, or any other color name across the 
6-year period. The events of the years 1963— 

1969, including the development of the black 
identity movement, appear not to have al- 
tered the general Caucasian view of black as 
being bad, strong, and passive, and white as 
being good, weak, and active. There were, 
of course, individual differences in the re- 
sponses of Caucasian subjects to these color 
names, with some Caucasian ratings being 
quite similar to the 1969 Negro ratings. On 
the other hand, the data provided no indica- 
tion that the percentage of Caucasians 
making such ratings had increased since 
1963. Thus, it would appear that as of 1969, 
the impact of the black identity movement 
upon color meanings was restricted to the 
Negro subculture and that the movement had 
had no appreciable effect upon the meanings 
of color names to Caucasian persons. 

It would be of interest to know more about 
the mechanism(s) responsible for the altered 
meanings of black and white to the Negro 
subjects in the present study. These findings 
could be a reflection of general changes in 
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the meanings of these terms t 
sons; for example, the concep: 


Negro per- 
black has 


become generally more positive and active, 
On the other hand, the findings could be due 
to an increase in the percentage of Negro 
subjects who interpret black to mean black 


person and, as a result, rate the concept as 
more positive and active, thus altering the 


group means in these directions. In this 
case, it is possible that the nonracial mean- 
ings of black (and white) to Negro persons 


may not have changed across the time inter- 
val studied. While these alternative hypothe- 
ses cannot be tested in the context of the 
present study, it can be noted that the 1963- 
1969 changes in the meaning of black served 
to move the meaning of this concept closer 
to the meaning of the general concept person 
(Williams, 1966). The data of the present 
study also seem to imply that the meaning 
of the concept black person has undergone 
sizable changes in recent years, since the 1969 
evaluative rating of black was substantially 


more positive than the 1964 rating of the 
black person (Williams, 1966). 
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c attractive than dissimilar others 


tion was greater for value-expressive 


regarding O's attitude direction, 


According to attribution theory (Heider, 
1958; Jones & Davis, 1965; Kelly, 1967), in- 
dividuals tend to attribute enduring motives 
to others on the basis of others’ behavior. For 
txample, another person (O) might state an 
attitude, and the first person (P) might infer, 
on the basis of that attitude and the condi- 

tions under which it was stated, the reason 
why O holds it (Horowitz & Pastore, 1955; 
Pastore & Horowitz, 1955). It is proposed 
here that P's attraction to O is influenced 
- mot only by that stated attitude of O, but also 
by the imputed reason for O holding that 
attitude. The present study was concerned 
With attitude similarity-dissimilarity as a basis 
for Motive attribution and with the way in 
Which interpersonal attraction is affected by 
knowledge of both O's attitude (ie., similar 
attitudes lead to attraction) and his reason for 
holding it (ie, certain motivational bases 
lead to attraction). 

A particular attitude may be held for any 
umber of reasons, A number of theorists 
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Mesa tees for reprints should be sent to Abraham 

c Pipe of Psychology, University of 
: Athens, Georgia 30601. 


"PYTUDE BASE AS A MODERATOR OF THE ATTITUDE 
SIMILARITY-ATTRACTION RELATIONSHIP ' 


THOMAS R. BATCHELOR axp ABRAHAM TESSER * 


University of Georgia 


order to assess the effects on interpersonal attraction of both attitude simi- 
y and the reason an attitude is held, 40 subjects were given information 
others which included other's attitude on a topic (similar-dissimilar) and 
her's reason for holding the attitude (value expressive, utilitarian, need for 
nition, and ego defensive). As predicted, attitudinally similar others were 


(p <01); others with value-expressive- 


based attitudes were most attractive, and others with ego-defensive-based atti- ` 
were least attractive (p < .01); the effect of attitude similarity on attrac- 
-based attitudes than for ego-defensive- 
based attitudes (p < .05). It was found further that attitude direction (simi- 
I ssimilar presented alone) significantly influenced the inferences made 
ling the reason the other's (O's) attitude was held and that the reason 
the attitude was held (presented alone) significantly influenced the inferences 


(e.g., Byrne & Clore, 1967; Cohen, Stotland, 
& Wolfe, 1955; Katz & Stotland, 1959; Sar- 
noff, Katz, & McClintock, 1965; Smith, 
Bruner, & White, 1956) have suggested sev- 
eral motivational bases or determinants that 
may account for the formation and mainte- 
nance of attitudes. Despite the various labels 
that have been applied to these bases, several 
broadly defined motives, or needs, are most 
commonly posited; (a) the utilitarian or in- 
strumental function—a need to maximize one’s 
gains and minimize losses; (b) the need for 
cognition—a need to cognitively structure 
one’s environment for purposes of under- 
standing, prediction, and effective behavior; 
(c) the ego-defensive function—a need to 
defend oneself against anxiety-producing 
thoughts and impulses; and (d) the value- 
expressive function—a need to have a clear, 
accurate picture of oneself in regard to one's 
social world and to maintain consistency 
among one's values and principles. (See the 
Method section for a more complete descrip- 
tion of these bases.) 

To the extent that people do recognize/ 
attribute, or are aware of them, the attitude 
bases of O will be given some weight in the 
evaluation of that O who has expressed an 
attitude. 

Several hypotheses regarding the effects of 
the specific motivational bases of attitude 
formation and maintenance may be derived 
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from this basic hypothesis. In regard to these 
specific motivational bases, it seems reasonable 
to suggest that there might be some rather 
consistent evaluation of these bases across 
persons in terms of desirability. For example, 
the fact that a person’s attitude has a value- 
expressive base implies a commitment to 
ideals, Further, in our culture much emphasis 
and value is placed on the Socratic dictum 
“Know Thyself.” To be aware of one’s own 
true feelings and motives leads to an under- 
standing and a defining of oneself, presumably 
a prerequisite to adequate personality adjust- 
ment. This “knowing of oneself” seems to be 
a precept of considerable current interest and 
concern, On the other hand, lack of insight 
into one’s self is an undesirable state of 
affairs—a condition of maladjustment to be 
avoided. The value-expressive attitude base 
seems to reflect the former ideal. The ego- 
defensive base seems to indicate the latter un- 
desirable condition. In addition, an ego-defen- 
sive base also may imply hostility or tenseness. 
The remaining two bases, utilitarian and need 
for cognition, might be expected to fall some- 
where between them in terms of desirability. 
To the extent that these observations are 
correct, one might predict that (a) Ps will be 
attracted to Os with value-expressive-based 
attitudes more than to Os with any other atti- 
tude base; (5) Ps will be attracted to Os with 
ego-defensive-based attitudes less than to Os 
with any other attitude base. If this is so and 
balance forces are operative, (c) Ps will most 
often infer that Os with attitudes similar to 
their own hold such attitudes for value-expres- 
sive reasons; also, (d) Ps will most often infer 
that Os with dissimilar attitudes hold such 
attitudes for ego-defensive reasons. Finally, 
(e) Ps will most often infer that Os with a 
value-expressive attitude base will hold an 
attitude similar to their own attitude; (f) Ps 
will most often infer that Os with an ego- 
defensive attitude base will hold an attitude 
dissimilar to their own attitude. These hy- 
potheses imply that P evaluates himself posi- 
tively and assumes his own attitudes are 
value-expressive based, 

The last major hypothesis deals with an 
interaction between motivational base and 
attitude similarity-dissimilarity. Given an as- 
sumption on P's part that persons with value- 
expressive-based attitudes are generally aware 
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of their motivations and feelings and essen- 


tially “know themselves,” then knowledge of 
the existence of this attitude base i» a particu- 
lar O would lead an evaluator to clearly rec- 
ognize O’s unequivocal alliance with a par- 


ticular point of view. On the other 
assumes that an O with ego-del 
attitudes generally lacks such se! 
in fact, does not really know his true feelings 
and motivations, then the knowledge that a 
particular O’s attitude is based on ego defen- 
siveness does not lead the evaluator to any 
firm conclusion about O's rea! attitude or 
feeling, 

If knowledge of the various attitude bases 


hand, if P 
isive-based 
ight and, 


leads to different conclusions i: y specific 
situation regarding the probability of the true 
existence of the attitude as stated, then the ! 
similarity-dissimilarity dimension should be 
differentially effective across these bases. In 
other words, if O is believed to hold an atti- 
tude for value-expressive reasons, the evalu- 
ator can be confident that it is (s true atti- 
tude and can therefore respond with certainty 
to the similarity-dissimilarity dimension. 
Thus, similarity-dissimilarity ild affect 
evaluative ratings most differentially in the 
value-expressive case. On the other hand, if 0 
is believed to hold an attitude for ego-defen- 


sive reasons, the evaluator cannot be sure that 
it is O's true attitude; in fact, O's attitude 
may very well be the opposite of the one 
stated (e.g., O may be expressing à “sour 
grapes” attitude). Therefore, the evaluator 
cannot respond with much confidence to the 
similarity-dissimilarity dimension. This would 
suggest that similarity-dissimilarity should 
affect evaluative ratings least differentially in 
the ego-defensive case. No specific hypothe- 
ses were formulated with respect to the other 
two motivational bases, utilitarian and n 
for cognition, 
METHOD 

Subjects 

The subjects were 20 males and 20 females, €?" 
rolled in undergraduate psychology courses. All sub- 
jects volunteered to participate in the experiment, 
and about two-thirds of the subjects received et | 
credit toward their final course grade for partici- | 
pating. 


Procedure 


A : s ith a 
A 38-item attitude questionnaire dealing e 
heterogeneous set of issues (eg, legalization of m 


, premarital sex, the draft, capital punishment) 
he subjects during class time in 
individual attitudes across 
attitude scales were 6-point 
-anging from strongly agree 
f) to strongly disagree (strongly 
opposed to), with no neutral scale point. For each 
Subject, 20 of the items checked in the extreme 3 were 
ndomly selected and randomly assigned to the five 
rimental ta 
The subjects «v 
jn an experiment 
being 


a number of top 
Likert-type scal 
(Strongly in favo 


subsequently seen individually 
that was described to them as 


concerned with the way individuals on the 


basis of limited information perceive, evalu- 
ate, and form impressions of other individuals. 
In particular, we nterested in attitudes and 
the reasons people develop, maintain, and/or 
change their attitudes. There are obviously a 
great variety of ns why persons hold cer- 
Lain opinions. A itudes may serve simultane- 
"ously several different purposes for individuals. 
On the other bend, in some instances one spe- 
cific function may be of overwhelming impor- 
tance. We have on the basis of clinical evalua- 
_ tions selected several of the more frequent 
reasons for holding attitudes and have briefly 
described them on these cards, which are coded 
by the numbers 1-4. 
The four card “ve placed on the desk, and the 


"subjects were told 


d that they would be using them 
Tater in the c 


ment, The subjects were then pre- 


sented with each of the five experimental tasks. 
Task 1 
The subject was presented with an attitude state- 
E for each of two Os. The direction of one of 
e attitude statements was similar to the subjects, 


While the other was dissimilar, The statements were 
Presented one at a time, and the order of similar- 
dissimilar direction (of statement) was counterbal- 
E from subject to subject. The subject was in- 
Tucted to read each attitude statement and then to 
‘cord his impression of each other person by re- 


‘ponding to eight rating scales. 
——— 


Lh. Tesponse in the two extreme scale positions of 
E nd of the scale was considered an "extreme 
lo at ^ All subjects except four females responded 
fti» d 20 items in the extreme, Of these, two 
trem ed to 19, one to 17, and one to 16 in the 
E For these four subjects, the remaining few 
ma raga to make up the necessary 20 items 
‘I rawn at random from the total item pool. 
the t may be noted that in the general instructions, 
Rote attention was directed toward the 
ES- nd holding the attitude, This focusing may be 
lituatio osely related to such “naturally occurring 
me, O5 as clinical settings, interviews, etc. We 
Cognize that there may be other situations in which 
Ocus On reasons does not occur. 
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Task 2 


The subject was presented with a different attitude 
topic, or issue, for each of four Os, but was not 
informed of O's attitude direction, that is, whether 
any given O was favorable to or opposed to the 
issue. An O's motivational base, or reason for holding 
the attitude, was indicated by a number referring to 
one of the four numbered cards. The 5 X7 cards 
numbered 1-4 contained descriptions of the motiva- 
tional bases. These descriptions represent a distillation 
of descriptions presented elsewhere by a number of 
researchers and theoreticians concerned with the 
motivational basis of attitude formation and change 
(Cohen et al., 1955; Katz & Stotland, 1959; Sarnoff 
& Katz, 1954; Smith et al, 1956). 


Card 1 (utilitarian base). This person holds this 
attitude because it facilitates his attainment of some 
objective. That is, he holds it for purposes of ob- 
taining some positive return from his environment 
and/or avoiding some punishing consequence from 
his environment. Such a positive return may be in 
the form of physical or material objects, or perhaps 
more commonly in the form of gaining status or the 
social approval of others. He holds this attitude as 
a means of achieving a practical end—maximizing 
gains and minimizing losses. 


Card 2 (value expressive). Individuals have char- 
acteristic values which serve to help them know who 
they are and what they are. This person’s attitude 
gives expression to the positive values, or ideals, 
which he holds about himself and his social world. 
This attitude is internalized by the person as an end, 
not as a means to achieving some environmental 
reward. For example, a person may have a set of 
principles or an ideology by which he lives, and any 
particular attitude may simply reflect these values 
and principles. 


Card 3 (need for cognition). People are uncom- 
fortable with situations and things which are am- 
biguous, chaotic, and unpredictable. To reduce this 
discomfort and behave effectively, they must men- 
tally impose order on their world. Some attitudes 
are useful in accomplishing this order by making 
clear what to expect from particular things or situ- 
ations and how to behave toward those things and 
in those situations. By maintaining this particular 
attitude, this person is better able to understand, and 
predict, the events in his world so that he can 
behave effectively. 


Card 4 (ego defensive). This person holds this 
attitude as a way of defending himself against recog- 
nizing, or becoming aware of, the fact that he har- 
bors unwelcome thoughts, feelings, or impulses. This 
individual's attitude may represent a defense which 
takes the form of denying the impulse, projecting 
the impulse onto others, taking an extreme opposite 
position to the one he unconsciously holds, or dis- 
placing the unwelcome thought, impulse, or feeling 
toward a safer or more acceptable target. In this 
way he gains some behavioral expression of the 
feeling while protecting himself against the anxiety 
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TABLE 1 
MEAN ArrRACTION RATING BY LEVELS OF BASE AND SIMILARITY 
Base | Combined rating Similar Dissimilar | Difference E 
gl 5 2 " 
No base | 8.83, 10.30 7.35 | 2 42.18 
Value expressive 9.53, 11.03 8.03 3 43.628 
Utilitarian 6.83, 8.25 5.40 ^ 39.378 
Need for cognition Es n2 ec $ ae 
Ego defensive 6.55, .85 ` A 
Total 8.17 9.43 6.90 2. 159.46* 
Nove: Combined rating means with a common subscript do not differ significantly (5 > .01) by New Ceuls analysis, 
*p <.01. 


that he would experience if he saw himself as the 
true source of the feeling. 


After the subject read the attitude topic and the 
description of the indicated motivational base, he was 
instructed to record his impression of each O on the 
eight scales. 


Task 3 


Each subject was presented with an attitude state- 
ment and a motivational base for each of eight stimu- 
lus Os. For each subject, the order of the four moti- 
vational bases, each paired with both a similar and 
a dissimilar attitude statement, was systematically 
varied by means of an 8 X 8 Latin square. The sub- 
ject was instructed to read both the attitude state- 
ment and the reason (motivational base) why the 
person held the attitude—indicated again by the code 
number of the card—and then to record his impres- 
sion of each person on the standard eight scales, 

Tasks 4 and 5 were inference conditions, with 
Task 4 being inference of base from attitude and 
Task 5 being inference of attitude from base. 


Task 4 


Each subject was presented with an attitude 
statement (direction indicated) for each of two Os. 
Again, the order of similar and dissimilar attitudes 
was counterbalanced. The subject was instructed to 
read each attitude statement and then give his opin- 
ion as to which of the four described reasons (moti- 


vational bases) motivated the O to hold that par- 
ticular attitude. 


Task 5 


The subject was presented with an attitude topic, 
or issue (no direction given), for four Os. For each 
of the Os, a motivational base was indicated by a 
code number. The order of the bases was random. 
The subject was instructed to read the attitude issue 
and the reason (motivational base) indicated, and 
then to give his opinion as to which of the attitude 
directions it represented—agree (in favor of) or 
disagree (opposed to). 

The order of the various tasks was Systematically 
arranged. While this study is based to a great extent 
on the premise that individuals do attend to attitude 
bases in their evaluations of others, Task 1 was 
placed first in the sequence of conditions so that the 
procedure would not suggest to subjects to infer a 


imilar strangers. 
ects with the 


motivational base for similar a 
Task 2 was second to acquaint the 


motivational bases. Tasks 1 and ether prepared 
the subjects for responding to Task 3, in which both 
base and attitude were invol rhe remaining two 
conditions could have been interchanged. For each of 


them, it was necessary that the subjects be quite 
familiar with the various motivational bases. This 
familiarity was provided by the subjects’ experience 
in the first three conditions. 


Rating Scales 


In Tasks 1-3, the subjects rec 
sions on a modified version of th 


Byrne Interper- 


sonal Judgment Scale (Byrne, 1961), The scale 
called for the subjects’ ratings in the order shown, 
of how much they would like the st: D how much 
they would like working with him in an experiment, 


how similar to themselves in personality they thought 
him to be, how much they thought he would like 
them, also how intelligent, how knowledgeable of 
current events, how moral, and how well-adjusted 
they considered him to be. The ratings on the first 
two scales were combined to yield the attraction 
measure. A high positive correlation (r= .85) bes 
tween these two scales has been reported (Byrne & 
Nelson, 1965). 


RESULTS 


A 2 (Sex) x2 (Attitude Similarity/Dis 
similarity) x 5 (Attitude Base: four bases 
plus no base) analysis of variance, with re- 
peated measures on the last two factors Was 
performed for each of the dependent mea 
sures.’ There were no significant main effects 
due to the sex factor in any of the analysts: 

The main effect of attitude similarity-d& 
similarity on the subjects’ ratings of d 
persons was highly significant for every œŒ 


e) 

"The results of Task 2 (base presented x 
were analyzed by a 2 (Sex) X4 (Base) e ie 
variance with repeated measures on base. Only of 


main effects of attitude base were significant on 


for 
the dependent measures. The pattern of ees 
the different attitude bases was essentially b x$ 
as in the 2 (Sex) X 2 (Similarity /Dissimilarity 


(Base) analysis, 


ire. The mean ratings of similar 
strangers for the attraction mea- 
nted in Table 1. On the attrac- 

ummation of ratings for the 

personal feelings and working 
| experiment), the difference 
: of attitudinally similar and 
vers was significant. Thus, in 
previous research findings, the 
ich asserts that persons will like 

à similar attitude more than 
imilar attitude, was confirmed. 

other six dependent measures 
revealed that a person with a similar rather 
than a dissimilar attitude is thought (a) to be 
more similar in personality to oneself (F = 
90.24, df = 1/38, p < 001); (b) to recip- 


pendent mea 
and dissimilar 
Sure are p: 
lion measure 
first two sca 
together i: 
between ra 
dissimilar s 

P line with oth 
hypothesis, y 
a stranger 
one with a í 
Analysis of the 


rocally like oneself more (F = 117.81, df= 
1/38, p < .0 ; (c) to be more intelligent 
(F = 39.61, cf = 1/38, p < .001); (d) to be 
more knowledgeable of current events (F = 
43.27, df = 1/38, p < .001); (e) to be more 
Moral (F = 17.72, df=1/38, p< .001); 


and (f) to be better adjusted (F = 33.05, dj 
=1/38, p< | 01). 

. The main effect of attitude base was also 
Significant for every dependent measure. It 
Was predicted that knowledge of the motiva- 
tional base of another person's attitude will 
affect one’s evaluation of and attraction rat- 
ing of that person. On the attraction measure, 
he differences among attitude bases were sig- 
nificant beyond the .001 level, thereby con- 
ON the hypothesis (F = 24.60, df = 4/ 


Tn a comparison of the value-expressive base 

With the average of all others, persons with 
‘alue-expressive-based attitudes were rated 
Significantly more attractive (F = 30.28, df 
z1/1532, p< 001), as predicted. Also as 
hypothesized, a comparison of the ego-de- 
“sive base with the average of all others 
Tevealed that persons with ego-defensive- 
ased attitudes were rated as significantly 
SS attractive (F = 42,42, df= 1/152, p< 
E». A Newman-Keuls analysis of the or- 
ted differences in mean attraction ratings 
E motivational bases (combined over 
T War and dissimilar Os) is presented in 
able 1, 


lt Was also hypothesized that similarity— 
lia. milarity of attitudes should be differen- 
Y effective across attitude bases in influ- 


heit LP oi up 
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encing the subjects’ attraction ratings of oth- 
ers. Specifically, it was predicted that attitude 
similarity—dissimilarity should have the great- 
est effect on attraction when the basis of the 
attitude is value expressive and the least 
effect when the basis of the attitude is ego 
defensive. Inspection of the difference in at- 
traction between similar and dissimilar others 
at each motivational base shows this, in fact, 
to be the case (see Table 1). The relevant 
source of variance for testing the significance 
of this finding in the overall analysis is the 
Similarity X Base interaction. This interaction 
(which included five levels of base) was mar- 
ginally significant (F = 2.14, df = 4/152, p 
< .08). Since, however, the hypothesis ex- 
plicitly dealt with the value-expressive and 
ego-defensive bases, it seemed reasonable to 
look at the Base X Similarity interaction 
when only these two levels of base were con- 
sidered. This interaction did reach a more 
conventional level of significance (F = 6.16, 
df — 1/152, p < .05), and indicates that the 
effect of attitude similarity is stronger at the 
value-expressive base than at the ego-defensive 
base. In order to determine if this difference 
was due to a heightening in the effectiveness 
of similarity with the value-expressive base 
or a dampening in the effectiveness with the 
ego-defensive base, or both, some additional 
2X2, Base X Similarity interaction com- 
ponents were computed. The first included 
value expressive and no base as the two levels 
of base, and the second included ego defensive 
and no base as the two levels of base. The first 
interaction was not significant (F < 1), while 
the second was significant (F = 5.79, dj = 
1/152, p < .05), indicating a dampening ef- 
fect of attitude similarity when the base is 
ego defensive. Consistent with this implica- 
tion is the finding that similarity and base 
interact when the two levels of base are ego 
defensive and utilitarian (F = 5.07, p < .05) 
and ego defensive and need for cognition (F 
= 2.81, p< .10), while there is no interac- 
tion when the two levels of base are value 
expressive and utilitarian (F < 1) or value 
expressive and need for cognition (F = 1.52, 
ns). 

Chi-square analyses were used to test the 
hypotheses dealing with inferences, It will be 
recalled that each of the 40 subjects was asked 
to infer the motivational base from knowledge 
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TABLE 2 


CHI-SQUARE ANALYSES OF INFERRING SPECIFIC 
MOTIVATIONAL BASE FROM KNOWLEDGE OF 
ATTITUDE DIRECTION OF STRANGER 


Base Valve-expressive | All others xi 
Similar 20 20 13.33** 
Dissimilar 5 35 3.33 

Ego-defensive All others x 
Similar 4 36 4.80* 
Dissimilar 15 25 3.33 
* p «.05. 
** p <.001. 


of O’s attitude direction (Task 3). It was 
predicted that when O's attitude was similar 
to the subject’s, the value-expressive base 
would be overinferred, and when O’s attitude 
was dissimilar, the ego-defensive base would 
be overinferred. The results in Table 2 lend 
significant support to the first prediction, but 
not the second. The ego-defensive base, how- 
ever, does seem to be underinferred when the 
other’s attitude is similar to the subject’s. 
Attitude similarity-dissimilarity did not have 
a significant effect on the inference of the 
utilitarian or need-for-cognition bases. 
Subjects were also presented with the base 
for each of four attitude topics and were 
asked to infer O's attitude direction (Task 5). 
The subject's original questionnaire provided 
information for classifying his inference in 
terms of similarity to his own attitude. Table 
3 reveals, as predicted, that when given a 
value-expressive attitude base, the subjects 
inferred, more often than would be expected 
by chance, that O's attitude was similar to 
their own. Contrary to prediction, given an 
ego-defensive base, a dissimilar attitude was 
not inferred significantly more often than 
would be expected by chance. However, if the 
effects of value expressive and ego defensive 
are looked at in comparison to each other, 


TABLE 3 


CHI-SQUARE ANALYSES OF INFERRING ATTITUDE Di- 
RECTION FROM KNOWLEDGE OF MOTIVATIONAL 
BASE OF STRANGER'S ATTITUDE 


Similar Dissimilar x 
Value expressive 31 9 12.10* 
Utilitarian 20 20 .00 
Need for cognition 17 23 | 90 
Ego defensive 15 25 2.50 


*p < 025, 
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that is, analyzed as a 2 X 2 chi-square, we 
find a significant difference (^ = 12.10, df= 
1, p< .001): Subjects more often inferred a 
similar attitude for persons with a value 
expressive base and a dissimilar attitude for 
persons with an ego-defensive base. Caution 


should be used in interpreting this finding, 
however, since the observations are not inde. 
pendent. 
Discussion 
The finding that subjects were more at 
tracted to an O with a similar attitude than to 


an O with a dissimilar attitude is unremark- 
able, considering the extensiveness of data 
previously gathered in support of such 4 
notion (cf. Byrne, 1969). 

It is more important to note that strong, 
support was also obtained for the hypothesis) 
dealing with the main effect of motivational 
base on attraction. Ignoring whether another 
person agrees or disagrees, if his attitude is 


seen to be value expressive, he is liked better 
than if his attitude is seen to be ego defensive. 
Others with need-for-cognition- and utilitar- 


ian-based attitudes are liked an intermediate 
amount. The most reasonable explanation for 
these findings seem to be that the value-et 
pressive base connotes self-awareness, cogni- 
tive consistency, and good personal and emo- 
tional adjustment—all culturally and socially 
desirable characteristics or traits, while the 
ego-defensive base connotes inconsistency; 
lack of self-insight, and poor adjustment. 
Not only was attraction expected to be af- 
fected by attitude similarity and motivational 
base, when considered independently, but it 
was also predicted that similarity and ba 
would interact in their effect on attraction: 
Specifically, it was anticipated that attitude 
similarity would have a greater effect on at- 
traction when the motivational base w 
value expressive than when the base was ego 
defensive. This prediction was confirmed. The 
basis for the prediction was the notion tha 
certainty about how the other person 7^ J 
feels is inherent in a value-expressive attr 
tude, and uncertainty is inherent in an €g? 
defensive attitude, since the other person hint 
self may not be aware of his true feelings 
the extent that attitude similarity leads i? 
attraction because it helps to consenst® i 
validate one's own beliefs (i.e., lowers © 


ce motivation: Byrne & Clore, 1967) 
the value expressive attitude should be 
re effective than the ego-defensive attitude. 
If one accepts the idea of a “two-step” in- 
formation-pr« ing strategy, another possi- 
ble explanation for the interaction effect be- 
“tween attituc ^ and similarity, and, in 
"particular, the mpening effect of the ego- 
defensive base the similarity-dissimilarity 
"dimension, c: derived from Newcomb's 
(1968) discussion of nonbalance (i.e., a con- 
dition in whic e P to O relation is nega- 


tive), as opposed to balance and imbalance. 
The subjecis in the present study were pre- 

sented with two bits of information on which 

to base their evaluation of a person—attitude 


"similarity and motivational base. If subjects 
go through a two-step procedure (to arrive at 
4 “composite cognition"), whereby attitude 
base is examined first and given some high 
weight toward an attraction rating, it is con- 
tevable that being presented with a highly 
undesirable piece of information such as the 
ego-defensive base leads the subject to an 
immediate negative evaluation of the target 
person and, thus, renders the attitude (simi- 
lar or dissimilar) a less relevant and less po- 
tent influence on the attraction rating. Some 
Support was given to this explanation by the 
‘comments of a number of subjects during 
"ostesperim. ntal interviews. Of those subjects 
x when probed, indicated a weighting sys- 
tem regarding the effects of base and similar- 
lty on their ratings, almost twice (15 out of 
4) as many said that they differentially 
Weighted base higher than attitude similarity 
Or dissimilarity. 
The finding that attitude similarity had a 
highly significant effect on assumed personal- 
B, similarity suggests another possibility. 
hile similar attitudes lead to a positive eval- 
“ation, it also implies similarity along other 
dimensions. for instance, motivational base. If 
the motivational base is undesirable, as is the 
*8-defensive, similarity might be threatening. 
| is in turn might cause the similar, ego- 
tfensive other person to be evaluated lower 
t an could be predicted on the basis of simi- 
ES or base alone. On the other hand, dis- 
eR attitudes lead to a lower evaluation 
ES not imply similarity along other dimen- 
E^ If the motivational base is undesirable, 
| ‘similarity might be relieving, and then the 
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rating would be more positive than expected 
by the individual components alone. A similar 
type of argument has been advanced by No- 
vak and Lerner (1968) in explaining their 
finding that when the O is believed to be 
normal, subjects showed a greater desire to 
interact with a similar rather than a dissimilar 
O; when O is perceived as emotionally dis- 
turbed, subjects prefer to interact with a dis- 
similar O rather than a similar one. 

The inference data revealed that subjects 
tended to overinfer value-expressive reasons 
and to underinfer ego-defensive reasons for 
other persons with similar attitudes. For Os 
with dissimilar attitudes, there were marginal 
trends in the direction of underinferring the 
value-expressive motive and overinferring the 
ego-defensive motive. One explanation for the 
direction of the findings is that subjects may 
be implicitly rewarding the agreeing O with 
the attribution of a socially desirable charac- 
teristic, and punishing the disagreeing O with 
attribution of a socially undesirable motive. 
Alternatively, as previously argued, similarity 
of attitudes implies similarity on other di- 
mensions. If subjects think of their own atti- 
tudes as value expressive and not ego de- 
fensive, then these are the kinds of inferences 
they would make about similar Os. On the 
other hand, if the other person is not atti- 
tudinally similar, it implies lack of similarity 
along other dimensions, hence, it is more diffi- 
cult to know what that person's attitude base 
is. The data seem to be consistent with this 
explanation, in that the attribution tendencies 
were clearly significant when the other person 
was attitudinally similar to the subject, but 
only marginally so when the O was attitudi- 
nally dissimilar. 

Turning now to the task in which the sub- 
jects inferred attitude direction from motiva- 
tional base, it will be recalled that only when 
O had a value-expressive-based attitude was 
there a difference in the number of similar and 
dissimilar attitudes inferred. While there is a 
trend toward overinferring dissimilar attitudes 
with an ego-defensive other person, it does not 
approach significance. Perhaps this is due to 
the fact that even though the ego-defensive- 
based O is seen to be dissimilar to the subject, 
that O is so *mixed up" that he might, in 
fact, be attitudinally similar. 

The results of this study provide strong 
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support for the hypotheses. Not only does 
attitude similarity affect attraction, but the 
reason the attitude is held also affects attrac- 
tion. Further, attitude similarity and motiva- 
tional base interact in their effects on attrac- 
tion. 

The present study looked at four bases 
identified by social psychologists as being use- 
ful. It is important to point out that these 
may or may not be the kinds of bases naive 
perceivers typically use (Hastorf, Richard- 
son, & Dornbusch, 1958). To the extent that 
they are not, the results of the study are 
probably conservative. Future research might 
concern itself with determining the kinds of 
bases actually used. 

In addition to demonstrating the effects of 
attitude similarity and base on interpersonal 
attraction, the study showed that attitude 
similarity—dissimilarity has reliable effects on 
the inferred reason on which the attitude is 
held, and that the reason the attitude is held 
has reliable effects on whether the attitude is 
inferred to be similar or dissimilar. This sug- 
gests that additional research be undertaken 
exploring further conditions of inference. For 
example, a value-expressive base might be 
inferred from out-of-role performance (Jones, 
Davis, & Gergen, 1961), a utilitarian base 
from the fact that an individual is given a lot 
of money for stating his opinion (Bem, 1967), 
an ego-defensive base if an individual over- 
states his position (“me thinks thou do pro- 
test too much’’), or a need for cognition if 
the other states his opinion in a novel, am- 
biguous situation. 

Finally, it would be useful to determine the 
extent to which the attitudinal bases of the 
perceiver affect his perception of attitudinally 
similar and dissimilar Os when the attitudinal 
bases of these Os are similar or dissimilar to 
his own. These perceptions should be investi- 
gated both in terms of interpersonal attrac- 
tion and the kinds of inferences made. 
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REDUCTION OF PHYSICAL AGGRESSION: 
THE EFFECTS OF FOUR STRATEGIES ' 


RICHARD PISANO axb STUART P. TAYLOR? 


Kent State University 


Four types of aggression-control strategies were employed in an attempt to 


reduce the shock intensities set by 40 high aggressors. The most effective 
strategy was the one in which the victim matched the attacker's shock set- 
tings. The most ineffective strategy was the one in which the opponent behaved 


Psychologists have expended considerable 
"energy in attempting to determine the princi- 
‘pal antecedents of physical aggression. In 
fact, some researchers have become quite 
adept at eliciting physical aggression in the 
laboratory (e.g., Berkowitz, 1964, 1965; Berk- 
owitz & Geen, 1967; Buss, 1966; Taylor, 
1967). Surprisingly, very little attention has 
been directed toward investigating the means 
of controlling or reducing physical aggression. 
‘The major purpose of this experiment was to 
attempt to influence the level of aggression of 
persons who are prone to inflicting pain upon 
others by varying the response strategy of the 
victim. 
Theoretically, there are at least three strate- 
Bies that a victim may adopt to reduce ag- 
gressive behavior: (a) He may attempt to in- 
‘limidate his adversary by behaving toward 
him in an intensely punitive manner. (^) He 
“May attempt to eliminate all provocative cues 
by adopting a passive strategy. In addition to 
allowing the aggressive response to extinguish, 
this latter strategy could elicit guilt and 
anxiety concerning attacking a “defenseless” 
person. (c) He may attempt to combine the 
Passive and punitive strategies by matching 
the attacker’s aggressiveness. 

Potentially, these strategies might produce 
Undesirable effects: The punitive and match- 
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Partial fulfillment of the master’s degree at Kent 
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in a punitive manner; it increased aggressive responding. A monetary incentive 
manipulation, in which the subjects were given extrinsic rewards for setting 
low-intensity shocks, and a passive strategy, in which the opponent continually 
set low-intensive shocks, were minimally effective. 


ing strategies could elicit further, more inten- 
sive aggressiveness since they involve the use 
of noxious stimulation (Buss, 1961); given 
the fact that a victim's pain can be an effec- 
tive reinforcer (Buss, 1961) and that fear of 
punishment is an inhibitor of aggression 
(Dollard, Doob, Miller, Mowrer, & Sears, 
1939), the passive strategy could encourage 
rather than discourage attack behavior. 
Shure, Meeker, and Hansford (1965) re- 
ported, for example, that a pacifist strategy in 
a bargaining situation encouraged exploitive 
behavior. 

In this experiment, high aggressors, that is, 
subjects who attempted to deliver intense 
shocks to their opponent prior to receiving 
information concerning their opponent’s ag- 
gressive intentions, competed in a reaction 
time situation in which they could hurt and/ 
or be hurt by their opponent. One opponent 
behaved in a minimally aggressive manner, 
continuously setting low shock intensities. 
Another opponent behaved in a punitive man- 
ner, always setting maximum shock intensities. 
The third type of opponent matched the ag- 
gressiveness of the subject. Thus, the three 
types of opponents conform to the three types 
of aggression-reducing strategies noted above. 

A fourth group of high aggressors was in- 
iormed that participants would receive a 
monetary reward each time they selected a 
shock intensity. Specifically, the subjects were 
told that the lower the intensity of shock set, 
the more money they would receive. The 
purpose of this condition was to determine 
whether or not monetary incentives could re- 
duce the expression of physical aggression. 
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METHOD 
Subjects 


Forty undergraduates, enrolled in an introductory 
psychology course at Kent State University, who set 
high shock intensities (3, 4, or 5) on the first trial 
were classiñed as high aggressors. They were ran- 
domly assigned to one of four aggression-reduction 
conditions. For purposes of comparison, 12 subjects 
who set low shock intensities (1 or 2) on the first 
trial (low agressors) were randomly assigned to two 
of the aggression-reduction conditions. Sixty-eight 
other subjects who set low shock intensities were 
used in another study. 


Procedure 


The subject was seated at the task board (de- 
scribed in Taylor, 1967), and the shock electrode 
was attached to the left wrist. After the subject’s 
“unpleasantness” threshold for shock was determined, 
he heard a tape recording of the task instructions. 

Each subject was told that he was competing with 
another subject in an adjoining room on a task in- 
volving reaction time. At the beginning of each trial, 
he was instructed to select (by pressing one of five 
buttons) any one of five intensities of shock that he 
wished his opponent to receive. He was informed 
that the shock would be administered to his op- 
ponent at the end of a trial if he was faster than 
his opponent, and that he would receive the shock 
his opponent set for him if his opponent was faster. 
Thus, he realized that either he or his opponent 
would receive a shock, depending upon the outcome 
of the competition, and that both could select the 
intensity of the shock the other would receive. 

Actually, there was no opponent. The frequency of 
wins and losses and the amount of shock received 
were programmed by the experimenter. 

Immediately after the task instructions were heard, 
the trials began. Each trial consisted of four specific 
events: (a) a signal to set the level of shock that the 
subject wished his opponent to receive; (b) a ready 
signal, where the subject pressed his finger down on 
the telegraph key; (c) the trial itself, where the 
subject released the key as quickly as possible; and 
(d) a feedback signal that consisted of a light which 
indicated the amount of shock that had been set for 
the subject by his opponent, and if the opponent 
was faster, the shock as well. The four events and 
the intertrial interval were of 5-seconds duration 
each, The shock, when appropriate, was of 1-second 
duration, 

All subjects lost 50% of the trials, according to 
several predetermined orderings. In each of the four 
blocks of six trials (Trials 2-25), the subjects re- 
ceived three shocks. Two exceptions to the prede- 
termined orderings were made. When possible, if a 
subject’s reaction time was extremely fast on a given 
trial, he was not shocked on that trial. If his reac- 
tion time was extremely slow on a given trial, he 

was shocked on that trial. No subject received shock 
on the first trial. 
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In establishing the intensities of shock 
ministered to each subject, the intensi 


to be ad- 
judged as 


“definitely unpleasant" was designated ss Number $; 
Number 4 was set at 90% of the maximum; Number 
3, at 80% of the maximum; and Num! 2 and 1, at 
70% and 60% of the maximum, respectively. 
Experimental Manipulations 

High aggressors (subjects who se shocks on 
the first trial) were randomly placed i one of the 
four conditions. In the passive cond ill subjects 
competed with an opponent who < ly set a 
minimal intensity shock (Number 1). Subjects in 
the maiching-response condition con d with an 
opponent who employed a matchin egy. On 
the first trial the opponent set a i-i y shock. 
On all other trials (2-25), the opponent set the 
level of shock that the subject had set on the previ- 
ous trial. In the punitive condition all subjects f 
a highly punitive opponent who continually set 
maximum intensity of shock (Number 5). Finally, 
the money condition all subjects competed with a 
highly punitive opponent who continually set the 
maximum intensity shock (Number 5). Following the 
first trial, these subjects received special instructions. 
They were told that they would ri > monetary 


rewards: 5 pennies each time they set the 1-intensity 
shock, 4 pennies each time they sei the 3-intensity 
shock, 2 pennies each time they sc! ihe 4-intensity 
shock, and 1 penny each time they se! t intensity 
shock. The appropriate amount of moncy was deliv- 
ered immediately after the subject set a level of 
shock. 

Low aggressors were randomly placed 
passive condition, in which they f. opponent 
who set a 1-intensity shock on all t , or a puni- 
tive condition, in which they competed with an 
opponent who set a 3-intensity shock on all trials. 


into either à 


RESULTS 
Initial Effects 


According to an analysis of variance of 
mean shock intensities set by the high aggres- 
sors on the first seven trials (see Table 1), 
the differences in shock settings among the 
four conditions were significant at the .002 
level (F = 6.34, df = 3/36). The mean shock 
settings for the punitive, money, passive, an 
matching-response conditions were 3.89, 3.43, 
2.93, and 2.50, respectively. Furthermore, the 
Strategies X Blocks interaction was significant 
at the .002 level (F = 2.33, df = 18/216). 

As can be seen in Figure 1, the punitive 
strategy did not effectively suppress or reduce 
aggressive responding; instead, this strategy 
elicited greater intensities of attack. While 
only 20% of the subjects set the maximum 
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shock intensity on the first trial, 70% of the 
subjects set the maximum shock on the sev- 
enth trial. It is rather informative that very 


few subjects in this condition attempted to 
influence their opponents’ behavior by setting 
low-intensity shocks. Not one subject, for ex- 
ample, chose to s the minimal shock in- 
tensity. Only three subjects selected the next 
highest lev Number 2); these subjects se- 
lected the 2-intensity shock only once. 

The matching strategy appears to have most 
effectively reduced the intensity of the high 
aggressor’ tack. According to the post- 
analysis o wiance test (Newman-Keuls), 
the differences in shock settings between Trial 


1 and Trials 2, 3, 4, 5, and 7 are significant at 
the .05 level. Of course, it should be noted that 
even the matching strategy did not produce 
“spectacular” ‘results, While three subjects 
did reduce their shock settings to the lowest 


level, th& average shock setting on the sev- 


enth trial was 2.4, a reduction of only .9 from 
the first trial, One subject, in fact, increased 
the intensity of his shock settings. While set- 
ting a 3-intensity shock on the first trial, he 
set the n um shock on the seventh trial. 

The immediate consequence of the low 
shock intensity set by the passive opponent on 
the first was a reduction in the subjects’ 


shock settings. In contrast to the matching 
strategy, however, the passive strategy could 
not maintain the low aggression levels. Ac- 
cording to the Newman-Keuls tests, the differ- 
ences in shock settings between the first and 
later trials were not significant. Only one sub- 
ject was appreciably influenced by the op- 
ponent’s passive behavior. He started the ses- 
sion by setting a 3-intensity shock and set a 
l-intensity shock on the sixth and seventh 
trials. On the other hand, while competing 
with subjects who continually set minimal 
shock intensities, four subjects set 4s and 
lour subjects set 3s on the seventh trial. 

It is rather surprising, given the pacifistic 
response style of the victim, that the weakest 
shock intensity (Number 1) was used very 
infrequently. One subject selected the minimal 
shock on three occasions, while two subjects 
selected this shock on only one occasion. Thus, 
the 1-intensity shock was selected on only 5 


of a possible 70 occasions. 
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TABLE 1 


ANALYSIS OF VARIANCE OF AGGRESSION SETTINGS 
FOR Four SrRATEGIES: Snorr-Term Evrects 


Source dj MS F 
Between Ss total 39 
Strategies (A) 3 23.51 6.34" 
Error (between) 36 3.71 
Within Ss total 40 
Trials (B) 6 4.07 | 6.68% i 
AXB 18 142 | 233° M 
Error (within) 216 61 
*p <.002. 
** 5 c 001. 


Subjects in the money condition appeared 
to be influenced both by a desire for extrinsic — 
rewards and a desire to inflict pain on their 
adversaries. According to the Newman-Keuls — | 
analysis, the drop in aggression from Trial 
to Trial 2, the increase in aggression fom 
Trial 2 to Trial 4, the decrease from Trial 4 | 
to Trial 5, and the increase from Trial 5 to - 
Trial 7 are all significant at the .05 level. 
Thus, the subjects vacillated between attack- 
ing their opponent and accumulating mone- 
tary rewards. The trial-by-trial consistency 
evidenced by subjects in this condition was 
rather unexpected. 


imeem money 
passive 
— motching 


POE EET. | 


MEAN AGGRESSION SETTINGS 


TRIALS 


Fic, 1. Mean aggression settings as a function of the 
first seven trials and strategies. 
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BLOCKS 


Fic. 2, Mean aggression settings as a function of 
the last three blocks of trials and strategies. 


Long-Term Effects 


The above results demonstrate that the 
response style of the victim can influence the 
shock-setting behavior of the high aggressor. 
An important question is, Will the effects last? 
Will the high aggressor, for example, continue 
to respond in a relatively nonaggressive man- 
ner while competing with the opponent who 
utilizes a matching strategy? 

Figure 2 shows the mean aggression set- 
tings for each of the experimental conditions 
for the last three blocks of trials. According 
to the analysis of variance of aggression 
scores on the three blocks of trials, the differ- 
ences in aggression among the groups were 
significant at the .001 level (F = 13.7, dj = 
3/36). Both the main effect of blocks and 
the Blocks X Conditions interaction were not 
significant. Apparently, within each condition, 
mean shock-setting behavior remained highly 
stable over blocks of trials. 

]t may be recalled that two groups of low 
aggressors, the subjects who set low-intensity 
shocks on the first trial, were allowed to com- 
pete in the reaction time situation. One group 
competed with passive opponents, and the 
other group competed with punitive oppo- 
nents. For both groups, the mean shock set- 
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ting on the initial trial was 1.33. The mean 
shock intensities set by the subjects competing 
with the passive opponent, on Blocks 1-4, 


were 1.14, 1.06, 1.00, and 1.00. The mean 
shock intensities set by the low aggressors 
who competed with the punitive opponents 


were 2.44, 2.01, 3.28, and 3.83. 


DISCUSSION 
In an interview presented in Adolescent Ag- 
gression, by Bandura and Walters (1959), a 


father was asked, “Have you ever e: 
Earl to use his fists to defend him 

father replied, “Yes, if necessary. I 
him many times that if someone 

fight with him and started the old 

chip on the shoulder, don't hit th: 
his jaw and get it over with [p. 115]." 
punitive victim in this experiment adopted 
such a strategy. From the first trial io the last, 
regardless of the high aggressor’s iutensity of 
attack, this victim attempted to intimidate 
the subject by setting the maximum shock in- 
tensities. The strategy was ineffective, since 


it neither stabilized nor reduced the intensity 
of attack; instead, it instigated greater in- 
tensities of attack. The response of the non- 


aggressive subject to the punitive opponent 
further demonstrates the undesirable nature of 
the punitive strategy. When competing 
against the passive, nonaggressive opponent, 
the low aggressor behaved in a very coopera- 
tive, nonaggressive manner, As long as the 
Opponent set low shock intensities, the low 
aggressor was willing to set low shock intensi- 
ties. However, when the opponent behaved in 
an aggressive manner, the low aggressor re- 
taliated. Thus, the punitive strategy was In- 


TABLE 2 
ANAL NRI OF VARIANCE OF AGGRESSION SETTINGS 


Source df MS F 
-[—————— n 
Between Ss total 39 ge 
Strategies (A) 3 30.7 13.75 


Error (between) 36 2.23 
Within Ss total 80 
Blocks (B) 2 08 
AXB 6 9 
Error (within) 72 19 


* p < 001, 
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effective in terms of controlling and preventing 
the expressi »f aggressive behavior. 

It should be recognized that the ineffective- 
ness of the itive strategy might have been 
due to th ticular shock intensities that 
were available to the victim. The maximum 
shock inte used in this experiment was 
labeled “‘d iely unpleasant, but not pain- 
ful.” If the victim had the opportunity to set 
“definitely ful” shocks, the high aggres- 
sor might have been more reluctant to behave 
in a provocative manner, 

Basically, a “‘pacifistic” strategy involves al- 
lowing an antagonist the opportunity to ag- 
gress at will, without fear of retaliation. 
Those who advocate such a strategy assume 


guilt reac tions, providing a nonaggressive 
model, and allowing the aggressive response to 
habituate. In this experiment, the passive op- 
ponent emp Jo yed such a strategy: he at- 


i^ it reduces aggressive behavior by elicit- 


tempted to placate rather than intimidate his 
attacker * ontinually setting low shock in- 
tensities. This strategy proved to be very 
effective only when used against nonaggres- 
sive opponents. It failed when it was utilized 
against high aggressors, who, at the end of 


the perimental session, were setting essen- 
tially the same intensity shocks as they had 
set on the first trial. It is possible that the 
behavior of the high aggressor was being 
teinforced by the knowledge that his victim 
Was suffering injury, “Pain and injury to the 
Victim,” according to Buss (1961), “are the 
usual consequences of physical aggression 

. since they are closely associated with 
aggression, it is not surprising that they serve 
as reinforcers [p. 2].” 

Psychologists have conducted very few ex- 
perimental investigations on the effects of 
punishment on aggressive behavior. In fact, 
Psychologists have traditionally been quite 
skeptical concerning its use. Recently, how- 
ever, some researchers have suggested that 
punishment might be effective under certain 
Conditions. Its effectiveness, according to 
Walters (1966), would depend “on such 
factors as its nature, intensity, timing, and 
Consistency [p. 67]." More specifically, a 
number of researchers have theorized that 
Punishment might suppress an aggressive re- 


sponse if it were made contingent upon the . 


response. Buss (1961) stated, for example, 
that "successive punishments of the aggressive 
response and reinforcements of the nonag- 
gressive response should lead to a weakening 
of the aggressive response [p. 56]." Lundin 
(1969) noted that “at the human level it is 
also evident that the verbal terms ‘right’ or 
‘wrong’ are used . . . the additive result of 
both is more effective than either alone [p. 
267]." In the matching condition, punishment 
was contingent upon the intensity of shock set 


by the aggressor. When the high aggressor set — 


an intense shock, he received an intense 


shock; when he set a weak shock, he received - 


a weak shock. This “eye-for-an-eye” strategy | 
proved to be relatively effective; the subjects. 
who competed with the matching victim evi- 
denced the greatest reduction in aggressive 
behavior. 

The results of this experiment demonstrate 
quite convincingly that it is extremely diffi- — 
cult to reduce the aggressiveness of persons 
who are intent upon harming others. The | 
opponent was obviously not very successful in 
this endeavor: he provoked the antagonist 
when he attempted to placate him, and he 
only managed to reduce the level of attack 
slightly when he utilized a punishment strat- 
egy. The payoff manipulation, in which the 
subjects were offered monetary rewards for 
setting a low-intensity shock, proved to be 
fairly limited in effectiveness; the aggressor 
simply vacillated between setting low- and 
high-intensity shocks. 

It is evident that researchers must direct 
more attention to the problem of aggression 
control. Although most psychologists would 
concur with Rothballer (1967), that “man’s 
aggression has emerged in this century as the 
central problem in human society which, if 
unresolved, might destroy it [p. 148],” the tra- 
ditional approach to “resolving” this prob- 
lem has involved attempting to determine the 
major situational antecedents of aggression 
(Buss, 1966; Taylor, 1967). It is the opinion 
of the present authors that researchers should 
concentrate their attention on determining 
efficient, nonpunitive means of reducing 
rather than instigating aggression. 
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received a 
while the remainder of the subjects received a nonhumorous form of the 


examination 


_ Recent years have witnessed a renewed 
‘interest in the experimental study of humor 
and the psychological functions that it serves. 
The most commonly utilized conceptual 
j rk underlying such studies is the no- 
tion, derived from both psychoanalytic 
(Freud, 1928) and learning-based (Byrne, 
1956) approaches, that humor serves to re- 
lieve tension. There is both observational and 
“experimental evidence to suggest that humor 
is capable of reducing negative affective 
‘States. In his naturalistic observations of 
graduate students who were preparing for 
their doctoral examinations, Mechanic (1962) 
noted that jokes were often used as a defense 
against anxiety engendered by the forthcom- 
ing examinations. Experimental evidence for 
humor-mediated tension reduction has re- 
cently been reported in studies by Dworkin 
and Efran (1967) and by Singer (1968). In 
both studies, subjects who were exposed to 
manipulations intended to induce emotional 
arousal reported significant decreases in anger 
and in anxiety following subsequent exposure 
to humor. 

Changes in affective state in both the 
Dworkin and Efran and Singer studies were 
defined in terms of subject responses on pre- 
humor and posthumor mood checklists. The 
1 Requests for reprints should be sent to Ronald 
E. Smith, Department of Psychology, University of 


Washington, Seattle, Washington, 98105. 
Now at Albion College. 


High-test-anxious subjects receiving the nonhumorous 
performed significantly more poorly than did low- or moderate-test-anxious 
and at a significantly lower level than did the high-anxiety group 
ed the humorous form. The results lend support to the hypothesis 
that exposure to humor may reduce anxiety and thereby affect task performance. 
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Purdue University 
s differing in level of test anxiety were administered a course examina- 


er standard classroom conditions. Approximately half of the subjects 
form of the examination containing one-third humorous test items, 


form 


next logical step, then, would seem to in- 
volve moving beyond subject self-report in- 
dexes of affective change to an assessment of 
the manner in which other behaviors, pre- 
sumably influenced by such emotional states, 
are affected by exposure to humor. A dem- 
onstration that exposure to humor affects 
such behaviors as would be expected under 
conditions of lowered emotional arousal would 
constitute additional empirical support for 
the tension-reduction effects often ascribed to 
humor. 

The present study was designed to assess 
the effects of humor on the academic test 
performance of subjects differing in level of 
test anxiety. There is considerable empirical 
evidence indicating that high test anxiety has 
deleterious effects on such performance (Hill 
& Sarason, 1966; Paul & Erikson, 1964; 
Sarason, 1960). If humor serves to reduce 
anxiety, as the results of studies employing 
self-report data would suggest, then it should 
theoretically be possible to improve the task 
performance of highly anxious subjects by 
introducing humor into the testing situation. 
In the present study, designed to test this 
prediction, subjects differing in level of test 
anxiety were administered a course examina- 
tion containing humorous items, and their 
performance was compared with that of sub- 
jects who received a form of the examination 
containing matched nonhumorous items. 
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TABLE 1 


MEAN PERFORMANCE SCORES AS A FUNCTION OF 
ANXIETY AND Humorous OR NONHUMOROUS 


Test Form 
Test form 
Anxiety 
Humorous Nonhumorous 

High 19.27, 17.67, 

n 33 36 
Moderate 18.08, 19.26, 

n 34 35 
Low 19.26, 19.41, 

n 38 39 


Note,—Means having a common subscript do not differ sig- 
nificantly from one another, 


METHOD 
Subjects 


The subjects were 215 Purdue University students, 
enrolled in an undergraduate psychology course, The 
sample consisted of 109 males and 106 females. 


Measure of Anxiety 


During the first week of class, all subjects en- 
rolled in the course were administered the 37-item 
Test Anxiety Scale (TAS) (Sarason, Pederson, & 
Nyman, 1968). Scores on the scale ranged from 3 
to 36 (M = 13.16, SD = 447). On the basis of their 
TAS scores, subjects were divided into low (3-10), 
moderate (11-16), and high (17-36) test-anxiety 
groups. Each TAS group contained a similar pro- 
portion of males and females, and the total numbers 
of subjects in the low, moderate, and high groups 
were 77, 69, and 69, respectively. 


Experimental Materials 


Two forms of a 30-item four-alternative multiple- 
choice examination were prepared. On the humorous 
form of the test, the stem of every third item was 
designed to be humorous in nature. The correspond- 
ing 10 items on the nonhumorous form of the test 
were equated for number of words in the stem, and 
the four response alternatives were identical for each 
corresponding humorous and nonhumorous item. An 
example of one of the matched nonhumorous (NH) 
and humorous (H) items follows: * 


(16 NH). Over the past six years Tom's behavior 
has become increasingly more disturbed. He has 
developed a delusion that somebody is controlling 
his mind, and he is also having bizarre visual and 
auditory hallucinations. Which other member of 
Tom's family is most likely to exhibit bizarre 
behavior? 

(16 H). Claiming to be a slot machine, Julius 
has been standing against a wall in a Las Vegas 
3 Copies of the humorous and nonhumorous items 

may be obtained from the first author. 
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casino for six years making bell-like sounds and 

occasionally complaining that he is being tilted. 

Which other member of Jul family is most 

likely to exhibit bizarre behavio 

Alternatives: (a) his mother; (5) his sister; (c) 

his identical twin; (d) it is impossible to make a 

probablity statement. 

As a check on the comparative humorous qualities 
of the humorous and nonhumor: ems, they were 
administered to an independent of 54 under- 
graduate psychology students, : were asked to 
rate each on a 5-point scale from ! (not at all hu- 
morous) to 5 (extremely humorous). In every in- 
stance the humorous item was rated as being sig- 
nificantly more humorous (< 01) than its corre- 


sponding nonhumorous item. 


Procedure 


The two forms of the examination were admin- 
istered to the subjects under ndard classroom 
testing conditions. The test was given midway in 
the semester, and the subjects were cold that their 
scores on the examination would count for one-half 
of their course grade. 

Within each of the three test: 
approximately one-half of the sui 
the humorous form of the test, w! 


iety groups, 
is were given 
he remainder 


of the subjects were administered the nonhumorous 
form. 
RESULTS 
The dependent variable was the subject’s 
score on the 30-item examination. The dis- 
tribution of examination scores bad a mean 


of 18.84 and a standard deviation of 3.34. 
Preliminary analyses indicated no signifi- 
cant sex differences in any of the cells, and 
the data for both sexes were combined for the 
final analyses. The mean examination scores 
as a function of level of test anxiety and form 
of the examination are presented in Table 1. 
The data were analyzed by means of a 2 X 3 
analysis of variance, and the summary table 
is presented in Table 2. The nonsignificant 


TABLE 2 
SUMMARY OF THE ANALYSIS OF VARIANCE 
or THE PERFORMANCE SCORES E 

Source 4f MS ES 
Anxiety (A) 2 16.07 1.65 
Humor (B) 1 48 ate 
AXB 2 50.27 5.17 

Error | 209 9.73 

*p <.01, 


umor indicates that the two 
did not differ in overall diffi- 
ignificant Anxiety X Humor 
found, indicating that the ef- 
n performance differed as a 
l of anxiety. Analyses of dif- 
; the means of Table 1 by 
Duncan multiple-range pro- 
1962) disclosed that in the 
condition, the  high-anxiety 
l at a significantly (p < .05) 
did either the low- or mod- 
ips. In the humor condition, on 
the other hand, the high-TAS subjects did not 
differ significantly from either the low- or 
moderate-a»:;ety groups within that condi- 
tion. Comparison of the performance levels of 
the two high-TAS groups disclosed that the 
high-anxiety group receiving the humorous 
test form red significantly higher on the 
test than did the high-TAS subjects who 
received the nonhumorous form (P < .05). 
None of the other groups differed significantly 
from one another in level of performance. 


main effect 
forms of the 
culty. A hig 
interaction y 
fects of hum 
function of ! 
ferences bety 
means of the 
cedure (Wine 
nonhumoro 
group performs 
lower level 
erate-TAS g 


A supplementary analysis was performed in 
order to compare the difficulty levels of the 
matched humorous and nonhumorous items. 
The mean score on the 10 humorous items 
Was 6.04: on the 10 nonhumorous items, the 
Mean score was 6,56. The difference was not 


significant. 
DISCUSSION 


The results of the present study are sup- 
portive of predictions derived from a concep- 
tion of humor as having anxiety-reduction 
properties. High-test-anxious subjects in the 
nonhumorous condition clearly demonstrated 
the lowered level of task performance, which 
has been reported in previous studies relating 
test anxiety to performance under achieve- 
Ment-oriented conditions (Paul & Erikson, 
1964; Sarason, 1960). On the other hand, the 
high-anxious subjects in the humorous test 
- Condition not only performed at a signifi- 
cantly higher level than did the high-TAS 
Subjects in the nonhumorous condition, but 
they also equaled the performance level of 
the low-anxious groups. A plausible explana- 
tion for this facilitation of performance in 
the high-anxiety-humorous condition is that 
the affective state evoked by the humorous 


Humor, ANXIETY, AND Task PERFORMANCE 


item content was incompatible with and served 
to inhibit anxiety sufficiently so that the 
latter did not deleteriously affect ongoing 
task-oriented behavior. The notion that humor 
affect and anxiety are incompatible with one 
another is supported not only by the afore- 
mentioned decreases in self-report anxiety 
following exposure to humor reported by 
Dworkin and Efran (1967) and Singer 
(1968), but also by Hom’s (1966) finding that 
subjects exposed to anxiety-arousing condi- 
tions rated jokes as being less humorous than 
did nonaroused subjects. 

A rather unexpected result concerns the 
relatively low level of performance of the 
moderate-anxiety group in the humorous test 
condition. While the performance level of this 
group did not differ significantly from that of 
any other group, the possibility is nonetheless 
suggested that the manner in which humor 
affects performance may be a complex func- 
tion of the level of anxiety being experienced. 
Several of the subjects in the moderate-anxi- 
ety-humorous-test condition later remarked 
that while the humorous items served to 
“Joosen them up” somewhat, they also had a 
somewhat distracting effect on them. 

The results of the present study thus sug- 
gest that the effects of humor on anxiety may 
be reflected in ongoing task-oriented behav- 
iors, as well as in the self-report affective in- 
dexes that have been employed in previous 
studies. It seems reasonable to assume that 
the manner in which humor affects the effi- 
ciency of such behaviors is a function of the 
nature of the task, the level of anxiety, and 
the characteristics of the situation. An op- 
timal drive level concept would predict that 
humor-mediated anxiety reduction would fa- 
cilitate performance only if anxiety were 
above the optimal level required for the per- 
formance of the task in question. In the 
present study, the major focus was on highly 
test-anxious subjects engaged in a complex 
task under stressful conditions, so that the 
assumption that anxiety reduction would re- 
sult in enhanced performance seems a tenable 
one. 
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ATTITUDE CHANGE, COMMITMENT, AND EGO 
INVOLVEMENT 
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Three central features of the attitude change context were investigated to de- 
termine their impact upon ego involvement in an attitude issue. These three 
potential sources of ego-involvement change are attitude change itself, indi- 
cating a position on an issue, and reading a persuasive message discussing the 
issue. Of 374 subjects, 144 were used to investigate the first two of these sources 
in a 2 X2 independent groups design. The third source and subsidiary hy- 
potheses were examined by employing the remaining subjects in contro] and 
follow-up groups. All three sources increased ego involvement in an attitude 
issue. Increased ego involvement associated with attitude change was explained 


and Sherif (1969) have summarized 
veral experiments demonstrating that atti- 
Change varies with the degree of ego 
ivolvement. The weight of the evidence indi- 
es that attitude change is suppressed when 
0 involvement is high (eg, Freedman, 
64; Miller, 1965). The relationship between 
involvement and attitude change focuses 
pon any aspect of the attitude change con- 
ext which might significantly influence the 
level of ego involvement. Rhine and Sever- 
(1970) compared the effect of persuasive 
ges upon several dependent variables for 
highly and less ego-involving issues. 
en ego involvement was high, there was 
effect of the persuasive message on atti- 
change and several other dependent vari- 
; One of these dependent variables was 
e change in ego involvment itself. Following 
> persuasive message, an increase in ego 
'olvement beyond the .001 level of signifi- 
e was found for the issue for which ego 
vement was initially low. 
- The question arises as to what caused this 
highly reliable increase in ego involvement. 
ree possible explanations are suggested by 
€ conditions of the experiment. First, since 
e less involved subjects changed their atti- 
de more than the highly involved subjects, 
process of attitude change itself might 
ease ego involvement in the issue. Second, 
JThanks are due to Gilbert Riddell for help in 
ecting and analyzing part of the data. 
quests for reprints should be sent to Ramon 
hine, Department of Psychology, University of 
rnia, Riverside, California 92502. 


by relating ezo involvement to commitment. 


ego involvement in the issue might have in- 
creased because the subjects took a specific 
position on the issue. Although the subjects’ 
own attitudes were indicated under both 
high- and low-ego-involvement conditions, a 
commitment to a fairly specific position 
probably had been made prior to the experi- 
ment for the high-ego-involvement issue, but 
not for the low-involvement issue. Third, 
simply being required to read a relevant mes- 
sage may increase involvement in the issue. 
The present research was designed to sub- 
mit these three possibilities to experimental 
analysis. 
METHOD 


Subjects and Design 


Of 374 undergraduates attending the University of 
California, Riverside, 144 experimental subjects were 
tested in a 2X2 independent groups design. The 
factors were two degrees of discrepancy between 
one’s own position and a position advocated in a 
persuasive message, and indicating versus not indi- 
cating a position on an attitude issue. The purpose 
of the discrepancy factor was the manipulation of 
the intervening variable of attitude change. A con- 
trol group of 30 subjects (Rhine & Polowniak— 
RP control) was later replicated with 56 subjects 
(RP replication). Two further groups, the impor- 
tance-only group and the irrelevant-issue group, 
contained 89 and 55 subjects, respectively. 


Procedure 


All experimental subjects received a questionnaire 
dealing with the amount of new park acreage that 
should be developed in Allentown, Pennsylvania. 
This issue was used previously by Rhine and Sever- 
ance (1970). On a 13-point scale ranging from 0 
(“no importance whatsoever”) to 12 (“very, very 
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important”), Rhine and Severance’s control group 
(Rhine & Severance—RS control), which did not 
receive a persuasive message, responded to the 
question, “How important do you consider this 
issue?” with a low to moderate mean rating of 5.02. 

A questionnaire received by experimental subjects 
consisted of four pages. The first page contained 
instructions justifying the presentation of persuasive 
messages. These instructions and anchoring informa- 
tion presented on the second page have been 
described previously (Rhine & Severance, 1970). 

Attitude change manipulation, The third page was 
a persuasive message advocating an increase of 240 
acres for one group of experimental subjects (240- 
acre group) and 40 acres for the remainder (40-acre 
group). The average position of the RS-control 
group, which received no message, was 46.17 acres. 
Forty was the position on the attitude scale closest 
to this average. Advocating 240 acres had pre- 
viously led to clear attitude change. Except for these 
numbers, the messages were identical, 

The fourth page of the questionnaire contained 
three questions, The first question asked for the 
amount of increased park acreage advocated in the 
persuasive message. Answers were given on a 13- 
point scale, consisting of equal increments of acreage 
from 0 to 240 acres, This question was placed first 
to ensure that the subjects would have the position 
advocated clearly in mind before going on. 

Indicating versus not indicating a position. The 
second question of the fourth page was received by 
some subjects, but not by others. It asked for the 


subject's own position on the same 13-point scale 
used for the first 


remaining subjects, 
Ego-involvement measure. The third question of 


the fourth Page provided th i 
jet Ns e main data of the ex- 


Ramon J. Rume AND WiLLIAM A. J. Porowwrax 


The importance-only group 
received only the question asking ab 
tance of the issue. Otherwis 
same as the RS and RP cor 
issue group was treated like 
group, except that prior t 
of the parks issue, these sul 
tude on an irrelevant issue 
fornia pass a law requiri 
motorcyclists? The importance-: 
in the same time period as the 
irrelevant-issue group was ru 
tion subjects. 


RESULTS 
Manipulation Checks 


If the subjects in the group re 
message advocating an increase of 
changed their attitude, the as 
subjects who gave their atti 
greater than those of t 
RP-control, and the experime 
whom an increase of 40 acres was ad 
Using scale positions of the 13-point é 
scale, the RS-control mean of 2.31, 
control mean of 2.73, and the 40-ac 
mean of 2.95 were all significantly 
from the 240-acre group mean of 8.0 
three ts > 6, p< .001). On the other 
the 40-acre group differed neither f 
RS-control group nor from the RP- 
group (#<1, in both cases), Hen 
attitude change manipulation was su 

The parks issue was highly stable oi 
2 years that passed between the collect 
data for the RS-control and the RP-& 
groups. The mean attitude score of 2.3 
the RS-control group did not differ 
cantly from the RP-control mean 
(t< 1). Similarly, the RS-control 
yielded a mean importance rating 0 
which was not significantly different fi 
RP-control mean of 5.60 (4 < 1). 


Ego Involvement 


Altitude change and indicating a $ 
The mean importance ratings of the 
mental subjects are given in Table 1, 
analysis of these means is shown in Té 
Following attitude change, importance 
increased significantly beyond the Ol 
While taking a position also incre 
issue's importance, the difference Da 
means, significant at the .05 level, is ™ 


TABLE 1 


ALE POSITIONS POR ATTITUDES AND FOR 
E CATEGORIZED BY INDICATING ox 
r IxptcaTING a POsrTION 


Importance 
T SiG a 2E 
Indicates a | Does not indi- 
position | cate a position 
8.03 84... |- 754 
2.98 


7.09 | 


Reading 'he message. The pos 
additional sect due simply to reading the 


message was also considered. The best com- 
parison is between the RP-control group and 
the 40-acre group for which opinions were 


given. This 40-acre group received the mes- 
sage and ihe RP-control did not; otherwise, 


these two groups are comparable: both gave 
their opinion, and their attitude scores were 
hearly the same. Consequently, the difference 
between the 40-acre group’s importance mean 
of 7.29 and the RP-control mean of 5.60 


(f=2.09, p< .05) indicates that simply 
reading the message also increases the issue’s 
Importance. 

Naturalistic replication. The RP-replication 
group was run after discussion of ecology 
had been considerably escalated both in the 
public news media and on the campus. The 
Increase in involvement associated with ex- 
Perimental manipulation of attitude change 
should also occur when attitude change is 
Produced by more naturalistic events. Since 
Comparisons of the RS- and RP-control groups 
yielded żs less than one for both attitude and 
Importance, these two control groups were 
combined and compared to the RP-replication 
Soup. The attitude mean of the RP-replica- 
tion group of 3.54 scale positions was signifi- 
tantly larger than the mean of 2.47 yielded 
by the combined RS- and RP-control groups 
(t= 241, p< 02). Associated with this in- 
‘tease in attitude, the mean importance of 
712 for the RP-replication group was also 
Seater than the mean of 5.25 of the com- 
lined RS. and RP-control groups (¢ = 3.67, 
P< 001), 

Importance-only results. The mean impor- 
tance rating of importance-only subjects was 
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7.50. This mean does not differ significantly 
from the mean of the 240-acre group (t < 1) 
and is just significantly larger (£= 1.96, 
P = 05) than the mean of the 40-acre group 
which did not specify its opinion. The mean 
of the importance-only group is also greater 
than the mean of the RP-control (¢=2.71, 
P < O01). These latter two groups are com- 
parable, except that the RP-control subjects 
gave their opinion, and the importance-only 
subjects did not. 

The upward drift found for the importance 
ratings of the importance-only subjects could 


_ be due to the sparseness of the questionnaire 


which they received. Importance-only sub- 
jects received only the anchoring paragraph 
and the importance question. Other groups 
had, in addition, either a persuasive message, 
a lengthy attitude question, or both. The 
irrelevant-issue group was run at the same 
time as the RP-replication group to determine 
if the importance-only results could be at- 
tributed to the sparseness of the importance- 
only questionnaire. Since the importance 
mean for the irrelevant-issue group of 8.73 
is greater than the RP-replication mean of 
7.12 (t — 2.98, p € 01), it appears that the 
elevated value of the importance-only mean is 
due to the content rather than the form oí 
the questionnaire. 


DISCUSSION 


The results indicate that three central fac- 
tors of the attitude change context are capa- 
ble of affecting ego involvement in an attitude 
issue. Ego involvement is associated with 
attitude change itself, with the taking of a 
specific position, and with exposure to a rele- 
vant message that does not produce attitude 
change. The relation of attitude change to 
ego involvement is potentially the most 


TABLE 2 


Two-Way ANALYSIS OF VARIANCE 
OF Importance DATA 


Source 


df MS F 
Attitude change (A) 1 | 52.26 | 7.07** 
Does versus does not indicate a 
position (B) 1 | 26.31 | 401* 
AXB 1 36 | .05 
Error 140| 6.56 


*p «.05. 
Dp c.l. 
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fundamental of these three factors. Ego in- 
volvement increases both when attitude 
change is manipulated experimentally and 
when it occurs naturally. 

The results attributed to attitude change 
are directly associated with the discrepancy 
between one’s own position and the position 
advocated, and are only indirectly associated 
with the intervening variable of attitude 
change. The main point of theoretical interest 
is the psychological mechanism through which 
discrepancy operates. There was only one dif- 
ference in the entire procedure for the subjects 
who did or did not change their attitude. 
Subjects who changed received a message con- 
taining the number 240, and those who did 
not change read an identical message, except 
that it contained the number 40. It seems 
unsatisfactory to assume that these different 
numbers by themselves—without an inter- 
vening mechanism—increase personal involve- 
ment. Demonstration of a clear relationship 
between discrepancy and attitude change in 
this and a wide variety of other research 
makes attitude change the logical candidate 
as the mechanism through which discrepancy 
operates. 

A post hoc interpretation of the importance- 
only findings is consistent with the view that 
attitude change influences the level of ego 
involvement. Freedman (1964) distinguishes 
between the two meanings given to ego in- 
volvement, either as “the general level of 
interest in or concern about an issue without 
reference to a specific position [p. 290]” or 
"interest in, concern about, or commitment to 
a particular position on an issue [p. 290]." 
Except for the importance-only group, the 
data indicate that ego involvement is tied to 
specific attitude positions, It is suggested that 
importance ratings of a vaguely held attitude 
will tend to drift upward when they are 
unanchored by a specific position. Compared 
to the importance-only group, the subjects of 
the other groups had emphasis placed on a 
more specific position by reading a persuasive 
message advocating a particular position, by 
stating a position, or by changing their atti- 
tude. When a specific position is emphasized, 
ego involvement is well related to the level of 
attitude characterizing each group. 

It is generally thought that extreme atti- 
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tudes are more resistant to chan: 
erate ones, and Carlson’s 
supports this view. Attitudes acquired by 
gradual increments or by larger leaps become 
increasingly extreme through the process of 
attitude change. It seems probable that ex 
tremity and resistance to change are related 
because ego involvement : npanies the 
attitude change which is mic to the 
acquisition of an extreme position. 

The three factors increasing ego involve- 
ment in this experiment appear to have in 
common the potential to increase personal 
commitment. Taking a position is a form of 
commitment. Changing one's attitude implies 
a commitment to a new position. And com- 
mitment could also be increased by recei| 
of information on a topic to which little 
attention had been given previously. Various 
conceptualizations of ego involvement appeat 
to have considerable overlap with commit- 
ment. Kiesler, Collins, and Miller (1969) 
stated that “any of the various operationali- 
zations of commitment might also be con- 
sidered synonymous with ego-involvement 


than mod. 
) research 


[p. 244]? Sherif, Sherif, and  Nebergall) 
(1965), Freedman (1964), and Miller (1965) 
included commitment as an ingredient of eg 
involvement. Assuming that commitment is an 
important component of ego involvement, 
attitude change by increasing commitment 
should also increase ego involvement. 
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SITUATIONAL FACTORS CONTRIBUTING TO A 
PSYCHOLOGICAL ADVANTAGE IN 
COMPETITIVE NEGOTIATIONS * 


JERRY I. SHAW 2 
San Fernando Valley State College 


xperiment examined the processes by which discrepant performance 
‘ages contribute to the development of a psychological advantage. Sixty 
lyads negotiated bets concerning which person would score highest on 
tory task in which performance was represented as skill- or chance- 
»rmined. Half of the pairs (players) actually performed the task, whereas 
e others (observers) merely watched. The scores were controlled so that one 
ver consistently scored higher than the other. The results indicate a “con- 
nce" phenomenon; that is, the subjects expected that the difference in the 
players’ future scores would be smaller than the average difference ob- 
d in the past. A related tendency for low scorers to underhandicap them- 
yielded a “high man's advantage" in the betting. Differences in the de- 
to which these phenomena occur in the several experimental conditions are 


considered in relation to possible underlying processes. 


Albert Myers (1964, 1966) has demon- 
strated that in a competitive encounter, cer- 
tain persons may gain a psychological advan- 


lage over their opponents. A psychological ad- 
vantage was defined as a condition whereby 


one “increases his probability of winning . . . 
through the acquisition or adoption of some 
attitude, frame of reference, strategy, etc., by 


his opponent or himself [1966, p. 283].” 

In the first study (Myers, 1964), pairs of 
subjects played 10 games on shuffleboard 
bowling machines and negotiated bets concern- 
ing which player could achieve the highest 
score for each game, Unknown to the subjects, 
the machines were wired so as to cause one 
Player’s scores to be consistently higher than 
those of his opponent. In forming bets, players 
Were permitted to use a handicapping pro- 
cedure as a means of making up for any ap- 
parent difference in their averages. The ad- 
Vantage appeared to lie with the more “tal- 
ented,” inasmuch as those players with higher 
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performance averages were able to exploit and 
win money from opponents with lower aver- 
ages, despite conditions allowing players to 
equate the apparent difference in averages by 
setting a proper handicap. For example, on 
those games played for even money, low-aver- 
age players accepted too few handicap points 
to offset the actual difference in performance 
levels. 

In a follow-up study, Myers (1966) asked 
subjects to predict their opponent's scores as 
well as their own. Score predictions revealed 
that both high- and low-average players ex- 
pected the lows to show marked improvement, 
whereas only modest change was anticipated 
for the highs. 

The psychological advantage demonstrated 
by Myers in the laboratory has a number of 
everyday parallels. For example, in athletic 
competition, individuals and teams may be 
reluctant to confer superiority upon their op- 
ponents, and prefer instead to describe them- 
selves as having temporarily “gone cold" or 
as having “lost their touch." A related phe- 
nomenon occurs when individuals ascribe 
their failures to *bad luck" in order to pre- 
serve their self-esteem (Aubert, 1959; Cohen, 
1960, 1964). 

These attempts to explain away differences 
between ourselves and others can have im- 
portant implications for future events. Thus, 
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Festinger (1954) commented that self-evalu- 
ation is a determinant of our choice of associ- 
ates. If one has misjudged his competence in 
a given domain, then his association with oth- 
ers more successful than himself may result in 
painful scorn or ostracism. 

The type of psychological advantage evi- 
dent in Myers’s experiments has two major 
components. The first, related to expectations, 
is the tendency for subjects to underestimate 
the future difference in their performance 
averages, relative to the past difference. The 
author shall refer to this as the “convergence” 
phenomenon, since the players’ assessment of 
the difference between their average scores 
converges to less than the observed difference. 
The second component, related to betting, is 
the tendency for low-average players to un- 
derhandicap themselves. The author shall re- 
fer to this as the “high man’s advantage,” 
since it provides an edge by which players 
can win money from their opponents. 

Although it seems reasonable that con- 
vergence operates frequently in social life, 
the conditions which foster its development 
are not yet understood. For example, do the 
two players in the Myers situation perceive 
the difference in their scores as primarily re- 
flecting unequal luck? If so, there are several 
theoretical explanations which may explain 
their convergence. Specifically, the phenom- 
enon of the gambler's fallacy suggests that 
after a homogeneous series of losses, low- 
average players might expect a significant 
rise in their levels of performance, whereas 
the high-average players might expect them- 
selves to do less well in the future. A some- 
what different explanation is implied by the 
knowledge that extreme scores will be, on the 
average, less extreme in the future. If players 
possess this “regression toward the mean" 
logic, then it could account for the expectation 
that their scores will become more alike in the 
future. 

Alternatively, there are many task situa- 
tions in which skill (not chance) is the salient 
component of performance. Will convergence 
also appear under these conditions? There are 
several explanations which lead us to expect 
convergence in this case as well. For example, 
it may be that high-average players readily 
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assume that they are performing near their 
top level of performance and that there is not 
much room for improvement. In contrast, low- 
average players may concludi it there is 
some knack which they have no: vet mastered, 
but with additional practice they are likely te 
do so. 

A different reason is sugges! y the possi- 
bility that high-average players are likely to 
conclude that the task is fair!) y, whereas 
low-average players may vi he task as 
more difficult. Several experim by Cohen 


(1960, 1964) indicate that 4 
their expected performance vy 
a task is hard, whereas they te 
it when they think a task is : 
Theories of persistence (^! 
assume that the tendency to ; 
after success has been attained 
the additional possibility that i» 
low-average players may conc 
will be a difference in how hard 
on future attempts. If so, ther 
pect low-average players to ant: 
improvement than the highs. 
Until now, we have restricted 


s overrate 
they think 
o underrate 


son, 1964) 
sist declines 
This suggests 
th high- and 
de that there 
each will try 
we might ex- 

jate greater 


our analysis 


to the players themselves and to their percep- 
tions of the task situation. Is it also possible 


servers who, 
lly involved 
is an im- 


that convergence occurs among « 
relative to players, are less perso 
in the performance scores? ' : 
portant point to consider since, f 'equently, it 
is not the performers themselves who must 
decide whether there is a difference in outputs. 
Often, observers are entrusted with this deci- 
sion, as when school administrators choose 
among college applicants or when employer 
find it necessary to promote one of a number 
of young executives. 

Since observers may utilize many of the 
same perceptions that players do in forming 
their judgments, it is expected that observers 
will display convergence too. However, there 
are several reasons stemming from the players 
personal involvement which suggest that play- 
ers, as compared with observers, will be more 
susceptible to the phenomenon. For example; 
certain emotional states may be experienc 
by one or both players whenever there i5 ? 
continued discrepancy in the level of outcome 
each produces. Heider (1958) commented on 


to avoid negative self-attributions 
they undermine self-esteem. Thus, a 
Dive for selí-enhancement may rule out for 
iss successful players the conclusion that they 
» inferior at the task. Instead, they may in- 
that their true level of competence has 
temporarily disrupted and that in the 
, marked improvement may be expected. 
Another possibility stems from the notion 
persons are motivated to present them- 
es in a favorable light (Goffman, 1959). 
low-averaze players may avoid stating 
score predictions because others might 
en label them as “pessimists” and as lacking 
"confidence, and high-average players may 
id stating high predictions to avoid being 
garded as overconfident or conceited. 
"The suggestion that observers are less likely 
‘be bothered by emotional considerations in 
ming their judgments leads us to expect 
they will be less likely to exhibit con- 
gence to the degree that the players do. 
, furthermore, since the emotional expla- 
ins are more directly relevant to a skill 
ation (where scores reflect on the per- 
attributes of the players) than to a 
hance situation, it was expected that the dif- 
ence between the players and observers 
ill be particularly evident where task per- 
mance may be primarily attributed to skill. 
Inasmuch as the circumstances which precipi- 
“tate convergence have not been examined 
empirically, and since there are theoretical 
ns to suspect that convergence does occur 
‘in a variety of natural situations, the present 
experiment was conducted. 


Overview of Research Strategy 


- Pairs of players participated in a task in 
which their performance was represented as 
either skill- or chance-determined. Perform- 
e averages were controlled to create dif- 
ferent levels of apparent success. Another 
Pair of subjects secretly viewed the players’ 
activities from an adjoining room. Between 
trials, all of the subjects (players and ob- 
servers) made estimates for each of the two 
Players’ subsequent scores, and each pair of 
subjects negotiated bets between themselves 
tegarding which of the two players would 
‘Score the higher. 


PSYCHOLOGICAL ADVANTAGE IN COMPETITIVE NEGOTIATIONS 


In formulating these bets, both pairs of. 
negotiators were permitted to use handicaps 
to offset the apparent difference in the two 
players’ average scores. These manipulations 
resulted in eight conditions, comprising a 2 X 
2x2 factorial (Skill-Chance x High-Low 
Performance X Players-Observers). 


Meron 


Task and Apparatus 


The experimental task was adapted from a pro- 
cedure originated by Shomer (1966), in which sub- 
jects are required to make a thickness discrimination, 
allegedly based on their “sense of touch.” Subjects 
attempted to select a pink chip from a closed con- 
tainer, supposedly holding both pink and blue 
chips. These chips appear white under ordinary light, | 
but when held under an ultraviolet lamp, some glow 
pink, and others glow blue. In actual fact, all chips 
were identical and covered with a special coating of 
paint which permitted the experimenter to control 
the apparent color oí the chips after the subjects 
had made their selections. A hidden programming 
device activated either a short-wave or a long-wave 
ultraviolet light in a predetermined sequence which 
caused any selected chip to appear either pink or 
blue. Using this technique, the number of pink and 
blue chips selected by each subject could be precisely 
controlled so as to create different levels of apparent 
success on the discrimination task. 

The container which held the chips was actually a 
double-load dove pan of the type used by many 
magicians. As subjects entered, they were shown an 
open container filled with cardboard chips. The cover 
for the container had an opening into which a black 
cloth sleeve had been sewn. Inside the cover was an 
inner pan, held securely by springs. When the cover 
was placed over the container, the hidden pan 
slipped into place, so that when the subjects reached 
through the sleeve, they were actually reaching into : 
the hidden pan. This pan contained another set of ] 
cardboard chips, all of equal thickness, and all coated 
with a special type of fluorescent paint. This paint 
glows pink when exposed to short-wave ultraviolet 
and blue under long-wave ultraviolet. Under ordi- 1 
nary light, the paint is nearly transparent, so that —— 
the cardboard chips appear to be white. s 

The ultraviolet lamp was mounted in an aluminum 
shield under which two independent sources of light 
were concealed.* One source emitted long-wave ultra- 


3The paint contains two fluorescent phosphers 
(PF-524 short-wave, and PF-506 long-wave) blended 
together in a lacquer base in the proportion of ap- 
proximately two parts PF-524 to one part PF-506. 
These materials were supplied by Ultraviolet Prod- 
ucts, Inc., San Gabriel, California. 

4The light sources were UVL-11 long-wave, and 
a UVL-13 combination long- and short-wave manu- 
factured by Ultraviolet Products, Inc, San Gabriel, 


California. 


violet, and the other emitted short-wave ultraviolet. 
When a switch on the lamp was prewed, an suto- 
matic program determined which of the two light 
sources would be energized. At the same time, an 
electric counter advanced with a loud click, thereby 
masking the noise of the programming device hidden 
in an adjoining room. 

Before the cover was placed on the container, the 
subjects were shown how the two kinds of chips 
appear under ultraviolet light. For this demonstra- 
tion, the chips in the outer container, from which 
subjects believed they would draw, were exposed to 
the short-wave ultraviolet source, Some of these 
chips were coated with a paint which glows pink, 


fully matched to appear the same as the other chips 
would under the two conditions of illumination dur- 
ing the trials. 


Subjects and. Design 
One hundred and thirty-two male undergraduates 


In the skill condition, the task was portrayed as 
between chips of two dif- 


allegedly varied in thickness, getting a pink 
E. & matter of skill. In the chance 
ed ee ta tiquifa i 


.. Since touch is an inadequate cue for color, getting 
a matter of chance. 


| 
gF 
: 
i 
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was determined in advance by the experimenter. The 
exact sequence of scores was generated by assuming 
a bell-shaped distribution of performance differences 
centered about a mean of 2, These mean differences 
occurred on 40% (m=36) of the trials, Perform- 
ance differences of 3 and 1 each occurred on 20% (n 
=18) of the trials, and differences of 4 and O each 
occurred on 10% (» — 9) of the trials. 

There were 15 quartets of yoked-player and ob- 
server pairs in each task orientation condition. Every 
quartet received 1 of 15 outcome schedules. For ex- 
ample, the high and low players' scores in a particu- 
lar schedule were 6 and 4, 4 and 1, 9 and 6, 6 and 4, 


A 


200; and (c) on the practice sew 
always scored the overall mean for | 
level (ie, 6 for high-performance 
low-performance players), Thus, a 
jects began the series of gambling t 
cal practice experience, As an additio 
variance of the scores made by all h 
players combined (5? — 2.00) did n 
cantly from that for the lows (s° 
dj = 89/89, p > 25). 


Procedure 


Subjects arrived at the laborator, 
four, where they were met by Exp 


jects would participate it 
of subjects (players) was d 


e 
menter 2 into an adjoining room. F 
players were unaware of any other p 


in the experiment other than themselves and Experi- 
menter 1. 

After being seated, the discrimination task was de- 
scribed to the players including the 1? css and/or 
color distinctions to be made in the sppropriate task 
orientation conditions. 

The crucial instructions to the subjec's in the skill 
condition stated: 

The task you will be performing today requires à 

discrimination based upon your sense of touch 


This container holds a special set of cardboard 
chips. Some of the chips are thicker, and some are 
thinner.® Taking turns, we would like E pui? 

to reach into the container and see if you can be 
skillful enough to choose | e thicker chips. 
Since you may use oi your fingertips to deter- — 
mine which are the thicker chips, whether or not 
you are successful on this task will depend upon 
the skillful use of your sense of touch. 


Subjects in the chance condition were instructed a$ 
follows: 


The task you will be performing today is one 
Which is based solely on chance. This container 


5' The capable assistance of Leonard I. Schwartz 
who served as Experimenter 2 is gratefully acknowl- 
edged. / 

* It was stressed that the proportion of one kind ob | 
chip to the other was changed for every pair of play- — 
I 


ers, but that it was never 50-50. The purpose of 
was to eliminate the simple assumption that a d 
of nearly 5 out of 9 could be expected by chance 


alone, 


| «et of cardboard chips. Some of 

v been coated with a paint which 

«hers have been coated with a 

w* blue. Taking turns, we would 

; to reach inte the container and 

lucky enough to choose one of 

Since you may use only your finger- 

e which are the pink chips 

vou are successful on this task will 
ictly a matter of luck. 


or chance nature of the task had 


vers engaged in a practice trial. 
cores were announced, and the 


ies for betting were enforced. First, 
$1, and, therefore, players could 
other money odds. Second, players 

o give each other handicaps if in 
these were warranted, Thus, in 

, ibe players each put up $1 and then 
th ve of handicap, if any, to be given 
The player scoring highest on each 

the handicap had been added won the 


tre to play along by placing bets with one another 
the scores to be made by the other two subjects. 
ch observer was randomly assigned to a player 


[by coin toss) on whose scores his own bets would 
fide. At this point, the sound system was turned off 
the remainder of the experiment so that observ- 
fs would ro longer be able to hear the interaction 

nthe other room, although they could still view all 
‘the proceedings therein. 


"Dependent Measures 


After the practice session and before each remain- 
trial, all subjects were given a form on which 
were required to make the following estimates: 
their own and their opponent’s scores on the 
Text trial, (b) the average score made by each 
“Player on all the preceding trials, (c) the size of 
handicap for the next trial which, in their opinion, 


ected difference between the players’ performance 
ag i aking these 
ates, the subjects relied on their 
formance, inasmuch as they did not keep records. 
- After giving these estimates, the opening bids 
Were publicly announced, and subjects were asked 


Effectiveness oj the Experimental 
Manipulations 


Task orientation. Several indications as to 
the effectiveness of the skill-chance manipu- 
lation are available. First, since it is reason- — 
able-that most subjects would assume skill- 
based performances to require Atleast some 
effort, whereas those based on chance 
quire less, it was expected that players 
skill condition would devote 


27,56 seconds (F = 322.25, 


cantly, 

A related point of interest is whether the 
two task orientations are maintained over 
trials, or if the belief in skill diminishes wi 
task experience. Although selection latencies 
declined over trials in both the skill and 
chance conditions, the difference at Trial 6 
was substantial (121.66 versus 26.43 sec- 
onds), indicating that differential orientations 
still persisted. 

Negotiator. This manipulation was expected 
to produce different degrees of emotional in- 
volvement between the players and observers. 
An item on the postexperimental question- 
naire asked the subjects to rate their concern - 
over the possibility that they (or their player) — 
might have done poorly on the chip-selection 
task. 
As expected, the observers felt neutral 
about their concern. Oddly, however, the 
players indicated that they were highly un- 
concerned. Although this seems inconsistent 
with the notion that players are apt to be 


TAN p values are two-tailed. 
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TABLE 1 


Mean PROJECTED DIFFERENCES IN PERFORMANCE 
AVERAGES IN 10 ADDITIONAL TRIALS 


Negotiator 


Task — | Performance |_ KATS 
orientation | evel 
Players | Observers 

I wl " 

; High 1.23 1.55 
3e Low | 113 143 

4 High 59 : 

Chance Low 36 85 


Note.—The average observed difference is 2.00. 


more emotionally involved than observers, the 
strong denial of concern among the players 
suggests a different interpretation. Perhaps 
defensiveness was aroused among the players 
by the implication that they might possibly 
be performing poorly. In order to ward off 
feelings of inadequacy, they derogated the 
importance of doing well by indicating a lack 
of concern. Corroboration for this interpreta- 
tion is evident in the skill condition in which 
low-performance players, who should be the 
most defensive about their scores, expressed 
less concern than their more successful op- 
ponents (t = 1.41, df = 14, p < .10). These 
data therefore suggest that important emo- 
tional differences were produced by the 
player-observer manipulation. 


Convergence in Expectations 


Amount oj convergence. Prior to every ex- 
perimental trial, the players were asked to 
estimate their own score on the next trial, 
their opponent's score on the next trial, the 
size of handicap considered fair on the next 
trial, and the projected difference in their 
performance averages over 10 additional trials. 
Observers made identical estimates concerning 
the players! scores. From these predictions, 
three separate indexes of convergence were 
generated: (a) expected score difference, (5) 
fair handicap, and (c) projected average dif- 
ference. The results from all three indexes 
were so nearly identical that it would be re- 
dundant to consider each separately. There- 
fore, data from only the projected average 
difference will be presented. 

If we recall that the observed difference in 
performance was, on the average, two chips, 
the following picture emerges from the pro- 


i 
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jected differences given in Table ! 
vergence (the projected difference 
than 2.00) occurred to some degre 
eight cells. Even in the two cell 
least convergence (observers in th 
dition), the subjects significant!) 
mated the observed difference in 
= 3.72, df = 29, p < .001). As we 
this suggests that convergence | 
determined and a relatively comple 
enon. (5) Convergence was mo:: 
the chance condition. The mean: í 
chance were 1.34 and .68, respec 
21.72, df = 1/14, p < 001). (c) The pla 
observer variable affected the degree of à 
vergence, with players expecting a) sma 
difference in averages than obse: vers (F 
9.62, df = 1/14, p < .01). (d) Although hi 
performance subjects were slight!: more 
rate in their perception of the difference 
low-performance subjects, this effect was 
reliably significant within any o! the 
major cells (F = 1.00, df = 1/14, ; 
(e) As evidence of a difference a 
over trials, estimates became more accura 
Mean estimates for Trials 1-6 were . 
97, 1.10, 1.16, and 1.30 (F = 10. / 
70, p< .005). Although some learning 0€- 
curred, even after the six trials of the pri 
experiment, the subjects were still prone 
underestimate the average observed different 
of 2.00 (t = 5.38, df — 7, p < 01). It i 
dent, therefore, that the convergence eff 
dissipated relatively slowly. (/) The only im 
teraction to reach an acceptable level of 
nificance was the Performance Level X Tri 
interaction (F = 4.30, df = 5/70, p < .005) 
A graph of this interaction is presented 
Figure 1. As contrasted with high-performa 
subjects, lows were less accurate in their es 
mates until Trial 5, whereupon they display 
greater accuracy than the highs. This sug 
that convergence may extinguish faster 
low-performance subjects than for the hig 
and quite possibly it is this fact which 
responsible for the nonsignificance of 
overall performance level effect. 
Pattern of convergence. Table 2 prese 
the subjects’ predictions for each pla! 
score on the next trial. If we recall that sē 
averages for high- and low-performance P! 
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these data reveals that the pattern of con- 
vergence was asymmetrical under skill and 
more or less symmetrical under chance. That 
is, in the skill condition, the subjects expected 
low-performance players to improve consid- 
erably, whereas under chance, they expected 
à decline by high-performance players, as 
Well as improvement by low-performance 
Players, 

To test this statistically, an index of sym- 
metry was derived by converting each sub- 
Ject’s predictions for the two players to devi- 
ations from their respective averages which 
Were then summed algebraically. Given con- 
Vergence, a symmetry score of zero reflects a 
Perfectly symmetrical pattern. Scores greater 
than zero indicate upward asymmetry, and 
those below zero indicate downward asym- 
Er Symmetry scores are shown in Table 3. 

Since the analysis of variance indicated 
d an effect for task orientation (F — 39.08, 

| — 1/14, p< 001), grand means for skill 
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1.40 
4 
1.30 
1.20 A 
* 
d ° =— Pd 
e 
i TL Pp * 
> 1:00 * 
La 
0.90 
e 
0.80 
0.70 " 
+ —— 
0.60 
0.50 
A € High-Performance Subjects 
Eos 4 Low-Perfonmince Subjects 
NER ee rN is Se 
" 1 2 4 5 6 
TRIALS 
Fic. 1. Projected differences in performance averages. 
ers were 6 and 4, respectively, inspection of (.723) and chance (—.172) were compared 


with a value of zero. These tests confirm the 
expected patterns of convergence, since scores 
were significantly above zero under skill (t = 
4.61, df= 14, p < 01) and did not depart 
from zero under chance (t= 1.69, df = 14, p 
< 20). 

TABLE 2 


MEANS or SUBJECTS’ PREDICTIONS FOR Eacu 
PLAYER’s SCORE ON THE NEXT TRIAL 


Negotiator 
Players Observers 
Task Performance 

orientation. leye Predictions for: | Predictions for: 
High | Low | High | Low 

player | player | player player 

. High 5.95 | 4.83 | 6.00 | 4.50 
Sal Low | 5.90 | 4.78 | 6.20 | 4.73 
Ch High 5.10 | 4.80 | 5.20 | 4.64 
[^A Low | 5.05 | 4.76 | 5.06 | 4.70 


Note,—The average scores actual 
low-performance players were 6.00 an 


ly attained by high- and 
d 4.00, respectively. 


po 
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TABLE 3 
MEAN SywMETRY SCORES 
| | Negotiator 
Task | Performance | 
orientation | evel 
Players Observers 
s "High | 75 50 
Skill | Low | 68 | : 
: High | -40 | —. 
Chance Low -19 | =% 


Convergence in Recall 


Aíter each trial, the subjects were asked to 
recall prior differences in performance aver- 
ages. There was no reason to expect in ad- 
vance that convergence in recall would occur, 
but the grand mean (1.78) was significantly 
below 2.00 (t= 7.85, dj = 7, p < .01). Al- 
though it was obviously weaker than con- 
vergence in expectations, it was somewhat sur- 
prising that convergence appears at all in the 
recall data. This memory bias may possibly 
be a carryover from the expectation data 
which were collected at the same time. This 
possibility deserves further attention and 
could be examined by testing for recall with- 
out requiring the subjects tọ indicate their 
expectations at the same time. 


Negotiations 


Prior to each trial, the subjects decided on 
a handicap to be used in determining a win- 
ner, Since there typically was disagreement 
regarding the size of these handicaps, the 
subjects were required to negotiate until they 
reached an agreement. Negotiations began 
following public disclosure of the opening 
bids, which are presented in Table 4. Means 
for high-performance subjects may be taken 
to indicate offers, and those for low-perform- 
ance subjects, to indicate their requests. 

Opening bids were highest in the skill con- 
dition (F = 5.04, df = 1/14, p < .05). Also, 
there is evidence of substantial disagreement, 
since requests were much larger than offers 
(F = 61.55, df = 1/14, p < .001). Of greater 
interest is the Negotiator X Performance 
Level interaction, indicating that initial dis- 
crepancy is higher between observers than 
between players (F = 13.55, df = 1/14, p< 


areik 
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.005). This finding stands out more 
we examine the size of bargaining 
tablished in the various condition 
gaining ranges” the author mea 
ence in opening bids, that is, the 
between requests and offers for 
negotiators. The average ranges, 
in Table 4, indicate that observ: 
ranges than players (F = 14.1: 
p< .005). Thus, there was str 
disagreement between the obser: 
than between the players. 

In competitive bargaining, 
of fairness are typically disregard: 
to insure a favorable outcome. Data 6 
departure of opening bids from wha: was indi. 
cated as being fair show significantly large 
deviations for observers (.76) than for play- 
ers (.23; F = 13.04, df = 1/14, # < .005) 
Thus, observers are more willing than players 
to adopt a typical bargaining st: 

Also of interest is whether ! 
performance subjects were the r 
sive. Given the initial bargaining , 
which party made the largest concessions? 
Data on the size of concessions indicate that 
concession making was fairly ar 
with both parties conceding nearly e 
tions of the original bargaining range 


High Man’s Advantage 


The negotiation data provide two sources of 
evidence on the high man's advantage: (@ 
the size of handicaps finally agreed to à 
(b) the total amount of money won. Data 0! 


TABLE 4 


Mean Size or HANDICAPS AND BARGAINING 
RANGES PROVIDED BY OPENING BIDS 


Negotiator 
went USE Players Observers 
tion level CIE € 
Handi- | Bargain-| Handi- | Bargam 
cap ingrange| cap ingran 
High -70 Jt 
Skill 52 
Low 1.22 2.10 
High -55 H 
Chance 32 a 
Low 87 1.55 


-veal the underhandicapping 
ismuch as low-performance 
task conditions willingly set- 

markedly less than the re- 

lowest average settlement 
players-chance condition 

highest occurred in the 
lition (1.28). This is an 

o that observed for the con- 


final settleme: 
phenomenon, 
subjects in bot 
tle for handi 
quired 2.00. 7 
occurred in 
(.74), where 
observers-ski! 
identical patt 
vergence data 


A more de tive statement of the high 
‘man’s advantave is given by the amount of 
money won high-performance subjects. 
Consistent v ie handicapping, the highest 
average win! occurred in the players- 
chance condition ($2.80), whereas the lowest 
occurred in observers-skill condition 
($1.60). Ove : 60 pairs of negotiators, 51 
high-perform members won at least some 


| 2 lost. Based on these data, 
the convergence phenom- 


money, 7 tiec 
it is evident 


enon strong!) ors successful competitors. 
DISCUSSION 

The most striking feature of the results is 

- the pervasiveness of the convergence effect 


over the several experimental conditions. This 
is consistent with present theoretical formula- 
tions which suggest that convergence is mul- 
tiply determined and occurs in a variety of 
life situations. 

Equally important is the fact that con- 
vergence occurred more completely and sym- 
metrically in certain situations, inasmuch as 
this suggests something about underlying 
Processes, For example, the sizable conver- 
fence in the chance condition indicates that 
luck-related beliefs are basic, and perhaps the 
Most important, determinants. Further indi- 
‘ation of this comes from the postexperimental 
questionnaire data. When asked whether suc- 
cessive runs of successes and failures would 
lead subjects to expect a subsequent change 
_ In the players’ scores, change was nearly al- 
Ways anticipated under chance, but rejected 
Under skill, Furthermore, a statement suggest- 
ing that the two players’ scores would prob- 
ably tend to even out in the long run was 
endorsed only in the chance condition. 

Although less sizable, convergence was ob- 
Served also in the skill condition. It was sug- 
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gested that other processes are responsible for 
convergence in this situation, These included 
notions that the low man has more room to 
improve through learning, and that he likely 
will try hardest on future attempts. It is nota- 
ble that these notions predict a symmetrical 
pattern. Since the patterns of convergence in 
the skill and chance situations were consistent 
with these predictions, it seems clear that the 
processes which cause convergence differ ac- 
cording to the nature of the task. 

Further indication of this comes from ques- 
tionnaire data. In the skill condition, the sub- 
jects agreed with the statement that their 
scores could be improved through learning, 
whereas they disagreed under chance. When 
asked to indicate how much improvement 
might be anticipated, the subjects in the skill 
condition reported that low-performance 
players could expect significantly more im- 
provement (about .36 chips) than high-per- 
formance players, whereas this differential 
improvement was not expected under chance. 
Analogous data were obtained for a question 
on trying. 

The fact that convergence was more promi- 
nent among players than observers suggests 
that (a) certain emotional processes are trig- 
gered by actual participation which are absent 
or minimized in observation, or (5) other 
processes are involved but the effects are in- 
tensified for the players. The analysis based 
on emotional differences is partially supported 
by what seems to be a guarded denial of con- 
cern by players, especially low-performance 
players. More impressive evidence comes from 
the bargaining phase of the interaction, where 
players tended to minimize initial disagree- 
ment more than observers, to negotiate over 
narrower ranges, to align themselves more 
closely with considerations of fairness, and 
generally to behave as though they were not 
eager to dwell on the difference between their 
scores and those of their opponent. This set of 
findings suggests that players experience a 
degree of emotional discomfort not present 
among the observers. 

The analysis which suggests that both play- 
ers and observers use other processes in form- 
ing their convergence but differ in the degree 
to which they are susceptible to these proc- 


3 
| 
| 
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esses is supported only in the chance condi- 
tion, where players were more prone to be- 
lieve that chance would even things out than 
were observers. None of the other question- 
naire data yielded such differences. 

It was anticipated that the emotional dif- 
ferences between players and observers 
would produce an interaction effect with dif- 
ferential convergence for players and observ- 
ers, evident primarily in the skill condition. 
Instead, we are faced with the fact that play- 
ers display stronger convergence regardless of 
the nature of the task. It seems possible that 
emotionality may account for this difference 
in the shill condition, whereas higher accept- 
ance of luck-related beliefs may account for 
it in the chance condition. However, even this 
explanation cannot account for the apparent 
unwillingness of players in both task condi- 
tions to adopt a firm bargaining stance. 

Theoretical interpretations aside, the fact 
that observers are less prone to make judg- 
mental errors such as convergence has practi- 
cal value for decision makers and others who 
must choose among individuals with varying 
levels of talent. 

As a final point to consider, we might again 
call attention to the fact that convergence of 
the sort studied here does not seem to be 
readily extinguishable. Even though the effect 
is muted after increasing evidence of a dif- 
ference over trials, convergence is nonetheless 
present to a significant degree at the experi- 
ment’s end, In light of this, an interesting 
path to pursue would be to study what might 
happen when, and if, extinction of the misper- 
ception finally occurred. On the one hand, it 
is possible that the disadvantaged parties 
would discard the belief that their outcomes 
will improve and begin to seek future compen- 
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sation by pressuring their opponents for 
larger handicaps. On the other ha^c, a second 


possibility is that the disfavored parties would 
not accept the difference in outcomes as being 
of spontaneous origin, but rather as due toa 


biased system which has been rigged against 


them. If so, they might then seek retribution 
from their opponents who have, in their view, 
taken advantage of them. In pursuing this 
possibility, it is interesting to note that our 
data suggest that these retributive claims may 
go unrecognized by the advantaged parties, 
inasmuch as low-performance subjects appear 
to extinguish their convergence more rapidly 
than high-performance subjects (see Fi 
1). 
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PER!ORMANCE IN CONCEPT IDENTIFICATION AS A 


FUNCTION OF COGNITIVE COMPLEXITY ' 


GARY A. SCHNEIDER ax» LEONARD M. GIAMBRA * 
Miami University 


cses derived from integrative complexity theory were tested in a 
t identification task preceded by thorough concept training. Since lew 
suficient amount of information was presented, the subject determined 
ditional information he needed to identify the concept. In accordance 
the theory, higher complexity subjects obtained information most eth- 


dy and made fewer errors than subjects of lower complexity. Expected 


rences in the variety of concept rules utilized by subjects at different 
ative complexity levels in the concept identification task failed to ma- 
ize. This was attributed to the overtraining of subjects in the 10 concept 
prior to the experimental phase, and hence a second study was carried 


information and in error rate. 


5), in addition to Schroder, 
eufert (1967), felt that a cog- 
hould be considered complex 
y when it reflects a high degree 
differentiation and the elements are inte- 
ted hierarchically by relatively extensive 
of relationship. Others (e.g, Clayton 
ickson, 1961; Kelly, 1955) have defined 
Dmitive structure solely in terms of its de- 
of differentiation, Schroder et al., in 
theory of integrative complexity, how- 
tr, stated that although the number of di- 
sions is not necessarily related to the inte- 
ve complexity of the conceptual struc- 
, the greater the number of dimensions, 
lat is, the greater the amount of differentia- 

, the more likely is the development of 
Integy atively complex connections among the 
mensions. Low integrative structure is simi- 
Tat to a hierarchical information integration 
in which the connections among dimensions 
| This study is based in part on a thesis presented 
ty the first author to the faculty of Miami Uni- 
"sity in partial fulfillment of the requirements for 
om master’s degree. 

The authors gratefully acknowledge the assistance 
^ Carol Ummel Lindquist, Paul Sutcliffe, and Gary 
dund in the collection of the data in the reported 
De EN They would also like to thank Patrick 
ET in for his comments regarding this paper. 

^, “quests for reprints should be sent to Leonard 
Giambra, Department of Psychology, Miami 
*Wversity, Oxford, Ohio 45056. 


ith no pretraining on concept rules, In thís replication, lower complexity 
cts used the smallest variety of concept rules on the problem with the 
t concepts possible. However, there were no significant differences among 
iduals of different integrative complexity levels in the efficiency of proc- 


are fixed. The ability to organize alternative 
sets of connections (i.e., new rules) is lacking, 
and hence there is inflexibility in the system. 
Thus, this type of conceptual structure can 
have a small or large number of parts (low 
or high differentiation), but the relationships 
between the parts are relatively static. 
Schroder et al. hypothesized that the indi- 
viduals with this type of cognitive structure 
filter out many of the aspects of the environ- 
ment. Some of the familiar behavioral pat- 
terns generated by low integrative structures 
in Schroder et al.’s theory are categorical 
black-white thinking, minimizing of conflict, 
and the control of behavior by external stimu- 
lus conditions. 

Highly integrative individuals have more 
schemata, that is, ways of organizing observa- 
tions, for forming new hierarchies with the 
imposition of new external conditions. These 
new hierarchies are generated as alternate 
perceptions or further rules of comparing 
outcomes. The key concept for understanding 
this structure is its capacity for being flexible 
in accepting new information. The complex 
structure has more degrees of freedom in 
which to operate and is more subject to 
change as complex changes occur in the en- 
vironment. 

Studies, usually with military game situa- 
tions (Schroder et al., 1967; Streufert, Sued- 
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feld, & Driver, 1965; Suedfeld & Streufert, 
1966), have all supported the notion that 
integratively complex individuals integrate 
information more efficiently than integra- 
tively simple individuals. Streufert et al. also 
reported that complex individuals made bet- 
ter use of novel information; this finding was 
interpreted as representing a greater degree of 
flexible differentiation and integration. Streu- 
fert et al. also demonstrated that “simple” 
individuals sought less information than 
“complex” individuals when the amount of 
information given was increased. 

The use of military game situations as a 
vehicle for the study of integrative complex- 
ity contains a major methodological weak- 
ness, its inexact nature. That is, it is open 
ended, and the experimenter is to a large 
extent unable to exactly specify the amount 
of information contained in the stimulus ma- 
terial. In addition, the experimenter is unable 
to specify exhaustively the alternative formu- 
lations for connecting the stimulus material, 
that is, the rules by which the stimulus ma- 
terial may be related. Thus, the military game 
situations can provide only a crude indicant 
of the amount of information processing that 
the subject actually performs. 

Concept identification has been another ap- 
proach to the investigation of cognitive 
processes. The unlimited memory condition in 
concept identification keeps all previously 
presented stimulus instances before the sub- 
ject. This condition is an effective method 
for minimizing the influence of memory in 
concept identification and for maximizing the 
influence of the information-processing and 
integrative abilities of the subject. Because 
the concept identification paradigm allows 
complete control over the dimensional extent 
of the stimulus universe and of the com- 
ponents of the universe, it can exactly specify 
and actively control the alternate formula- 
tions of that universe, that is, the possible 
rules for connecting the components of the 
stimulus universe. Hence, the concept identifi- 
cation experimental paradigm was used in an 
experiment as a device for investigating hy- 
potheses stemming from the integrative com- 
plexity theory of Schroder et al. (1967) and 
previous research in the area of integrative 
complexity. 
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Specifically, the hypothese 
(a) Integratively complex 
obtain information more effi 
will make fewer errors on « 
tion problems than integratively si 
viduals. This hypothesis wa 
upon the implied greater 
gratively complex individua 
the task in concept identiñcation 
for a manipulation of man; 
tual rule formulations. (5) In 
tification problem, the subj: 
mation about the exempla: 
more stimulus instances and : 
mation about the exemplar status 
maining stimulus instances. | 
sized that high-integrative-c: 
viduals would seek out ini 
more stimulus instances than low-i 
complexity individuals and that 
would be accentuated as the initia 
tion load (i.e, the numb 


known) increased. This hypothesi ] 
upon the opposing tendencies of low- and 
integrative-complexity individuais 
closure and minimize conflict 
finding of Streufert et al. (1965). (c) dm 
gratively complex individuals wil! utilize 
concept rules than individuals who are ii 


cated upon the asserted greater thi 
flexibility of the integratively compl 
viduals. (d) Any individual difference 
concept identification would be better 
dicted by differences in individuals om 
sures reflecting integrative complexity 
by differences on measures reflecting difi 
tiation. This hypothesis has as its basis 
previously alluded-to notion that measure 
integrative complexity incorporate i 
tion-processing capacity, whereas the 
sures of differentiation do not. 


EXPERIMENT I 
Method 


Acquisition of subjects. Two measures were 
of each subject to determine his level of “cor 
ity.” The Sentence Completion Test? (Streu 


3A Pearson product-moment correlation 
cient of .73 was obtained for scorer reliability 
the two judges (GAS was a judge) who rated 
plexity scores on the Sentence Completion 


| Schroder, 196 | the written version of the Ob- 
ject Sorting 7 layton & Jackson, 1961) were 
administered | total of 99 male and female sub- 
jects in under te psychology courses at Miami 
University. © h of these measures, 45 subjects 
were classifie low-, high-, and middle-com- 
plexity grou; classification was done on the 
basis of the lov cores (15 subjects), the highest 
ors (15 and the most intermediate 
sores (15 su Twenty-seven of the subjects 
were classific th measures; thus 63 subjects 
were used. 
Procedure o 
carried out ir 
tion training, 
mental task | 
After the su 
Completion T 
they participa! 
identification ! 
10 concept rule 
M instances ( 
1 to 81, drav 
22 X 28 cm. s 
| one another | 
dimensions were 
fers on the fi 
(triangle, circle 
cards (horizontal 
the 81 cards : 
| the four dimensions. 
The 10 conc rules (Haygood & Bourne, 1965) 
kamed in the training sessions were affirmation, ne- 
gation, conjunction, alternate denial, inclusive dis- 
ction, joint denial, conditional, exclusion, bicon- 
ditional, and exclusive disjunction (presented in that 
order; see Haygood & Bourne, 1965, for a descrip- 
| lion of these conceptual rules). The concept rules 
Tpresent three levels of complexity: Level I being 
the lowest (affirmation and negation), Level III 
being the highest (biconditional and exclusive dis- 
junction), and Level II was intermediate (see Neis- 
set & Weene, 1962). Each concept rule was illus- 
trated by both a real-life example (an example for 
ation was “Vertebrate: must have a backbone”) 
and an example relative to the characteristics of the 
(the example for affirmation was “squares”). 
"ead problem for each concept rule had a list of 
i E numbers that were exemplars of the concept, 
E the unlisted cards being nonexemplars of the 
owl Each period with a particular concept rule 
^v of two sample and at least four training 
ide lems, The subject's task in each problem was to 
d the correct unknown concept (i.e, two rele- 
t attributes that were combined according to 
e stated conceptual rule). Each concept rule train- 
period was completed when the subjects identi- 
Y correctly the two unknown relevant attributes 
DT four out of five problems. 
m. the condusion of the concept identification 
m ig sessions for all 10 concept rules, a subject 
ES perform in a complete learning task. In 
Dlte learning, the subject had to identify the 


ibject tasks. The experiment was 
phases: (a) concept identifica- 
mplete learning, and (c) experi- 
the concept identification task). 
were classified by the Sentence 
nd/or the Object Sorting Test, 
one or two self-paced concept 

g sessions in which they learned 
The stimulus material consisted of 
, numbered consecutively from 
an orderly arrangement on two 
of paper. The cards differed from 
many as four dimensions. These 
umber of figures (1, 2, or 3), let- 
(W, X, Z), shape of the figures 
square), and background of the 
ertical, or diagonal stripes); thus, 
nted all possible combinations of 
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relevant attributes and the unspecified conceptual 
rule on each problem. The performance criterion was 
a correct identification of attributes and rule in 9 of 
10 consecutive problems. 

Following the completion of complete Jearning, 
each subject participated in the concept identification 
task. The concept identification task contained four 
problems, the first two being practice problems, Each 
problem of the concept identification task consisted 
of a “load” condition followed by a “search” con- 
dition. In the load condition each subject was pre- 
sented the numbers of either two or four stimulus 
cards. Each subject received both load conditions. 
The presentation order of the problems was balanced 
across subjects for the two practice problems and 
for the two experimental problems, Hali of the 
cards of a load condition were exemplars, and half 
were nonexemplars of the concept to be identified. 
After receiving his load-card information, the sub- 
ject then listed all the concepts that he thought were 
possible, given the information obtainable from the 
cards referred to in his load. All 10 concept rules 
could account for the information presented in the 
load condition; however, 31 specific concepts were 
possible with the four-card information load, while 
119 specific concepts were possible with the two- 
card information load. Thus, the two-card load 
condition presents more uncertainty than the four- 
card condition because it provides more alternatives 
and also requires more additional information to 
solve the problem. Recall that a specific concept can 
differ from another concept either because it utilized 
a different conceptual rule or because the rule com- 
bines a different set of relevant dimensions or both. 
After the listing of concepts, the subject chose from 
the remaining stimulus cards all the cards about 
which he wished additional information. That is, the 
subject chose from the 77 or 79 stimulus cards not 
given in the load condition those cards about which 
he wished to determine exemplar status. In concept 
identification the exemplar status (yes or no) of a 
stimulus instance entirely determines the information 
obtainable from it. Thus, the more cards selected, 
the more information gained. From the receipt of 
the status of the cards selected, the subject at- 
tempted to identify the two unknown relevant at- 
tributes and the concept rule by which they were 
combined. Subjects were not told the correctness of 
their identifications. The tempo of all sessions was 


subject paced. 


Results 

A one-way analysis of variance (ANOVA) 
across complexity levels was performed for 
two measures (minutes to complete concept 
identification training and minutes to com- 
plete complete learning) to test the possibil- 
ity that effects of complexity on concept 
identification measures may have been present 
prior to these tasks but eradicated by them. 
A one-way ANOVA was also performed to 
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TABLE 1 


Means WITHIN COMPLEXITY GROUPS FOR Four 
MEASURES REFLECTING DIFFICULTY IN 
TRAINING, COMPLETE LEARNING 
(CL), AND CONCEPT IDENTI- 

FICATION Task (CIT) 


SESSIONS 
| | Min. ( ll wr 
Complexity.) Minto | Minto | inciuding | ees" 
classi- complete complete practice) problems 
fication | training | cL Piracy ‘of CIT) 
scr | | 
Low | 191.13 53.60 87.00 1.13 
Middle | 190.47 54.67 75.86 .53 
High 170.47 51.40 85.46 AT 
OST 
Low 192.53 5443 80.47 1.00 
Middle 191.87 50.40 82.00 -87 
High 183.80 51.13 85.67 AT 


Note,.—SCT = Sentence Completion Test, OST = Object 


Sorting Test. 


test the effect of complexity level on the min- 
utes to complete the concept identification 
task and the number of errors for both prob- 
lems of the concept identification task (Hy- 
pothesis a). An error was defined as an in- 
correct concept guess after the load- and 
search-condition stages. See Table 1 for the 
mean of each complexity group on each of 
these four measures. Only with the Sentence 
Completion Test complexity classification 
method was a significant effect obtained. The 
complexity level as measured by the Sentence 
Completion Test had a significant effect (F 
—3.7, df = 2/42, p< .05) on the total 
number of errors for both problems of the 
concept identification task. From Table 1 we 
see that the means of the low-, middle-, and 
high-complexity groups were 1.13, .53, and .47 
errors, respectively, thus supporting Hypothe- 
sis a. The Newman-Keuls post hoc compari- 
son test did not reveal any significant (p > 
.05) pairwise differences. As a further test of 
the “efficiency” component of Hypothesis a, 
one-factor ANOVAs across complexity levels 
were performed on the number of subjects 
reaching the necessary and sufficient infor- 
mation state in the two- and the four-card 
load conditions of the concept identification 
task. Chi-square analyses were also per- 
formed, but no significant (p > .05) effects 
were found. This necessary and sufficient in- 
formation state was obtained by the subject 
choosing additional cards in the search condi- 
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tion, such that all incorrect 
eliminated. Using the ANOVA, the 
to complexity was significant (F 
= 2/42, p < .05) only when t 
load was two cards in the Sentence 
tion Test complexity classification 
The number of subjects in the low, 
and high groups of the Sentence Co 
Test complexity classification group 
reached the necessary and sufi 
tion state in the two-card load w 
and 12, respectively. The Newman 
revealed that the low-complexity 


i 


reached the necessary and suficie 
tion state significantly (p < .05 


than the middle group. Also t 
load condition, a mino i es 
formed on the number of stfoiccis i 


the three complexity levels of t 
Completion Test complexity cla 
group whose number of selected cards ( 
condition) fell below the group median | 
who did not reach the necessary and si 
cient information state. The number of 
jects was 6 in the low-com; i 
in the middle group, and 3 in t! 
The probability of the occurrence of 
more extreme distributions was .017 (i 
than 2 times out of 100). Thus, this dis! 
tion was significantly different from 
pected distribution of an equal numb 
subjects in each complexity group. This ir 
cates that not reaching the necessary s 
sufficient information condition for tht 
group was to some extent due to the low 

ber of cards selected. There was à total o 
errors in the concept identification task, 
17 subjects were in the necessary and 

cient information state, while 15 sub) 
were not. Eight of 17 errors in the low » 
tence Completion Test complexity level, 
8 errors in the middle Sentence Co! 
Test complexity level, and 3 of 7 errors 
high Sentence Completion Test compe 
level were committed by subjects who re 
the necessary and sufficient information 
This distribution of errors within Set 
Completion Test complexity levels seemec 
indicate that errors were not entirely a T€ 
of not reaching the necessary and S 
information condition. 


VA with repeated measures 
(Winer, 1962) was used 
e Completion Test and Ob- 
classifications to test for 
the effects of complexity 
ion load on the following 
„pendent measures obtained 
tification task: (a) Num- 
ber of cards en—the number of cards 
about which a subject asked information in 
the search co n; (b) number of concepts 
—the number concepts that a subject 
thought wer sible after being presented 
the informat ad (two or four cards); (c) 
| mumber of —the number of different 
concept rule: bject used in his listing of 
concepts after being presented the informa- 
tion load; (4 rrect number of concepts— 
the number cepts a subject used (after 
being preser the information load) that 
were not inconsistent with information gained 
in the load condition; (e) correct number of 
tules—the number of concept rules a subject 
“used correct! hat is, with correct relevant 
attributes af ,eing presented the informa- 
tion load; (/) percentage of correct concepts 
—the ratio of Z to b; (g) percentage of cor- 
Tect rules—the ratio of e to c. The analysis 
involving the number of cards chosen (De- 
pendent Measure a) was performed to test 
Hypothesis 5. The analysis involving rules 
(Dependent Measures c, e, and g) was per- 
formed to test Hypothesis c. A failure to find 
an effect of complexity on these and the 
Previously described dependent measures us- 
ing the Object Sorting Test complexity classi- 
fication and fmding an effect of complexity 
Using the Sentence Completion Test com- 
Plexity classification would support Hypothe- 
sis d. Thus, the finding of a significant effect 
on number of errors due to complexity within 
the Sentence Completion Test complexity 
| Gasification group supported Hypothesis d. 
The analyses involving the dependent mea- 
Sures referring to the number of concepts 
Measures b, d, and f) were performed to 
explore additional possible complexity level 
effects, 
oy main effect of information load was 
ys cant for the following measures (df — 
io; ): For the Sentence Completion Test 
/ "iplexity classification group—number of 


5 
[i 


even additio 
in the conce; 
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concepts, F = 37.92, p< 001; number of 7 


rules, F = 55.64, p < 001; correct number 
of concepts, F = 55.64, p< .001; correct 
number of rules, F = 15.13, p < .001; per- 
centage of correct concepts, F = 11.35, p < 
.01. For the Object Sorting Test complexity 
classification. group—number of concepts, F 
= 36.93, p< 001; number of rules, F = 
5.29, p < .05; correct number of concepts, F 
= 92.64, p < .001; correct number of rules, 
F = 5001, p < .001; percentage of correct 
concepts, F = 21.35, p « .001; percentage of 
correct rules, F = 9.08, p < .01. The infor- 
mation-load means for the significant meas- 
ures in the ANOVA are given in Table 2. 

As indicated in Table 2, all significant dif- 
ferences for information load were in the di- 
rection of higher means for the two-card load. 
The main effect of complexity level was non- 
significant (p > .05) on all of the seven de- 
pendent measures in both the Sentence Com- 
pletion Test and Object Sorting Test com- 
plexity classification groups. Also, the inter- 
action between information load and com- 
plexity level for all the measures in the Sen- 
tence Completion Test and Object Sorting 
Test complexity classification was not signifi- 
cant (p > .05). Thus, neither Hypothesis 5, 
c, nor d was supported by the pattern of sig- 
nificant effects on these seven dependent meas- 
ures. 

Pearson product-moment correlations were 
computed using the 99 subjects who com- 
pleted the Sentence Completion Test and Ob- 
ject Sorting Test to assess the relationship 
between these two complexity measures. The 
correlation between the Sentence Completion 
Test and the Object Sorting Test was 04 (N 
— 99, p> .05). In addition, no relationship 
of Sentence Completion Test or Object Sort- 
ing Test score to intelligence was obtained. 
American College Test intelligence scores 
were available for 90 of the 99 subjects who 
completed the Sentence Completion Test and 
the Object Sorting Test. Correlations were .13 
between the Sentence Completion Test and 
the American College Test and .01 between 
the Object Sorting Test and the American 
College Test. Neither correlation was signifi- 
cant (p > .05). 

To explore further the relationships among 
all variables of this experiment, Pearson 
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TABLE 2 


MEANS WITHIN Four-Carp AND Two-Carp PROBLEMS FOR Seven DEPENDENT Meas 
or THE CONCEPT IDENTIFICATION Task 
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SCT complexity classification group OST complexity classif roup 
Measure Complexity Complexity 
Low Middle High M Low Middle | Hig M 
| 
No. cards chosen | 
Four 28.7 344 34.7 32.6 30.1 31.0 3 314 
Two 30.9 42.1 33.6 35.5 36.8 325 | 3 35.2 
M 29.8 38.3 34.2 34.1 33.5 31.8 33.3 
No. concepts | 
Four ; 14.5 7.3 9.9 9.6 10.9 10.1 | 112 
Two 17.5 17.1 22.3 19.0 23.2 16.9 1 194 
M 14.5 12.2 16.1 14.3 17.1 13.5 | 1 15. 
No. rules | 
Four 5.5 49 5.9 5.4 6.0 $4 ; 
Two 6.3 6.3 6.4 6.3 6.9 6.0 1 (64 
M 59 5.6 6.2 59 6.5 5.6 ( 61 
Correct no. concepts = 
Four 6.1 5.0 6.8 6.0 5.0 5.7 T: 5.5 
Two 14.2 12.9 18.6 15.2 16.1 14.0 i1 | 148 
10.2 9.0 12.7 10.6 10.6 9.9 10:1 | 10.2 
Correct no. rules 
Four 45 42 49 45 4.5 43 46) 4$ 
Two 5.3 5.6 6.2 5.7 6.1 5.4 5 | 5.7 
M 4.9 49 5.6 54 5.8 4.9 4^ | 5.0 
% correct concepts Y 
Four 62 75 66 68 57 70 59 62 
: Two 78 79 81 79 76 82 79 79 
M 70 77 74 74 67 76 69 71 
% correct rules 
Four 83 88 84 85 77 88 19 | 8&8 
Two 83 87 95 88 87 91 94 91 
M 88 88 90 87 82 9 8 | 86 
| 


Note,—SCT = Sentence Completion Test, OST = Object Sorting Test. 


product-moment correlations were computed 
among the following variables: (a) the Amer- 
ican College Test, (5) the Sentence Comple- 
tion Test, (c) the Object Sorting Test, (d) 
minutes to complete concept identification 
training, (e) minutes to complete complete 
learning, (f) minutes to complete the concept 
identification task, (g) number of errors in 
the concept identification task (total of both 
problems), (4) number of cards chosen in the 
concept identification task, (7) number of con- 
cepts in the concept identification task, (j) 
number of rules in the concept identification 
task, (k) correct number of concepts in the 
concept identification task, (/) correct number 
of rules in the concept identification task, 
(m) percentage of correct concepts in the 
concept identification task, and (n) percent- 
age of correct rules in the concept identifica- 
tion task. Correlations were computed sepa- 
rately for each load condition for the last 


seven concept identification. task measures 
Table 3 contains the correlation matrices 
Only 56 of the 63 subjects were used in these 
correlations because American College T 
scores were not available for 7 subjects. I 
each load condition most of the significant 
correlations were among the dependent me 
sures of the concept identification task (es 
cluding the number of cards chosen) and ‘the 
correlations of these measures with the time 
necessary to complete the various concept 
tasks, There was a significant negative corre 
lation (r = —.29, p < .05, n = 56) betwee 
the Sentence Completion Test scores and tht 
number of errors in both load conditions ° 
the concept identification task, providing 
further support for Hypothesis a. The corre 
lation of number of cards chosen in the co™ 
cept identification task and the number 9 
errors on both problems of the concept iden 
tification task was significant in both lo 


ons (four cards—r = —.31, p < .05, 
= two cards—r = —.35, p< 01, n= 
the information load of two cards, 
merican College Test scores correlated 
atly (P < .05) with the number of 
in the concept identification task (r 
= 56), the number of correct con- 
in the concept identification task 
= 56), and the percentage of cor- 
pt rules in the concept identification 


thesis a. This hypothesis was sup- 
to the extent that those individuals 
th and middle integrative complexity 
errors (wrong concept guesses) on 
)bdlems of the concept identification 
in those individuals with low integra- 
plexity, and significant negative cor- 
were noted between the Sentence 
tion Test level and the number of er- 
0, in the two-card load condition, with 
uncertainty, higher complexity indi- 
especially those in the “moderate” 
) group, were more efficient than low- 
city individuals in obtaining the appro- 
nformation to reach the necessary and 
information state, where the number 
ible concepts was reduced to one. This 
the case in the four-card load with its 
uncertainty, where only 31 concepts 
possible. Thus, the two-card load condi- 
equired more additional information to 
the problem than the four-card load 
ution. These results can be seen as a con- 
nce of an individual of low complexity 
less able to generate all the concepts 
sible so as to select appropriate cards 
to the necessary and sufficient in- 
state. Perhaps the individuals with 


*** p < 001. 


TABLE 3 
Pearson PRODUCT-MOMENT CORRELATIONS OF 14 VARIABLES FOR THE INFORMATION LOADS OF Four CARDS (LOWER TRIANGLE) 
** p <01, 


AND Two CARDS (UPPER TRIANGLE) 


*p «.05. 


t e 

- Cognitive structure (as measured by d Eg d EN 
Sentence Completion Test), having a 3 49 E n9 mg 
y to avoid delay, to seek closure, and it 223 iu 293 ES 
out many aspects of the environ- alee ae eed H 5.8 $$23 
only asked about cards relevant to the 31/53 i Sa atores íz 
S upon which they focused. Because s if PE: GE AE gg ee 
tive-complexity subjects did not take $ be fs eso eege es 

le concepts into account, this expla- 8 i^ ziii 
Would account for these individuals EPifiBisssoocs g 8 

g the necessary and sufficient informa- Speer Eid 


the fewest number of times. The 


PERFORMANCE IN CONCEPT IDENTIFICATION 267 


Note.— Decimal points have been omitted. 
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individuals with “high” complexity, who are 
theoretically able to generate more possible 
concepts, may have found themselves in the 
position of having too many possibilities with 
which to cope, thus causing them to select 
cards more inefficiently than those in the 
intermediate range of integrative complexity. 
The individuals in the “middle” complexity 
level may have reached the necessary and suf- 
ficient information state more often because 
they generated an optimal number of possible 
concepts, that is, a number that they could 
integrate but one that was greater in number 
than the low group, so as to be able to select 
cards most efficiently. 

Hypothesis b. The integratively lesser com- 
plex individuals asked for approximately the 
same number of cards as the greater-complex- 
ity individuals in the two- and four-card load 
problems, a disconfirmation of the second hy- 
pothesis. However, even though the number of 
cards selected was nearly the same in the two- 
and four-card loads, the number of lesser-com- 
plexity individuals who reached the necessary 
and sufficient information state was, as previ- 
ously referred to, much less in the two-card 
load condition; hence, their choices were not 
as efficient as the more “complex” individuals. 
The military situation of Streufert et al. 
(1965) and the concept identification task 
may not be comparable on the concept of in- 
formation loads because of the difference in 
the nature of the two tasks. In a concept 
identification task, more load cards logically 
reduce the number of possible concepts which 
could account for the exemplar status of the 
load cards, since there was only one “correct” 
concept in each problem. However, in the 
military game situation, more information 
increased the number of possible alternatives. 
Thus, the two-card load condition may have 
been more properly designated as the high- 
load condition since a greater number of al- 
ternative concepts needed to be considered 
relative to the four-card load condition. Thus, 
the subject faced greater uncertainty in the 
two-card load. 

Hypothesis c. A significant difference was 
found between the number of concept rules 
utilized in the two- and four-card loads. There 
were more concept rules as well as more cor- 
rectly utilized concept rules used by all indi- 
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viduals in the two-card load tha 
card load. However, there wer: 

in the concept rule utilization a 
ity groups in either complexi: 
Also, there was no difference 
tion of rules; that is, no compl: 
particular rules more frequent!) 
significant differences amo 
groups (as classified by the Sen 
tion Test) on the number of : 
stated as possibilities was in di: 
to the third hypothesis based 

et al.’s (1967) information-pr: 
However, it is possible that an; 
utilizing concept rules were hi 
training on the very stringent < 
fication training and complete |: 
Replication of this experiment :! 
with an easier performance crite 
concept identification training at 
learning tasks or should only use thi 
information task without the prior bw 
this second alternative has been ! 
reported as the replication experiment. 

Hypothesis d. The somewhat re 
of the Sentence-Completion-Test- 
plexity classifications to produc 
effects on dependent measures í 
identification experiment produce 
support for the presence of additional informa- 
tion-processing elements in the cognitive struč 
ture as described by the integrative complex- 
ity theory. These elements were not taken into 
account by the differentiation theories of com 
plexity as reflected in the Object-Sorting-Test? 
based complexity classifications. 

The presence of a nonsignificant correla- 
tion (.13) between the Sentence Completion 
Test and the American College Test (measure 
of intelligence) demonstrates that all of the 
previously mentioned differences in the con 
cept identification task among individuals of 
varying integrative complexity levels cannot 
be accounted for purely on the basis of intelli- 
gence. Correlations between the Sentence 
Completion Test and intelligence of 42-4 
were reported by Schroder et al. (1967). 
With the two-card load, significant correla” 
tions of several concept identification t 
measures (number of concepts, number 9 
concept rules, and percentage of correct con- 
cept rules) with the intelligence measure wert 


icant correlation of the 
t and the aforementioned 
task measures suggests 

; the more alternatives 
elligence was related to 
formation-processing sys- 
to perceive the logical 

not related to the qual- 

ı processing (number of 
guesses). There would 
more of a strain on the 
SS cause the two-card load 
ficult than the four-card 
more alternatives were 

ce Completion Test level 

the quality of informa- 

nber of errors); thus, in- 

hly correlated measures, 

ot, intelligence and inte- 
both seem instead to be 

s in decision making. This 
ionship could explain the 
nation in the more diffi- 
From this viewpoint, intelli- 
the ability to perceive and 
* elements, while integrative 


The 


ix 


novel information efficiently and to the 
of decision making. 


EXPERIMENT II 

he present experiment was conducted as 
lication of Experiment I. This replica- 
Omitted the concept identification train- 
‘Sessions and participation in the complete 
ig task because it was hypothesized that 
differences in the use of concept rules 
E individuals of differing integrative 
ity were masked by overtraining on 
identification training and in the com- 
arning task. 


ition of subjects. A total of 109 male and 
Mi Subjects in undergraduate psychology courses 
. Snte mi University were administered both the 

“nee Completion Test and the Object Sorting 
e same classification procedures were used 
e described in the original experiment. Thirty- 
T Ï the same subjects were classified by both 
potes; thus, there was a total of 57 subjects in 
Classifications. 
Eure and subjecis tasks. Subjects who were 
d by the Sentence Completion Test and/or 
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the Object Sorting Test participated in the concept 
identification task without any previous concept 
identification training or complete learning experi- 
ence, After the characteristics of the stimulus ma- 
terial were explained and illustrated to the subjects, 
examples of concepts were stated. Two different 
“kinds” (the concept rules of affirmation and ex- 
clusion) of concepts were described, specifically in 
relation to the stimulus material. Tben, the subjects 
participated in the concept identification task, which 
was identical to the task used in the first experi- 
ment. As in the first experiment, examples of all 10 
concept rules were listed as part of the instructions 
of the concept identification task. 


Results 


Neither the Sentence Completion Test nor 
the Object Sorting Test complexity classifica- 
tion methods had a significant effect on the 
total number of errors for both problems of 
the concept (p > .05). The errors in the Sen- 
tence Completion Test complexity classifica- 
tion were: low — 30, middle — 25, high — 25; 
the errors in the Object Sorting Test com- 
plexity classification were: low — 26, middle 
= 24, high = 25. Thus, Hypothesis a was not 
supported. 

Tn the ANOVA of the number of subjects 
reaching the necessary and sufficient informa- 
tion state, the effect due to complexity was 
nonsignificant (p > .05) for both complexity 
classifications. The number of subjects who 
reached the necessary and sufficient informa- 
tion state in the low, middle, and high groups 
in the four-card information load were 10, 11, 
and 9, respectively (Sentence Completion 
Test), and 11, 9, and 10, respectively (Object 
Sorting Test); these values in the two-card 
information load were 8, 5, and 3, respectively 
(Sentence Completion Test), and 8, 4, and 4, 
respectively (Object Sorting Test). Again, 
Hypothesis @ was not supported. 

The main effect of information load was 
significant for the following measures in the 
3 x 2 ANOVA with repeated measures across 
one factor: For the Sentence Completion Test 
complexity classification, number of concepts, 
p< 001; correct number of concepts, p< 
001; percentage of correct concepts, $ < 
.001; percentage of correct rules, p < .001. 
For the Object Sorting Test complexity clas- 
sification, number of concepts, P < 01; cor- 
rect number of concepts, 5 < .001; percentage 
of correct concepts, p < .001; percentage of 
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TABLE 4 


PROBLEMS FOR SEVEN DEPENDENT MEA s 
or THE Concert IDENTIFICATION Task FOR EXPERIMENT II 


MEANs wiruiN Four-Caxp AND Two-CARD 
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SCT complexity classification group OST complexity clas group 
Measure Complexity Comple 
Low Middle High M Low Middle M 
No. cards chosen 
Four 24.4 16.1 11.5 17.3 24.1 13.0 17.9 
Two 22.9 18.1 10.9 17.3 25.9 15.3 18.3 
M 23.7 17.1 11.2 17.3 25.0 14.2 18.1 
No. concepts 2 x 
Four 4.1 7.4 5.8 5.8 6.80 5.53 5.86 
Two 8.1 8.4 7.7 8.1 7.40 7.40 7.42 
M 6.1 7.9 6.8 7.0 7.40 6.47 6 6.64 
No. rules 
Four 2.33 3.73 3.40 3.15 3.20 $.40 | 32 3.09 
Two 2.73 2.93 2.93 2.86 2.87 280 | à 2.80 
M 2.53 3.33 3.17 3.00 3.04 3.10 | i 2.95 
Correct no. concepts | 
Four 1,93 2.80 3.13 2.62 2.20 3.00 2.56 
Two 5.93 6.07 6.60 6.20 5.93 643 | 5.71 
M 3.93 4.44 4.87 4.41 4.07 451 | 444. 
Correct no. rules | | 
Four 180 | 273 | 293 | 248 | 220 | 28 | | 247 
Two 2.47 2.60 2.73 2.60 2.67 2.60 | 2 2.49 
M 2.14 2.67 2.83 2.54 2.44 2.74 | 2.48 
% correct concepts S 
Four 58 50 68 57 48 oy, |S 56 
Two 75 78 87 80 78 85 | 7 | 78 
64 69 78 69 63 75 64 | 7 
% correct rules | | 
Four 77 75 87 80 69 87 & | 81 
Two 92 92 96 93 91 96 82 | 90 
M 85 84 92 87 80 92 84 86 
| | m" 


Note.—SCT = Sentence Completion Test, OST = Object Sorting Test. 


correct rules, p < .05. As indicated in Table 
4, all significant main effect differences were 
in the direction of higher means for the two- 
card load condition. 

The main effect of complexity level was 
nonsignificant (p > .05) on all of the seven 
dependent measures in both the Sentence 
Completion Test and Object Sorting Test 
complexity classifications. However, the inter- 
action between information load and com- 
plexity level for the number of rules in the 
Sentence Completion Test complexity classi- 
fication group was significant (p < .05); the 
interaction was also significant (p < .05) for 
the percentage of correct rules in the Object 
Sorting Test complexity classification group. 
The effect on the number of rules (Sentence 
Completion Test) seems to have been influ- 
enced by the lower number of rules used by 
the low-complexity subjects in the four-card 


load condition stage. This provided some SUP. 
port for Hypothesis b. The effect in percent- 
age of correct rules (Object Sorting Test) 
appears to have been due to the lowest pel 
centage having occurred with the low-comi 
plexity subjects in the four-card load condi 
tion stage and the lowest percentage havi 
occurred with the high-complexity subj 
in the two-card load condition stage. Th 
main effect of complexity level was nonse; 
nificant (p > .05) on all of the seven dep? 
dent measures in both the Sentence Comple 
tion Test and Object Sorting Test complexity 
classification groups. Thus, no support was 
obtained for Hypothesis d. 

Pearson product-moment correlations Welt 
computed using 93 of 106 subjects who conr 
pleted the Object Sorting Test and Sentent 
Completion Test because 13 subjects did 00 
have all of the information available; 


n was .04 (p> .05). American Col- 
scores were available for 93 of the 
s who completed the Sentence 
m Test and the Object Sorting 
the correlations were .02 (p  .05) 
the Sentence Completion Test and 
ican College Test and .28 (p < .01) 
n the Object Sorting Test and the 
College Test. 
me of the 57 subjects (American 
Test scores were not available for 6 
were used in the calculation of the 
ations among the dependent mea- 
hin both the four- and two-card load 
(see Table 5). There was a signifi- 
ive correlation (r = —.31, p < .05, 
between the Sentence Completion 
e and the number of errors in both 
of the concept identification task. 
was a significant correlation (r = 
< 0l, n = 51) between the Sentence 
etion Test and the American College 
‘The significant correlations between 
tence Completion Test and three de- 
measures of the four-card informa- 
d were: number of rules, 7 = .34, p < 
1; number of correct concepts, 7 
| P< .01, n= 51; number of correct 
r= 44, p< 01, n= 51. There were 
nt correlations between the number 
rors and the following measures of the 
-card information load: number of cor- 
concepts, 7 = .43, p < .01, n = 51; num- 
tof correct rules, r = 41, p < 01, n = 51; 
‘Percentage of correct rules, 7 = .27, p< 
n=51. The correlation between the 
of errors and the number of cards 
nin the two-card information load was 
Hr = 27, p < .05,n=S1, 


ssion 


) 


omitting the concept identification 

sessions and the complete learning 
different results were obtained in the 
Pt identification task than in the origi- 
experiment. No significant differences in 
lors among individuals of different inte- 
€ complexity levels were obtained in the 
lon experiment. However, the negative 
tion between integrative complexity 
nce Completion Test scores) and the 
of errors was significant. The task 
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TABLE 5 
PEARSON PRODUCT-MOMENT CORRELATIONS FOR 11 VARIABLES FOR THE INFORMATION Loans or Four Carns (LOWER TRIANGLE) 


AND Two CARDS (UPPER TRIANGLE): EXPERIMENT IT 


sp <.001. 


e identi- 
s 


h problem: 


American College Test 


No. cards chosen 
No. concepts 


Variable 
in the conc 
** 5 <.01. 


fication task (bot 
Correct no. concepts 
9. Correct no. rules 


*p «.05. 


1. Sentence Completion Test 


2. Object Sorting Test 
3. Number of errors 
10. % correct concepts 
11. % correct rules 


4. 
5 
6. 
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may have been too difficult for individuals 
with no training at all in concept identifica- 
tion that it allowed for very little variation. 
However, it is also possible that other un- 
known variables intruded to produce the lack 
of a significant effect due to the Sentence 
Completion Test complexity classification. It 
is clear though that training is necessary to 
produce the significant effect of Sentence 
Completion Test level on the number of er- 
rors obtained in the original experiment. In 
accordance with the first experiment, there 
was also no effect of complexity as measured 
by the Sentence Completion Test or the Ob- 
ject Sorting Task on any of the seven depen- 
dent measures. The results concerning the 
differences in the efficiency in obtaining in- 
formation as measured by the number of sub- 
jects reaching the necessary and sufficient 
information state at each complexity level 
indicated that there were no significant dif- 
ferences among individuals of varying inte- 
grative complexity (Sentence Completion 
Test) in either the four- or two-card load 
problems. Results in the original experiment 
indicated that the individuals of lower inte- 
grative complexity obtained information less 
efficiently than individuals of higher complex- 
ity in the two-card load. Differences, although 
not significant, were also obtained in the two- 
card load problem in the replication experi- 
ment. However, the difference of the replica- 
tion experiment was in a direction opposite to 
that obtained in the first experiment; the 
individuals of low integrative complexity 
reached the necessary and sufficient informa- 
tion state most often, the middle-complexity 
individuals reached the necessary and suffi- 
cient information state second most often, 
and the individuals of the highest integrative 
complexity reached the necessary and suffi- 
cient information state the fewest number of 
times. Possibly, low-integrative-complexity in- 
dividuals may function more efficiently than 
individuals of higher complexity in situations 
where the task is very complex and where no 
prior experience (i.e., training) has occurred. 
Although highly speculative, a possible expla- 
nation might be that high-integrative-com- 
plexity individuals generate so many alterna- 
tives that they are not prepared to integrate 
them without any additional experience. This 
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latter interpretation is cong: 
authors’ previous conclusion (sei 
graph headed “Hypothesis a” i 
sion of the original experim: 
complexity individuals may ge: 
alternatives for maximum «< 
the replication experiment pro 
come equivocal in its support 

d. 
The replication experiment 
to find out whether any difi: 
number of concept rules util 
viduals of varying degrees of integrativ 
plexity (Hypothesis c) were m 
training in the concept ide 
training sessions’ complete le: 
replication experiment did not ' 
differences in the number oí 
utilized by individuals of vary 
complexity levels. However, | 
between information load an: | 
level was significant. Low-cor; n 
tence Completion Test) indi 
fewer concept rules in the four-ca™ 
dition stage than individuals oí 
plexity. The two-card load presen 
with so many possible concep 
that it may have been too diffi: 
subjects who had mo training, henc 
ferences among the complexi 
have been obtained. However, t 
the four-card load was easier (fey 
concepts), thus allowing for differences ? 
tween individuals of varying complexity lev- 
ds. This explanation is, of course, post RO 
and highly speculative; the significant inte 
action could have resulted from the intrusioi 
of other unknown variables, or it could have 
been simply due to chance. The significant 
correlation in the replicated experiment 5& 
tween the Sentence Completion Test and Hi 
American College Test concurs with those ! 
ported by Schroder et al. (1967). Althou 
this correlation was not significant Ih. 
original experiment, it was in the same dire 
tion. However, it is important to note th 
this relationship does not seem to account Í 
all of the significant correlations (with 9 
four-card load) obtained between integral! 
complexity and some of the dependent m 
sures (number of rules, number of cort 
concepts, number of correct rules), bec? 


cation 


etween the American Col- 
measures were not sig- 
> significant correlations 
mentioned interaction do 
rt for Hypothesis c that 


gvide some sup; 

s not obtained i» the original experiment. 

It is important to note that these measures, 
reflect the ability to perceive and or- 


the input elements, and which were 
elate ive complexity on the task 
h no training (Experiment II), were re- 
lated to intelligence in the task with thorough 
raining (Experiment 1). We might specu- 
late that these findings indicate that percep- 
and organizational abilities are related to 
integrative complexity when individuals are 
nvolved in tasks with no training; however, 
s more training and understanding of the 
problem develops, these abilities are then 
ssociated with measures of intelligence. 
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DIMENSIONS OF TRAIT PREFERENCES AND 
PERSONALITY TYPE * 


EMIL J. POSAVAC ? 
Loyola University of Chicago 


A multivariate analysis was conducted to isolate individual differences in ti 
degree to which 28 personality traits and social behaviors were seen as desira 


in friends, One hundred and forty-tl 


fraternities, reported relative preferences between members of all possible p 
of the traits and behaviors, The preference judgments produced seven subj 
dimensions, which were compared to the subjects’ scores on a battery 


personality and social attitude measu 


potheses that the individual differences in the trait preferences are sizable a 
that the subjects’ personalities are systematically related to their preference 
Similarity between the preferred traits and the subjects’ personalities w 
generally found, although some suggestive evidence supporting the compl 
mentarity theory of attraction was also obtained. 


The area of interpersonal attraction has 
received considerable theoretical and research 
attention in recent years (Aronson, 1969; 
Berscheid & Walster, 1969; Byrne, 1969). 
The relationship between the participants’ 
personalities has remained an unresolved is- 
sue, in spite of a number of investigations. 
Byrne and his co-workers (Byrne, Griffitt, & 
Stefaniak, 1967; Palmer & Byrne, 1970) have 
found attraction to be related to personality 
similarity. On the other hand, some inter- 
personal selections seem to be based at least 
partially on complementarity (Rychlak, 1965; 
Winch, 1955, 1967). 

One reason for the lack of consistency may 
be found in the tendency to structure the 
issue in either/or terms. It may be that both 
formulations are valid representations of vari- 
ous forms of the attraction process. Levinger 
(1964) and Cattell and Nesselroade (1967) 
pointed out that happily married couples 
are probably similar on some traits and 


1 This study is based on a dissertation submitted 
to the University of Illinois, Urbana, in partial ful- 
fillment of the requirements of the PhD degree. The 
research was partially supported by Grant MH- 
13892 from the National Institute of Mental Health, 
United States Public Health Service to Nancy Wig- 
gins. The suggestions of Jerry S. Wiggins, who 
served as dissertation chairman, Homer H. Johnson, 
and Nancy Wiggins are gratefully acknowledged. 

?Requests for reprints should be sent to the 
author, Department of Psychology, Loyola Univer- 
sity, 6525 North Sheridan Road, Chicago, Illinois 
60626. 


social 


hree subjects, drawn from three 


n 


res. Support was obtained for the } 


complementary on others. Neit) $ 
discussions, however, dealt with ih 
ity that some people may prefer S 
plementarity in friends or spouses, while 
ers prefer only similarity. In other 
there may be individual differences in | 
adequately a single theory represents a giv! 
person's processes. Tucker (1966) sug 
that different theories, even c 
ories, may all be valid, but for 


trated how an interpersonal 
sired by just a few of the subjects would 
overlooked if the data were only analyze 
detect average sample tendencies. In ord 
detect trait preferences exhibited by on 
small proportion of the subjects, it is n 
sary to use a methodology which can (a) i 
late individual differences, (5) estimate 
preference scale values for these indivi 
tendencies, and (c) relate the personat 
and social attitudes of the subjects to W 
trait preferences. 

A methodology which can handle pret 


preference model E 
(1960). The multivariate model assumes i 
for a given individual, preferences are ui 
dimensional (cf. Torgerson, 1958); howev! 
it allows for the presence of different reta 
preference hierarchies among the subjects. : 
vector-preference model detects such disti 
hierarchies by extracting dimensions from 
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trix rather than from 
mal Variable X Variable 
extracted from a Subject 
re called subject dimen- 
dimensions are related to 
alities, the usefulness of 
iplementarity theories can 
ch dimension, The specific 
ere tested in the present 
vable individual differences 
trait preferences in the 


matrix. Dimensi 
x Subject Matrix 
sions. After thes: 
the subjects’ 
Similarity and 

be assessed for 
hypotheses whic 
study were: (/ 
would emerge 


of subje vensions; (5) these indi- 

l differences would be expressed in dif- 

nt preferen« erarchies; and (c) the per- 
ities and social attitudes of the subjects 
would be relate » the subject dimensions. 
Although it was expected that the subjects 


would, in general, prefer friends to possess 
traits similar to their own, the multivariate 
approach permi the detection of the pres- 
ence of subje vho desired a particular 
complementary relationship. Even if there 
were few such subjects, they could form a 
separate subject dimension and thus be de- 
tected. 


N METHOD 
Subjects 


The 143 subjects were drawn from three social 
rnities on the University of Illinois, Urbana, 
fampus, They were paid participants in a larger 
Study of individual differences in judgment. The 
three fraternities had quite different reputations on 
p One house was professionally oriented and 
ten obtained scholarship awards, but its members 
Were thought to be socially immature by the other 
ding houses. The reputation of the second was 
M a traditional drinking fraternity, Members 
a le third house generally had respectable grades 
enjoyed a high social standing on campus. 


Testing Procedure 


hy of the fraternities were tested in their dining 
trait and the third in a classroom. In all cases 
held eme were obtained in a group session 

e early evening. Personality and social atti- 
© scales were self-administered during the week 


cta 
he the group session at the subject’s conve- 


P E 
‘sonality and Social Attitude Variables 


tsonality and social attitude variables (here- 
maj led "subject variables") induded the Ed- 
{ ersonal Preference Schedule (Edwards, 1954), 
{Crowne we- Crowne Social Desirability Scale 


e & Marlowe, 1964), a social attitude bat- 
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TABLE 1 


Exacr WoxpiNG or Items PRESENTED TO THE 
SUBJECTS AND ACCOMPANYING SHORT NAMES 


Trait and social behavior Short name 
L Doniljansg in interpersonal relations | Dominant 
2. Submissive in interpersonal relations | Submissive 
3, Generally warm in relations with 
others (casy to get along with) Warm 
4. Generally cold in relations with 
others (hard to get along with) Cold 
5. Sociable in groups (extravert and has 
many friends) Extravert 
uiet in groups (introvert) Introvert 
7, Well-adjusted : generally satisfied, 
able loge along with others reason- 
ably well Well-adjusted 


8, Somewhat maladjusted : often in- 
volved in misunderstandings with 
others, dissatisfied with hislife and 


with others Maladjusted 
9, Enthusiastic and informal Enthusiastic 
10, Restrained and formal Restrained 
11. Datesin a socially attractive sorority | Dates 
12, Member of a leading fraternity Fraternity member 
13, Is concerned about his social standing 
on campus Social standing 
14, Would prefer to talk to someone in his 
own major about the things they are Mi 
ajor 


studying than to anyone else 


15, Hasa le point average of 4.6 4.6 
16, Has a grade point average of 3.2 3.2 
17, Intends to go to graduate school Graduate 
18, Often discusses the Vietnam War and 

social changes in the U. S. A. Vietnam 
19, Has participated in demonstrations 

on campus Demonstrates 
20. Likes to discuss art and philosophical 

issues Art-philosophy 
21. Conversation usually concerns pro- 

fessional and college sports Sports 
22, Participates in intramural athletics | Intramural 
23. Centers his conversation around girls 

and dates Girls 
24. Conforms to generally accepted 

standards of dress and grooming | Conventional dress 
25. Dresses like a hippie Hippie dress 
26. Would be considered physically at- 

tractive to a member of the op- 

Attractive 


posite sex 
27. Maintains traditional moral standards 


in his behavior Moral 
28. Feels that traditional moral standards 
(as applied to sex and drugs, for 
example) are not relevant for people 
today Immoral 
tery (Adorno, Frenkel-Brunswick, Levinson, & San- 


ford, 1950), the Social Role Inventory (Simmons, 
1968), a biographical inventory, college entrance 
scores, and peer nominations on 14 personality 
traits. The Social Role Inventory, the only non- 
standard test used, measured tendencies toward 
either a campus social orientation (low scores) or 
an academic orientation (high scores). 


Preference Ratings 


Twenty-eight traits and social behaviors were se- 
lected to represent the main dimensions of person- 
ality and the social behaviors likely to be performed 
by members of college social fraternities and male 
students in general. Table 1 contains the exact 
wording of the items presented to the subjects and 
short names used in all following tables and in the 
discussion of the results. For the sake of brevity, the 
items are referred to as “traits” or “stimuli.” The 
subjects reported their relative preferences between 
the members of all possible pairs of the stimuli in 
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Table 1. Specifically, the subjects were asked to in- 
dicate to what degree they would prefer one trait 
over the other in a male friend. These relative 
preferences were reported on scales such as the fol- 
lowing: 


A B 
Dates in a socially Conversation usually 
attractive concerns professional 
sorority and college sports 
e ud Wo ihe | ala a 3 e 
Prefer A No preference Prefer B 


In order to assess reliability, 14 pairs were re- 
peated, making a total of 392 pairs. The mimeo- 
graphed sheets containing the pairs were presented 
in several random orders, each stimulus appeared in 
the left position nearly as often as in the right posi- 
tion, and the same stimulus never appeared in two 
succeeding pairs. 


RESULTS AND DISCUSSION 
Reliability of the Preference Judgments 


The preference-judgment reliabilities of 8 
of the 143 subjects were below .46 (p > .05). 
These unreliable subjects were eliminated, and 
further analyses were performed with 135 
subjects whose mean reliability for the 14 re- 
peated pairs was .81. 


Extraction of Subject Dimensions 


The sums of squares and cross-products 
matrix among 135 subjects over 378 relative 
preferences were obtained. This matrix was 
subjected to a. principal-components analysis 
in order to extract the dimensions of the sub- 
ject space (Tucker, 1968; Tucker & Messick, 
1963). The location of the break in the eigen- 
value plot suggested that seven was the most 
appropriate number of factors to retain. The 
first seven eigenvalues accounted for 50.3896, 
8.48%, 4.78%, 2.90%, 2.30%, 1.75%, and 
1.65%, respectively, of the total sums of 
squares. 

The dimensions retained for further analy- 
sis were then rotated to an oblique simple 
structure using the Binormamin criterion 
(Harman, 1967; Kaiser & Dickman, 1959). 
This rotation procedure tends to yield dimen- 
sions that closely represent real subjects. If 
the subject dimensions were only mathemati- 
cal trends not representative of real subjects, 
the dimensions would have been less interest- 
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ing psychologically (Cliff, 19553). At this 


point in the analysis, each subje had a pro- | 
jection (or a loading) on each of the rotated 
dimensions. These projections could be treated 
as scores on variables and cov! be corre- 
lated with other measures on th* subjects. 
Trait Preferences Reflected in ‘he Subject 
Dimensions 

Before relating the subject variables to the 
subject dimensions, it was necessary to iden- 
tify the preferences tapped by ihe subject 
dimensions. Using the Eckart-Young thi 
(Eckart & Young, 1936; Tucker & Messick, 
1963), it was possible to calcii^ie the pro- 


jections of the 378 stimulus pairs on each of 
the seven subject dimensions, The projec 
tions were rescaled and treated as relative 
preferences of conceptual subjects who were 
pure representatives of the seven subject di- 
mensions. The projections for each dimension 
were arranged in a 28 X 28 i terstimulus 
preference matrix, which was skew sym- 
metric, with zeros in the main ci»gonal. The 
cells indicated the degree that the column 
trait was preferred over the row trait. The | 
preferences ranged from —3 to +3 and were 
interpreted as unit normal deviates (L. R 
Tucker, personal communication, February 
1968). Thus, the Thurstone scale values for 
a subject dimension were given by the means 
of the columns of the interstimulus matri 


gins, Wiggins, & Conger, 1968). These scale 
values and the mean scale values for the whole 
sample are contained in Table 2. Since the 
preferences of the subject dimensions were 
quite different from the mean preferences, its 
clear that there were individual difference 
present in the judgments of the subjects. The 
scale values are interpreted below, along with 
the correlations between the subject variables 
and the preference dimensions. 


Relation of the Preference Dimensions and 
the Subject Variables 


Since each subject had a projection on each 
dimension, these projections were correlat 
with the subject variables. Table 3 contain 
the subject-variable correlations. When ? 
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TABLE 2 
STMI CALE VALUES FOR THE ENTIRE SAMPLE AND FOR THE SUBJECT DIMENSIONS 
JM Mesa entire Subject dimension 
sample* 
| I Il Ill IV v VI VIL 
1. Dominant EE os ee ales ae = 

2 Submissive | VS Se NE E ee i-a 
3, Warm 1.70 AT 1.30 1.08 1.49 ‘50 ‘91 E 
4 Cold ieee 7982] 05100) et a 
5, Extravert 1.27 ‘87 ‘98 1.07 ‘06 | — .09 33 “49 
6, Introvert | — "73 | —.60 | — 43 | — 71 | — 06 27 | = 30 | — 35 
T. Well-adjuste: "oU 46 1.14 1.04 141 70 -86 85 
8, Maladjusted | -iu 5-44 | So 5104/72520 octo E 57 
9, Enthusiastic | 149 16 1.33 1.19 1.44 02 67 ‘88 
10, Restrained Mie ed Peet all ESOS SOT est 107 7|: SSH 
Il. Dates - ‘09 Sp | — 45 46 | —.76 | — 35 22 EST 
12, Fraternity member .01 74 — A3 52 — .16 — 07 -00 — .50 
13. Social standin ‘01 EL EUST 34 | — .41 40: [heed 35 
14. Vietnam 1 BOE — 42 1.40 — 23 .26 — 37 — 58 — AT 
45, Demonstrates | — 182 | =) 46. | = 1,045 do 06 | — 59 | — 99 | —101 
is Art-philosopl BE 160|—43|-2|-0.-—.6 38 
p Major Ew 1.16 Hale EAS 1.27 «66 78 
B ports .02 33 = 31 — 74 — .36 — 19 1.56 — .22 
rIntremural | “39 58 01 oi) = 880) = 20 156 | — .34 
10. Girls = 0 "39. Os A N 26 | = 76 | o 52 1.13 
pi — 22:583. TERME 69 | — .54 45 Q3 | — 40 
ES | = 38 "24 | — 28 | — 16 | — 44 22 (06 | — .56 
E As uate — | — .08 — .63 29 E — .27 32 16 — .65 
A e iona dress 33 80 — .51 — 24 .26 .80 .05 1 
Sebi dress —1.00 — 8 — 42 | — 44 72 — 66 | —145 —1.25 
2, Attractive | 31 1.03, | —.22 FEX REESE TUS (ieee pa NETT 08 
1. Moral | 18 | essetis: ee d530 48 155 | — 12 A0 
. Immoral — 3g Jamas — 15 "31. | —.1f | —128| —.3 1.09 


‘The maximum po 
M ane the scale 
on of a stimulus wit ime i i 
E ooa sum. vithin a dimension with its posit 


limited proportion of a large number of cor- 
relation coefficients is found to be statistically 
Significant, care must be exercised in interpre- 
E because the probability of committing 
Thus I error is considerably above 05. 
E. i a the cases where a subject dimension 
ble E dia very few significant subject-varia- 
Eus ates, interpretations of subject per- 
other t type were very tentative. On the 
Bie and, when a consistent pattern of 
herd variable correlations was obtained, the 
s were viewed with much more confi- 
ence, 
E presenting the interpretations of the 
Eas dimensions and the associated per- 
m : t» it should be noted that the 
Init alues were only measures of relative 
Fiar preferences. All subjects were social 
ca ©. members, and it can safely be as- 
En that most of them liked fraternity 
ers as friends. While some subject di- 


Je scale value was 3.00 and the minimum possible was — 3.00. 
values of the seven dimensions may not be based on the same scale, it is best to compare the relative 
ion in another dimension, rather than to compare numerical scale values 


mensions produced negative-preference scale 
values for *Fraternity member,” this was only 
a relative departure from the typical pattern 
and should not necessarily be construed as a 
total rejection of the fraternity system. 

Dimension I: Conventional friend. Dimen- 
sion I represented preferences for a friend 
with the conventional values as defined by 
the traditional social fraternity subculture 
(“Attractive,” “Dates,” *Extravert," “Con- 
ventional dress,” "Social standing," and 
“Fraternity member”). The traits with the 
most negative scale values were those associ- 
ated most strongly with deviations from this 
role (*Immoral," «4 .6,” “Hippie dress,” “Art- 
philosophy,” and ‘Demonstrates’’). 

The subjects whose preferences were most 
like Dimension I tended to be like the con- 
ventional social fraternity member (Sanford, 
1962). Their social attitudes were conserva- 
tive (+ Social Attitude Battery, + religious 


=)? A 
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attendance). They were sociable (+ afilia- (+ Marlowe-Crowne Social Desi: .' ‘lity Seale, 

tion, —Social Role Inventory) and were + anxious). It seemed that t gave up 

concerned about what others think of them some of their personal freedom in order to fit 
TABLE 3 

CORRELATIONS OF THE SUBJECT VAxIABLES WITH THE SUBJECT Dimension 


Subject dimension 


12 
Affiliation = 
Intraception 1 
Succorance -M 
Dominance a 
Abasement = 
Nurturance = 
Change 14 
Endurance = 
Heterosexual —15 
Consistency =n = 
Peer ratings 
Calm 12 — 20* 14 
Anxious us — pem —16 —10 — 
Cooperative — —13 16 = P 
Competitive 11 -15 — -15 2 A 
Poli: 16 — 10 -— "i 
Crude tac 14 V E 2 p 
ible ert) 17* i. —14 
U dable — 15 — ES -23 FS 
Sociable — 15 — |-17 10 EY 
Unsociable 16 —13 11 14 "m 
Well-adjusted 10 E: = — i2. eae 
Maladjusted = 10 um — |-x*|-H 
Domineering 17* 12 — — —16 = ZA 
Submissive -— — = 20* 18* = E 
Marlowe-Crowne Social Desirability Scale 19* —15 — 15 18 = ^B 
Social Role Invento —41***|  25** | —15 — 28*** | —20 = 
Entrance test PERETE College Test) 
English —10 18 E = SE 16 
Mathematics —19 = = 13 — |-13 A 
Social Science —19 12 — 14 — |- PT 
Natural Science —18 — — — |-20* 17 
Composite —21* 12 = E E E 15 
Biographical inventory 9 | 
i. = 12 2 = |=16 zz 
Height -17* — |-10 — = mc "m 
Year in school € 18 jt | —38 
Grade point average —26* EX e 12 15 = » | 
Prestige value of college major 12 —12 |-11 — 18* E 0 
Religious attendance frequency 19* = |=17 21* 20* 15 SÉ 1 
Mother's educational level 36 | ote | — 2 16 — 1 


autonomy, + coopera- 
their academic standing 
vere somewhat less than 
int average, — composite 
Test, — mother’s educa- 
showed an above-average 
+ heterosexual). Thus, it 
for subjects whose pref- 
his dimension, attraction 
nality similarity. 
itellectually and politically 
tive to the other dimen- 
se preferences were most 
most strongly preferred 
politically aware friends 
' Vietnam," and “Major”). 
tended to follow this pat- 

h “Demonstrates” had a 

ale value, it was the ninth 
it on Dimension II, com- 

e twenty-fourth most pre- 

sample as a whole. 

;ose preferences were most 
| tended to have more edu- 
i to be upperclassmen. They 

y had above-average aca- 
i + Social Role Inventory). 
is academic orientation agreed well with 
preferences; however, there were insuf- 
ent additional correlates to permit a more 
complete assessment of these subjects’ person- 


interests 


imension III: Social, immoral, and bright 
. The preferences isolated by Dimen- 
ILI were similar to those of Dimension I, 


ality were also preferred, while stimuli 
ing to political interests were not re- 
ted quite as strongly. 
Subjects whose preferences were like Di- 
on III did not have guilt feelings (— 
pun ent) and reported an interest in women 
heterosexual). Both of these character- 
were similar to traits preferred in the 
friend, but there were not enough sig- 
“ant subject-variable correlates to draw 
elinite conclusions concerning support of a 
kory of interpersonal attraction. 
_ Dimension IV: Interpersonally wholesome 
nd. Dimension IV represented exception- 
Y strong preferences for the socially desir- 
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able personality traits (“Warm,” “Well-ad- 
justed," and *Enthusiastic"), The magnitude 
of the scale values of these items indicated 
that they were more strongly preferred by 
the subjects whose preferences were like Di- 
mension IV than by the other subjects. The 
least preferred characteristics referred to fra- 
ternity activities and academic interests. A 
moderate preference for “Dominant” was 
unique to this dimension. The most preferred 
friend thus was exceptionally nice or nur- 
turant. 

While there were few significant subject- 
variable correlates with this dimension, there 
were some interesting relationships, Subjects 
whose preferences were like Dimension IV 
were conservative (— change, + religious at- 
tendance), responsible, calm, and submissive. 
It thus seems that there was some evidence 
supporting the complementarity theory of 
attraction, since a calm, submissive person- 
ality is complementary to an outgoing, active, 
dominant personality. 

Dimension V: Serious and traditional 
friend. The preference tendencies isolated by 
Dimension V seem to describe the "ideal 
young person" from the viewpoint of many 
parents and traditional state legislators. A 
student having the preferred traits (*Moral," 
“Major,” “Conventional dress,” and *Well- 
adjusted”) and not having the nonpreferred 
traits (“Immoral,” “Hippie dress,” “Demon- 
strates,” “Dominant,” and “Girls”) would 
probably keep busy with coursework and sel- 
dom question the values held by his parents 
or the institutional norms of the university. 

Taken together, the subject-variable corre- 
lates of this dimension seem to suggest that 
subjects whose preferences were most like 
Dimension V were similar to the anal re- 
tentive character of Freud (Maddi, 1968). 
These subjects were submissive to external 
authority (+ deference, — autonomy, + sub- 
missive, + religious attendance) and conserv- 
ative (— change). Stubbornness and orderli- 
ness were indicated (+ endurance, + order). 
They did what students should do, namely, 
study (+ Social Role Inventory). Finally, if 
the negative correlations with aggression (— 
aggression) and sexual interests (— hetero- 
sexual) indicated denial of unwanted ten- 
dencies, then the personality type is nearly 


| 


a, 
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complete. Dimension V tapped preferences 
for conservative friends by subjects who were 
themselves very conservative. 

Dimension VI: Athletic friend. The ideal 
friend desired by Dimension VI was a person 
strongly interested in athletics and, more 
moderately, in girls. Otherwise, the pref- 
erences of Dimension VI were for a person 
who was rather conventional. 

The subject-variable correlates suggested 
that people like Dimension VI were somewhat 
lower in academic ability (— English, Ameri- 
can College Test) and reported a lower iden- 
tification with an academic outlook (— Social 
Role Inventory). These subjects were repre- 
sented less often than average in peer nomi- 
nations for negative personality traits (— 
crude, — undependable, — maladjusted). On 
the other hand, they were rated as relatively 
more competitive. They tended to be lower- 
classmen, There were some similarities be- 
tween the preferred friend and the subjects' 
personalities, but not enough to warrant gen- 
eralization to a theoretical position. 

Dimension VII: Immoral, girl-watcher, 
friend. The traits most highly preferred by 
subjects like Dimension VII (*Girls" and 
*Immoral") tended to describe a friend who 
was somewhat of a "swinger." The least pre- 
ferred traits concerned more radical political 
behaviors and interest in academic work. 

The subjects whose preferences were most 
like this dimension did not readily yield to 
authority (— deference), were not especially 
well-socialized (— polished, — well-adjusted), 
were younger than the typical subject, and 
chose higher status majors. While these cor- 
relates were few in number, they did suggest 
a young, immature college student. Poorly 
socialized individuals, therefore, preferred a 
friend who tries, at least, to be a swinger. 
This relationship suggested a variant of com- 
plementarity theory which Cattell and Nessel- 
roade (1967) called “completion.” The com- 
pleteness principle refers to the tendency of a 
person to be attracted to others who possess 
a desirable trait, to the extent that the person 
himself lacks the trait. It seemed that some 
subjects might have wished to learn better 
social techniques from friends and/or to gain 
recognition through association with social 
swingers. 


Emi J. Posavac 


A Further Test of Similarity Theory 


Considering each subject < sion sepa- 
rately permitted the examination of conflict- 


ing theories. It was also possible to conduct a 
test of similarity theory usi: subjects" 
fraternity affiliations. If people are attracted 
to similar peers, then it is ' that the 
members of a social fraternity ild report 
similar trait preferences. Str ipport for 
similarity theory would have b ;btained if 
all the members of one frat: had large 
projections on one dimension 5 f there had 


been one such dimension for € 
However, more than three sub 
were actually found, and an a! 
test of similarity theory was ch 
criminant analysis procedure 
1958), treating the subject’s 
the subject dimensions as variabiss, was used 
to find the best linear combinations of the 
subject dimensions that could discriminate 
among the fraternities. After sho,ing that the 
mean preferences of the fraternities differed 
from each other (Mahalanobis D = 39.61,* 
df= 14, p < .001), each subject was classi- 
fied according to which of the three discrimi- - 
nant functions his preferences were most simi- 
lar. These classifications were not random over 
fraternities (x? = 31.23, d — 4, p< .000). 
Random assignment to three groups would 
have resulted in 3396 correct classifications; 
however, the discriminant anal; correctly 
grouped 5495 of the subjects. These results 
concurred with the conclusions drawn from 
the subject-dimension analysis; while most 
subjects seemed to conform to similarity 
theory both in their reported preferences for 
friends and in their choices of social fraterni 
ties, similarity theory did not account for 

of the trait preferences, nor did the members 
of any one of the fraternities completely agree 
on the preferred characteristics of a friend. 


3 Mahalanobis D? may be used as a chi-square. 


fraternity. 
dimensions 
nate overall 
en, A dis- 
( &nderson, 
jections on 
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TEMPORAL CHANGES IN PREDECISION PREFERENC 


ES 


AMONG CHOICE ALTERNATIVES 


DARWYN E. LINDER,! CAMILLE B. WORTMAN, anv JACK W. BRE! 
Duke University 


In a recent study, Linder and Crane reported evidence in support of the | 
reactance theory prediction that ratings of the attractiveness of two deci: | 
alternatives will converge as the decision point approaches in time. 5 


alternative explanations of their data 


founding of the amount of time remaining until the decision with the am 


can be based on the inadvertent con 


je 


of information about the alternatives that the subject might reasonably exp 
to obtain. In the present experiment, time until the decision was varied in 
pendently, while expectations about additional information were held constan! 
Predecisional convergence of the attractiveness of decision alternatives v 


again demonstrated by data showing that the shorter the amount of ti 
remaining until the decision, the more nearly equal the attractiveness of i 


alternatives. 


Evaluative preferences among choice alter- 
natives during the predecision period have 
been assumed to be stable (Festinger, 1964) 
or under certain conditions to become more 
disparate (Janis & Mann, 1968; Mann, Janis, 
& Chaplin, 1969). Recently, however, Linder 
and Crane (1970) used Brehm's (1966) 
theory of psychological reactance to predict 
that predecision ratings of the attractiveness 
of the alternatives involved in a choice will 
converge as the time at which a committing 
decision must be made draws near. While 
Linder and Crane offered evidence in sup- 
port of this hypothesis, their method of ma- 
nipulating the amount of time remaining un- 
til the decision confounded that variable with 
the amount of additional information that 
the subject might plausibly expect to obtain 
about the alternatives. The present study was 
designed to eliminate this confounding and 
thereby render less plausible those alternative 
explanations that assume that the subjects 
expected different amounts of information in 
the different experimental conditions. 

The prediction that predecision ratings of 
the attractiveness of decision alternatives will 
converge as the decision point draws near in 
time was derived in some detail in the Linder 
and Crane (1970) study and is not repeated 
here. To test their predictions, Linder and 
Crane recruited subjects for a “National 

1 Requests for reprints should be sent to Darwyn 


E. Linder, Department of Psychology, Duke Uni- 
versity, Durham, North Carolina 27706. 


Foundation for Opinion Research Interview" 
and told them that they id be inter 
viewed by one of two men equa!ly well trained 
for the job. Subjects were told that because 
of the very personal nature of the interview, 
they would be allowed to choose the man 
with whom they wished to work after meeting 
simultaneously with both of them. The sub- 
ject then read descriptions of the two men 
and was asked to make an initial rating of 
how much she would like to be interviewed 
by each one. Linder and Crane manipulated 
the amount of time remaining until a decision 
would be made by varying the length of the 


expected “get-acquainted” period. Three ex 
perimental conditions were created by telling 
independent groups of subjects that they 
would have 3, 8, or 15 minutes to speak with 
the interviewers before making a decision: 
The data indicated that the shorter the 
amount of time remaining until the decision 
the smaller the discrepancy between the rat- 
ings. Also, subjects who merely rated the pe 
interviewers on the basis of the description 
placed their ratings about as far apart as di 
subjects in the 15-minute condition. _ f 

The Linder and Crane (1970) experimen 


provides some corroboration for the react 
theory account of predecisional cogo) 


processes, but the manipulation of time V 
the decision was, as noted earlier, confoun 
with the amount of information that the 7 
ject might expect to obtain from the P 
acquainted period. The reactance theo 


282 


formulation would be more strongly supported 
by a study in which a manipulation of time 
remaining until a decision was not confounded 
with the amount of information that subjects 
might expect to obtain. 


\LETHOD 
Subjects 

Subjects for thi: 
introductory level p 
yersity. Students 
course at this level 


riment were recruited from 
ology classes at Duke Uni- 
d for the first time in a 
required to participate in a 
certain number cf experiments, but they are free to 
choose from ar > rather large number offered 
each semester. Subjects made appointments to par- 
ticipate by signi ı appointment sheet for a Na- 
tional Foundation for Opinion Research Interview to 
be conducted by “Carter and Williams.” All partici- 
pants were fern id each experimental session 
| was attended by only one subject. 


Procedure 

Upon reporti: the room designated on the 
appointment sheet, fhe subjects were met by a 
female experimenter who explained that she was an 
assistant to the project for the National Foundation 


for Opinion Researc) 
‘the interview, 
‘Prior to the inter 
scribed the bogus 
Research project. 

ultimate usefuln: 

followed the script 
(1970), except. f« 


and would not be conducting 
would provide an orientation 
ew. The experimenter then de- 
N i3] Foundation for Opinion 
isizing the importance and 
f the results, This performance 
ecounted by Linder and Crane 
ı few instances of rephrasing to 
accommodate the speech patterns of a different ex- 
Perimenter, Briefly, the subject was told that she 
Would be interviewed concerning matters ranging 
ftom “dating patterns and social regulations to 
Your own views on sex.” She was told that in order 
to facilitate honesty and candor during the inter- 
Mew, she would be able to choose her interviewer 
Tom two graduate students in clinical psychology. 
The 10-minute condition was created by allowing 
ely 3 minutes to elapse before the experimenter 
turned to the subjects room to administer the 
E scales. In the 10-minute condition, the experi- 
mw er remarked that one of the interviewers was 
En up" but that he would be through in 
inr h minutes, so that the subject would be mak- 
ES decision in about 10 minutes, after the 3- 
[p A chat, In the two experimental conditions 
ie subjects rated the attractiveness of the two 
9) interviewers, expecting a total of either 10 
3 ès or 3 minutes to elapse before making a 

Mmitting decision 
Gets in a no-decision control condition were 
de to a procedure identical to that used by 
tion E Crane to create a similarly labeled con- 
intervi hey were told that they would not be 
viewed, but that in order to determine the 
le effects of the written descriptions, they were 
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being asked to read them and to rate the attractive- 
ness of Carter and Williams as interviewers. After 
the ratings had been obtained from all subjects and 
the additional questionnaire completed by subjects 
in the 3-minute and 10-minute conditions, the ex- 
perimenter provided a full explanation of the pur- 
poses of the study and disclosed the deceptions that 
had been employed. 

Subjects in this study were randomly assigned to 
conditions, with the constraint that an equal num- 
ber be exposed to each treatment. The experimenter 
was informed before the subject arrived whether she 
had been assigned to the no-decision control. How- 
ever, if the subject had been assigned to one of the 
other two conditions, the experimenter did not 
learn which time interval to employ until she left 
the subject's room, after having delivered the initial 
instructions. 


RESULTS AND DISCUSSION 


The results reported below are based on 
data collected from 36 subjects, 12 in each of 
three conditions. Subjects indicated how 
much they would like to be interviewed by 
Carter and Williams on two 31-point scales 
identical to those used by Linder and Crane 
(1970). The main dependent variable was the 
absolute value of the difference between these 
ratings. The mean difference between ratings 
was 7.4 points in the 3-minute condition, 12.8 
in the 10-minute condition, and 10.9 in the 
no-decision control condition. The null hy- 
pothesis of no true mean diíference between 
these three conditions was rejected (F = 
5.28, df = 2/33, p < 02). Parametric con- 
trasts (Winer, 1962) revealed that the 3- 
minute condition differed significantly from 
both the 10-minute condition (F = 10.27, df 
= 1/33, p < .005) and no-decision control 
condition (F = 4.29, df=1/33, p< 05). 
These data, then, clearly replicate the results 
reported by Linder and Crane. 

One purpose of this experiment was to ex- 
plore the possibility that the results obtained 


2 Forty-one subjects participated in this experi- 
ment, but the data collected from five subjects were 
discarded. The data from three subjects were de- 
leted from the statistical analysis when the experi- 
menter learned that they had discerned the nature 
of the deceptions employed. Also, postexperimental 
interviews revealed two subjects for whom the pro- 
cedure had not had the intended effects. One subject 
stated that she had made a final decision on the 
basis of the written material, and another said that 
she rated the two men equally attractive as inter- 
viewers because she “didn’t want to hurt anyone's 


feelings." 
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by Linder and Crane were due to the sub- 
jects’ expectations of obtaining different 
amounts of information in the different ex- 
perimental conditions. In the present experi- 
ment, all subjects expected a 3-minute inter- 
action with the potential interviewers, and 
subjects in the 10-minute condition should 
have expected no more information than sub- 
jects in the 3-minute condition. The results of 
the two experiments are, however, strikingly 
similar, making it difficult to retain any al- 
ternative explanation of the Linder and Crane 
data based on the assumption that the more 
information a subject expected to obtain, the 
more disparate his ratings of the two inter- 
viewers. Since subjects in both experiments 
confronted the same decision, with time until 
commitment as the common independent 
variable, it seems preferable to assert that in 
both cases the difference in attractiveness of 
the two alternatives was a direct function of 
the amount of time remaining until the 
decision. The authors wish to interpret this 
functional relationship, of course, as a mani- 
festation of reactance aroused by the loss of 
freedom implied by any felt preference for 
one of the alternatives over the other. 
Reactance theory is not, of course, the 
only conceptual formulation useful in at- 
tempting to understand predecisional behav- 
ior. Predecisional preferences for information 
have been studied by Mills and Ross (1964) 
and Mills (1965). Some of these data are not 
easily accounted for by reactance theory. 
Other writers (Janis & Mann, 1968; Mann 
et al., 1969) have predicted and demonstrated 
predecisional spreading of decision alterna- 
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tives. What is needed is an attempt to de 
lineate the separate domains of application 
of these several formulations ar to specify 
the conditions that govern their differential 
applicability. While the focus of concern in 
this paper has been to deal with some possible 
objections to the Linder and Crene experi- 
ment, perhaps its more importa ;ntribution 
will be to provide, in combination with that 
study, an empirical base broad enough to 
support continued investigation ‘nto prede- 
cisional cognitive processes. 
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ATTEMPTED LEADERSHIP IN THE LEADERLESS GROUP 
DISCUSSION AS A FUNCTION OF MOTIVATION 
AND EGO INVOLVEMENT * 
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basis of the Bass leadership model, it was hypothesized that ego- 
subjects attempt more leadership than do non-ego-involved sub- 
ad that this difference is greater under conditions of high motivation. 
ch-motivated subjects were expected to attempt more leadership 
v-motivated subjects under ego-involved conditions. The subjects 
$ Air Force ROTC cadets separated by means of a subject question- 


naire into high- and low-motivated groups (high or low in desire to enter 
advanced military training, respectively). Ego involvement was manipulated 
by instructions, Tt was found that non-ego-involved subjects attempted more 


leadership (p < 01) than did ego-involved subjects under both conditions of 
motivation. Although the results reject our hypothesis based on the Bass 
model. they are in line with findings by Back, that task commitment leads to 
restricied participation and “businesslike behavior"; such an effect is a possible 
explanation for the reduced attempted leadership by the ego-involved subjects 
in the present study. 


The leaderless group discussion is particu- 
larly suitable for leadership studies because it 
forces the group members to make their own 
decision regarding procedures and leads to 
Competition for leadership, providing that the 
problem deals with a topic familiar to all 
"Brüup members. lis use in attacking the prob- 
lem of small group leadership has been ini- 
tially limited, however, largely because of the 


difficulty encountered in finding a reliable 
index of leadership in group discussions. 
The pioneering studies concentrated on per- 


fecting the leaderless group discussion as a 
technique, discovering many variables that 
determine how subjects will interact in 
‘Small problem-solving groups, but holding 
m abeyance the questions of leadership and 
attempted leadership. 

Bass and Wurster (1953a, 1953b), Ziller 
and Exline (1958), and more recently 
Kiessling and Kalish (1961) found little or 
no relationship between leaderless group dis- 
ssion scores and the age of the subjects 
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when the groups were sexually homogeneous. 
Also, no direct relationship between physical 
position in the group and leadership was 
shown in the 250 studies reviewed by 
McGrath and Altman (1966) or in an earlier 
study by Bass and Klubeck (1952) that dealt 
with seating arrangement and proximity, al- 
though recent research by Ward (1968) indi- 
cates that seating arrangement may act 
as an “implicit communication net” in small 
discussion groups. 

A later study by Bass, Pryer, Caier, and 
Fling (1958) examined attempted leadership 
as a function of member control over others 
in the group, motivation, and task difficulty. 
On the basis of their desire to enter advanced 
ROTC (as ascertained by a previously ad- 
ministered questionnaire), the subjects, male 
college sophomores, were placed in high-, 
medium-, or low-motivated groups. Problems 
of varying difficulty were used as group tasks, 
and members were arbitrarily assigned differ- 
ing amounts of control over others. On the 
basis of a previously formulated theory (Bass, 
1955), attempted leadership was defined as 
any behavior directed toward the changing of 
any behavior of another person. Therefore, 
since practically all talk in leaderless dis- 
cussions is assumed to be directed toward 
influencing other members (Bass, 1949), par- 
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ticipation in the discussion and attempted 
leadership were held to be synonymous. At- 
tempted leadership was thus operationally de- 
fined as the amount of time each subject 
spent talking. 

Bass found that less "leadership" was at- 
tempted when the problems were easy, and 
that control over other members resulted in 
most attempted leadership under conditions 
of high motivation and problem difficulty. 
This conclusion was reached on the basis of 
an analysis of variance that showed a signifi- 
cant interaction between control and motiva- 
tion, and between control motivation, and 
difficulty. 

The function of increasing the task diffi- 
culty or the amount of member control may 
well have been to increase the subject's self- 
commitment to the task (ego involvement). 
When the subject is ego involved, he may be 
expected to attempt to exert more influence 
in the leaderless group discussion (Bass & 
Dunteman, 1963); this tendency should be 
strongest when the subject is also highly 
motivated. 

Fouriezos, Hutt, and Guetzkow (1950) 
commented on this same idea in an earlier 
study of 72 decision-making conferences, 
which showed that behavior that stems from 
“self-oriented needs" is essentially competitive 
behavior, with each subject attempting to use 
the group to his own advantage. A person 
high in ego-related or self-oriented needs 
should work for a group goal only if the 
group goal is synonymous with his own per- 
sonal goals. The researchers found that these 
groups, which exhibited high frequencies of 
self-oriented behavior, met for longer periods 
of time but completed fewer agenda items 
than did groups rated low on such needs. Tt 
can be expected, although no data were col- 
lected to verify the idea, that such groups 
demonstrated considerable attempted leader- 
ship, although little successful leadership. 

Iverson and Reuder (1956), in a review 
of 173 studies dealing with ego involvement 
as an experimental variable, found that verbal 
instructions that present a standard task in 
problem solving as a test of intelligence or 
performance serve to increase the subject's 
ego involvement in the task. They define ego 
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involvement as commitment to t 
ing from the possibility of in! 
the need to maintain self-estee 
ure to perform well on the tas! 
The relationship between ego 
and quality of response appear 
linear, maximally facilitative n 

Katz, Roberts, and Rob 
among others, attribute the di 
of extremely high ego invol 
presence of excessive drive, 
tested hypotheses of increased 
so far proven inconclusive. It ca 
therefore, that the presentation 
discussion as a test increases | 
mitment to the task, thus arous 
are satiable only through group 
consequence should be increased 
for leadership on the part of th 
groups, providing the anxiety 
ego involvement is not sufficient 
disruptive of task behavior. 

Although the Bass findings cons 
basis for the present work, an 
hypothesis can be drawn from findi 
Back (1951), who notes that t 
ment leads to “businesslike” behav 
part of the subjects; his discu 
to participate only to the extent : 
the attainment of the goal. This finding rum 
contrary to predictions resulting from the 
work by Bass, that task-committed (eg 
involved) subjects should attempt more leat 
ership under conditions of high motivation, 
with the expectation that they will participate 
more (ie, spend more time talking) than | 
will non-ego-involved subjects. 

The purpose of the present study was !ó 
test two hypotheses: 


1. Ego-involved subjects attempt more 
leadership in the leaderless group discussio? 
than do non-ego-involved subjects. This dit 
ference is greater when the subjects are alo 
highly motivated. i 

2. Highly motivated, ego-involved subjects 
attempt more leadership than low-motivate 4 
ego-involved subjects, while there is no sign" 
cant difference between the high- and low 
motivated, non-ego-involved subjects. 


METHOD 


periment were 55 male col- 
the basic Air Force ROTC 
versity, Catholic University, 
lege. The fact that the sub- 
srticipate rather than given 
s not expected to bias the 
| Adams (1959) found that 
iot differ significantly from 
t subj m the same subjective pool. 

ntiated in terms of motiva- 
estionnaire designed to deter- 

r desire to enter advanced 

s et al, 1958). Among a 
items" dealing with military 
© pertinent item was included: 


and training, 


v "important 
"Force ROT: 


for you to enter Advanced 


A great 

S Fairly m 

To some t 
K Comparative! 

F Not at al! 


little 


;ded either A or B were clas- 
ited; D and E responses were 


s et al (1958) found that the response to this 


Mor the population they used was consistent 
101 level of sirnificance with another five-choice 
Right now, how desirable is a career in the 


My service to vou?" It may be expected that 
tts interested in entering the program are more 
y motivated to perform well than are lower 
fated subjects on any task relevant to their 
t training 


tre 


€ two experimental treatments were designed to 
€zo involvement for one group and to hold 
ctor at a minimum in the other. Half of the 
Motivated subjects and half of the low were 
in each of the treatment groups. Subjects in 
involved groups received instructions to the 
t the results of their performance on the 
ould be made available to the Aerospace 
100 instructor for use in evaluating their 
training performance, These cadets’ signatures 
ced on a test roster, and their names were 
© the tape “for the record.” It was assumed 
le presentation of the task as a test, plus the 
to the interest of the instructor and the 
Orce in the subject’s results systematically 
ego involvement in the task (Iverson & 
» 1956). 
9-involved subjects were instructed that the 
e of the tape was to provide a record by 
the Air Force could review the use of a free 
On technique in decision making, possibly for 
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use in the future evaluation of freshman Air Force 
ROTC cadets. They were cautioned not to use names, 
but to refer to each other by number if such direct 


to speak freely. 

There were five subjects in each group. One alter- 
nate subject was assigned to each of the low- 
motivated groups, because an earlier study (Bass et 
al, 1958) found that these subjects tend not to report 
to the test area. The subjects were tested during 
regular dass hours to insure that the time of day 
that the groups were run did not present any 
systematic error in the results. 

The cadets were directed by their Aerospace Studies 
instructor (an Air Force commissioned officer) to 
report to one of the offices in the department, where 
they were seated at the sides of a trapezoidal table, 
allowing each member of the group to clearly see 
each other member. The experimenter, also a uni- 
formed Air Force officer, read to the group a pre- 
composed set of instructions pertinent to the group 
being tested, explaining the presence of the tape 
recorder and outlining the time limits involved. 
Although this procedure is not as realistic as it could 
be if maps or scale models were used, Aikman, 
Lorge, Tuckman, Spiegel, and Moss (1953) reported 
results which indicate that the degree of remoteness 
from reality has little or no effect on the quality of 
the solutions reached by discussion groups. A 10- 
minute time limit was used in order to produce 
an acceleration in verbal output without imposing 
undue psychological pressure on the subjects. 

The problems used in this study are identical to 
those used in the Air Force Squadron Officer School, 
and they dealt with effective and ineffective per- 
formance in leadership positions by American of- 
ficers during the Korean War. In all, there were 
seven problems; no group used more than three in 
the course of the 10-minute discussion. 

Following the discussion period, the subjects were 
administered a brief posttest that was essentially 
directed toward determining whether or not the 
subjects were able to discern the actual purpose of 
the study, and to record the subjects’ evaluation of 
how hard they tried to perform on the task. 

A blind technique for information processing was 
employed to control the effects of observer bias. The 
actual discussions were recorded on magnetic tape 
and analyzed for leadership-directed content by two 
trained observers who were ignorant as to the nature 
of the experimental treatment of the groups they 
were monitoring at any given time. The procedure 
for the analysis was essentially the one followed by 
Bass and Norton (1951), with each subject receiving 
one total score for the entire discussion. 


RESULTS 


The subject questionnaire was administered 
initially to 27 freshmen males from Catholic 
University, 41 from Howard University, and 
100 from Maryland State College. Correla- 
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SEX, DEPENDENCY, AND HELPING ' 
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Previous research has demonstrated that females are more likely than males to 


respond to dependency by giving help. 


upon to help females and males to help males, As a consequence, it is not 


clear whether the difference in help is 
or to the sex of the person receiving 


explanations accounts for previous findings. Dependency, sex of potential 
helper, and sex of dependent person were all varied factorially, and an intera: 
tion of dependency and sex of the dependent person was obtained. Dependenc 


made no difference in the help given 
dent received significantly more help 


and they also received marginally more help than males who were dependen: 


There were no effects due to the sex 
implies, therefore, that sex differences 
to the sex of the person receiving help 
it. 


Considerable evidence indicates that per- 
sons who are dependent are more likely to be 
helped than persons who are not dependent 
(Berkowitz, 1970; Berkowitz & Daniels, 
1963, 1964; Bryan, 1966; Goranson & Berk- 
owitz, 1966; Rosen & Bielefeld, 1967; Schop- 
ler & Bateson, 1965). However, there are 
factors that modify the positive relationship 
between dependency and helping. For exam- 
ple, Schaps (in press) and Schopler and Bate- 
son (1965) showed that dependent persons 
were helped more when the costs of helping 
were low than when they were high. Schopler 
and Bateson also found that males and fe- 
males differed in the amount of help they 
gave; when it cost little to give help, they 
found that females helped other females who 
were dependent more than they helped females 
who were not dependent, and that the reverse 
relationship held for males. Berkowitz (1970) 
reported finding the same interaction of de- 
pendency and the sex of the potential helper. 

Schopler and Bateson interpreted this in- 
teraction by postulating two distinct proc- 
esses: (@) among females, but not among 
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In all of these cases, females were called 


due to the sex of the person giving hel; 
it. An experiment tested which of these 


to males, but females who were depen 
than females who were not dependent, 


of the potential helper. The experiment 
in previous research were probably due 
and not to the sex of the person giving 


males, dependency serves as a cuc to § 
responsibility and consequent kep gi 
(b) among males, but not amo»: femal 


dependency serves as a cue that status í 
ferences between a dependent an 
pendent person will be reduced if the dep 
dent person is given help (see Schopler, 19 
for further discussion of the status 
process). 

Research on dependency and helping 2 
typically involved a potential helper of d 
same sex as the dependent person, and Schi 


in their willingness to help dependent pers? 
or whether both sexes have learned to M 
females more when they are dependent t 
nondependent, and to help males more Wi 
they are nondependent than when they 8 
dependent. In other words, the level of í 
pendency can serve as a discriminative | 
either for differential behavior by the sexes 
helping relationships (a personality expla 
tion) or for similar behavior by the SÉ 
each of which gives more help to fem! 
when they are dependent and to males w 
they are not dependent (a pure stimulus 
planation). The personality explanation 
dicts an interaction of dependency and. 
sex of a potential helper: Females $^ 
help a dependent person more than à i 
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ıl responsibility norm), 
xow the opposite effect 
esis). The pure stimulus 
im interaction of depen- 
if a dependent person: 
5ve females more help 
dent than when they are 
should give less help to 
to nondependent ones. 
id personality explana- 
y exclusive. It is possible 
cted effects. 


dependent one 
while males sh 
(status threat | 
explanation prc 
dency and the 
Both sexes sh 
when they are de; 
not, while both 
dependent mal« 
The pure stim 
tions are not mu 
to obtain both 


Same-sex dya nay have been used in 
previous resea helping because aid to 
the opposite se n be interpreted, not as 


altruism, but a 
ingratiation is a 
site-sex dyads 
males will be h: 
by males, and 
more by males t? 
there should be 
the potential hel; 


lf-seeking ingratiation. If 
rminant of help in oppo- 
vould be expected that 
d more by females than 
females will be helped 

: by females. In this case, 
interaction of the sex of 

z and the sex of the depen- 
:ratiation is more likely 
ider low dependency, a 
would be expected. A 
eraction of the sex of the 
d the sex of the dependent 
icted by taking a similarity 
: s that which Piliavin, Ro- 
tin, and Piliavin (1969) used to explain the 
ttend they observed for whites to prefer help- 
ig whites and blacks to prefer helping blacks. 
According to this similarity explanation, males 
should help males more than females, while 
females should help females more than males. 
There are five requirements for a strong 
‘est of these different predictions. The first is 
t the dependent person's sex should not be 
confounded with his or her other personality 
characteristics, which occurs whenever one 
Es or one male serves as the cue to sex. 
5 minimize this confounding in the present 
cmd the subjects never saw or heard the 
Eat person, and were merely exposed 
E eleted cues that carried only incidental 
poo about other personality character- 
The second requirement is that subjects 
i uld not perceive the dependency as the 
“pendent person's own fault because this 
Might depress the frequency of all helping 


three-way interact 
quite different ix 
potential helper < 
Person can be pr« 
perspective, such : 
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(Schopler & Matthews, 1965). To control for 
this in the present experiment, the attempt 
was made to make clear to all subjects that 
the dependency was not caused by the de- 
pendent person's incompetence, but rather by 
a request from a superior. 

The third requirement is that subjects 
should not think that they are in an experi- 
ment, and to this end, the subjects were led to 
believe that because the scheduled experi- 
ment could not take place, they were being 
asked to do the experimenter a favor. 

The fourth requirement is that the cost of 
giving help must be low. Schopler and Bate- 
son found the interaction of sex and depen- 
dency of potential helper only when the 
helper did not have to make a substantial 
sacrifice in order to aid the other. Therefore, 
the attempt was made to keep the cost of 
helping relatively low in the present study by 
choosing a simple task and by promising the 
subjects credit for research participation if 
they granted the experimenter's favor. 

The fifth requirement is to attempt to in- 
sure that the behavioral measure of helping 
will be sensitive to subtle differences in how 
help is given. For example, both males and 
females may give the same amount of help 
when a task begins, but may give different 
amounts sometime later. If this were so, sex 
differences might only be found in interac- 
tions that involve the length of time that 
subjects have spent on a task. Hence, a con- 
tinuous measure of helping was obtained over 
a 30-minute period, and a simple time-blocks 
variable was formed by dichotomizing the 
period into two 15-minute blocks. 


METHOD 

Subjects 

Subjects were 113 undergraduate introductory psy- 
chology students at the University of Ilinois at Chi- 
cago Circle, who volunteered to participate in order 
to fulfill part of a course research requirement. They 
were randomly assigned to the dependency and sex 
of experimenter conditions. The data from 9 sub- 
jects were omitted from the analyses, 4 of them be- 
cause another person entered and remained in the ex- 
perimental room, 3 of them because of inadvertent 
experimenter failure to set up materials correctly, 1 
of them because he misunderstood the instructions, 
and 1 of them because he guessed the true purpose 
of the experiment. These 9 had been assigned to 
seven of the eight experimental conditions, so it 
seems unlikely that there was any relationship be- 


292 


tween attrition and treatments. Data were thus 
analyzed from 104 subjects, 13 per cell. 


Design 


The purpose of the design was to test predictions 
from a personality and a pure stimulus explanation 
of previous data on the relationship of sex and 
dependency. It also allowed tests of whether cross- 
sex ingratiation occurs, and whether similarity of 
sex is related to helping. The design involved one 
within-subjects variable (time blocks) and three 
between-subjects variables (dependency, the sex of a 
potential helper, and the sex of a dependent person), 
combined into a 2X2X2X 2 factorial. 


Procedure 


Subjects volunteered for an experiment entitled 
“Class Projects,” and no cues to the experimenter's 
sex were evident when they signed up. When the sub- 
jects later appeared for their appointment, signs di- 
rected them to a room where the title of the experi- 
ment and the experimenter’s name were conspicuously 
posted. 

No experimenter was present to meet the subject, 
but there was a handwritten note on the door that 
was addressed personally to him or her. It served to 
explain the task and to manipulate the dependency 
variable. Among other things, the note requested 
the subjects to collate and staple copies of an 18- 
page questionnaire. Hence, a collating rack and an 
electric stapler were on the table in the experimental 
room, The electric stapler made a loud and distinc- 
tive noise when it was operated, and this served as 
a response indicator, which was recorded by an 
assistant in a nearby room. Subjects worked for 30 
minutes after their first response, and the assistant 
recorded the time of each response to the nearest 
second. 

At the end of these 30 minutes, the assistant en- 
tered the room and explained that the experimenter 
was not going to return. If the subject left the room 
during the 30 minutes and it did not appear as 
though he would return (e.g., wearing coat, carrying 
books), the assistant intercepted him at the door. In 
each case, the subject was then debriefed and sworn 
to secrecy before being dismissed by the assistant. 
The entire procedure took less than 1 hour for each 
subject. 


Independent Variables 


Sex of experimenter. In the condition where the 
experimenter was male, his name was posted at the 
experimental room as Peter Stone, and there were 
several sex-related cues in the room. There was a pipe 
in the ash tray, a man's raincoat and umbrella on 
the coat rack, and a sports magazine and a large, 
black briefcase on the floor. These cues were par- 
ticularly conspicuous because the room was small 
and, apart from these cues, contained only the few 
materials necessary for the experimental task. 

In the condition where the experimenter was fe- 
male, her name was posted as Nancy Stone, and the 
sex-related cues were an empty package of cigar- 
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ettes heavily advertised as a "woma: 
ginia Slims), together with lipstick. 
butts in the ash tray; a woman's ra 
and umbrela on the coat rack; 
magazine, which lay on a slim, mu 
briefcase. 

Dependency of the experimenter 
dependency condition, the handwritt 


rand" (Vir 
ked cigarette 
val, sweater, 
a woman) 
red striped 


the high. 
vote on the 


door read: 
I'm terribly sorry I’m not here, y discussion 
section leader wanted to see me same time 
you signed up for. We are doing dual proj- 
ects for our Psych. 100 discus tion—just 
about our entire discussion secti le depends 
on the project. I have over 100 s scheduled; 
for two hours from now, and ! ! t finished 
preparing my questionnaires. 
Since I am really in a bind, would lease help 
me out by assembling and staplin ionnaires? 
The stuff is all in the room, Just put the 
together in order, staple them in upper left- 
hand corner, and drop them in the ' 
I should be back in a short w^ Of cou 
you'll get credit—it's just that I ] the help 
right now. Thanks. 
Nancy S. [o er S.] 
In the low-dependency condition, th: te was 
actly the same except that the phrase "two hours 
from now” in the first paragraph was changed to 


“next week," and the phrase “Since I 
bind” in the second paragraph was o: 

Sex of subject. Fifty-two males anc 
were run in a random order. 

Time. The 30 minutes spent on the task 
divided into two 15-minute blocks for analysis. A 
measurement problem arose because some subjects) 
after working on the task for a while, began col- 
lating several questionnaires before stapling all of 
them in rapid succession. If more than two time 
blocks were distinguished, many block boundaries 
intercepted a run of collating. As a result, a count 
of the number of questionnaires stapled during this 
block would be artifactually low. Fortunately, 
use of only two blocks avoided this measurement. 
problem, although the simple dichotomy restricts 
conclusions that can be drawn regarding the nature 
of the helping function over time. 


Dependent Variable 
ques- 


The dependent variable was the number of | 5 
tionnaires stapled. This is presumably an indicit? | 
of the help given to the experimenter by the subj 1 


RESULTS 
ance 


As Table 1 shows, an analysis of vari 
of the number of questionnaires assem) | 
revealed a significant interaction of depen” 
dency and sex of experimenter. The intera 
tion is depicted in Figure 1, and it is due Pi 
marily to the greater help received by female 
experimenters under high- than under [of 
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rABLE 1 
ANALYSIS OF V \NCE ON THE NUMBER 
or QUEST AIRES ASSEMBLED 
Source df MS F 
Between subjects 
Dependency (A 1 | 117.00 2.13 
Sex of experime: 1 | $56 <1 
Sex of subject ( | v4 22.23 gi 
AXB | 1 | 22848 4.17* 
= AXC or 27.77 «1 

BXC 1 16.17 «1 
AXBXC |- nde E 

Error betwee: 96 54.82 

Within subjects 

Time blocks (D Led 56.08 9,07*** 
AXD 1 15.08 2.44 
BXD | 1 | 5002| 809** 
CXD E~. 31 <1 
AXBXD |. 32 .02 <4 
AXCXD DN: 1.23 <1 
BXCXD | 1 A7 <4 
AXBXCXI [08 233: eed 

Error within | 96 6.18 

* 
ERO 
=p «.005 


dependency conditions (F = 6.18, df= 1/ 
6, p< .05).* There was also a marginal ten- 
dency under high dependency for females to 
Teceive greater help than males (F = 2.78, df 
=1/96, p< .10). 
_ There was also a significant main effect of 
time blocks and a significant interaction be- 
teen sex of experimenter and blocks (see 
Table 1 and Figure 2). Female experimenters 
teceived more help during the second half 
than during the first half of the 30 minutes 
(F=17.14, df=1/96, p<.001), while 
E was no difference for male experiment- 


DISCUSSION 


: in the present study, male and female po- 
tential helpers did not differ in the help they 
E, to dependent or nondependent persons. 
ndeed, all F values involving the sex of the 
p helper were less than one. There is 
te evidence, then, that females reacted to 
E with greater feelings of social re- 
de sibility or nurturance, Nor is there evi- 
ence that males help dependent persons more 
is ue, a finding that has previously 
ph obtained in a naturalistic setting where 
E strength and exposure to potential 
E harm were concomitants of giving 
“p (Piliavin et al., 1969). 

* Comparisons were cedure 
Sutlined in Winer nee o es 
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Dependency 

Fic, 1. Number of questionnaires assembled as a 
function of dependency and sex of dependent person 
(experimenter) . 

The fact that males did not help dependent 
males less than nondependent ones suggests 
that status threat did not operate among males 
in the present study. This lack of support 
for the status threat hypothesis may have 
resulted because the subject, who was called 
upon to give help, may have seen the depen- 
dent experimenter as his status equal, since 
the subject thought that both were students 
in the same introductory psychology course. It 
is also possible that the subject saw the ex- 
perimenter as his status superior because the 
latter was, after all, the experimenter. How- 
ever, in the studies by Schopler (1967) and 
Schopler and Bateson (1965), the dependent 
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Fic. 2. Number of questionnaires assembled over 
15-minute time blocks as a function of sex of de- 
pendent person (experimenter). 
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person was subordinate in status to the poten- 
tial helper. It may be that the particular 
status difference that Schopler used is neces- 
sary for the operation of status threat. 

In the present study, there was neither 
evidence that male potential helpers gave 
greater help to females than to males nor that 
female potential helpers gave greater help to 
males than to females. This weakens the in- 
gratiation argument for avoiding opposite-sex 
dyads in research on helping. However, in this 
experiment there was no face-to-face interac- 
tion with a person of the opposite sex, and it 
may be that such interaction is necessary for 
ingratiation or similarity effects. 

The present study did support a pure stim- 
ulus explanation of sex differences in helping. 
Although dependency was not a significant cue 
for determining helping reactions to males, it 
was significant for determining reactions to 
females. This makes it likely that the Sex x 
Dependency interactions found in previous 
research with same-sex dyads are due, not to 
the sex of persons who can give help, but to 
the sex of persons who need help. 

The present study also showed that both 
males and females increased their help giving 
over time when a female requested help, but 
not when a male did. It would be tempting to 
speculate that the greater help given to de- 
pendent females resulted from an increase in 
helping over time. However, there was no evi- 
dence that the increase in the amount of help 
given was greater when the female needing 
help was dependent, as compared with when 
she was not dependent. 

The absence of a control group for which 
dependency was moderate or unspecified 
makes it unclear whether females were helped 
more because they were dependent, whether 
they were helped less because they were not 
dependent, or whether both causal forces op- 
erated. Inspection of the present interaction 
and of the three interactions reported by 
Schopler and Bateson led to the speculation 
that both forces operated. All four interactions 
showed a consistent tendency for females to 
be helped more than males when they were 
dependent, and for males to be helped more 
than females when they were not dependent. 
Thus, in all these studies, help seems to have 
been withheld from females who asked for it 
and were not immediately dependent on the 
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potential helper. Perhaps nondependent fe 
males who ask for help are seen by both males 
and females as exploiting any advantages 
they think their sex might have, while depen- 
dent females are seen as acting le. 

The failure of the dependency variable to 


affect the help given to males wight be ex- 
plained in the following way. ^ ales in the 
United States culture are supposed to be inde- 
pendent and competent—they not ex- 
pected to ask for aid. If they isk, they 
may be perceived as acting ou: of role. A | 
likely inference that a potentia! helper might 
draw from an out-of-role request for help is 


that irrespective of the level of dej [ 
male requesting help must have an extra 
nary need for it. 
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The po 
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found to be 


An increasing number of studies have indi- 
d that frequency and affect are corre- 
(Zajonc, 1968). Subjects prefer the 
muli they have seen most often. In an at- 
empt to explain this phenomenon, Harrison 


1968) suggested the response competition 
Ypothesis, An unfamiliar stimulus in some 
ay resembles many other stimuli and tends 
ses appropriate to these 
g competition among the 
sponses is associated with a state of tension 
‘fect. However, repetition of 
the elimination of some 
sponses, Reduction in 
ition causes a reduction in 
qa live affect and therefore a more positive 
dung of the stimulus. In summary, (2) repe- 
Mon causes a reduction in response competi- 
Hn, and (5) a reduction in response compe- 
Hon causes an increase in positive affect. Evi- 
Mice has been gathered to support both seg- 
Wnts of the hypothesis (Harrison, 1968; 
latlin, 1970). 
) Since it seems likely that response competi- 
n may in fact be a mediator in the fre- 
Pency-affect relationship, the concept of 
Ponse competition must be clarified by de- 
ng its relationship to other psychological 
lables. For example, it would be interesting 
D learn whether it is correlated with stimulus 
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COMPETITION, RECOGNITION, AND AFFECT 
MARGARET W. MATLIN : 


ive correlation between frequency and liking has been explained by a 

hypothesis: (a) Repetition causes a reduction in response competi- 
id (b) a reduction in response competition causes an increase in posi- 
Evidence has been gathered previously to support both parts of 
sis. The present experiment further examines the concept of 
ompetition, specifically in relation to stimulus recognition. Of three 
recognition, only recognition latency proved to be consistently 
response competition manipulation, Furthermore, recognition was 
related to affect only under certain conditions. Finally, the data 
:dditional evidence of the correlations between frequency and liking 
veen response competition and affect. 


recognition. In the present experiment, stim- 
uli whose level of response competition had 
been previously varied were given to the sub- 
jects in a recognition test. The hypothesis is 
that response competition and recognition are 
inversely related. Words associated with high 
response competition should be the least often 
recognized and have the longest latency before 
recognition. Furthermore, novel stimuli closely 
resembling these high response competition 
stimuli should generate a high false alarm 
rate. 

Response competition was manipulated in 
the same way as it was in Experiment III of 
a series of studies performed by the author 
(Matlin, 1970). Tt can be argued that among 
the many covert responses that subjects make 
to a novel stimulus, the pronunciation of that 
stimulus is one of them. It is thus possible to 
manipulate pronunciation response competi- 
tion by varying the stability of syllable ac- 
centuation within the stimulus. In the present 
study, frequency of stimulus exposure was 
also manipulated. The effects of both varia- 
bles, response competition and frequency, on 
recognition and affect were examined. 

In addition to determining the relationship 
between response competition and recognition, 
it is possible to study the correlation between 
recognition and affect. Several questions may 
be asked: (a) Of all the stimuli which the 
subject says are familiar (those stimuli to 
which he says old"), will he prefer those he 
has in fact seen (“old”/old) to those he has 
not seen (“old”/new)? (b) Considering the 
stimuli which he calls “new,” will those stim- 
uli he has in fact seen (“new”/old) be pre- 
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ferred to those he has never seen (“new”/ 
new)? (c) Considering the stimuli which the 
subject has in fact seen previously, will he 
prefer the words he says are familiar (*old"/ 
old) to those he says are unfamiliar (“new”/ 
old)? (d) Of the novel stimuli, will he prefer 
the words he says he recognizes (“old”/new) 
to the words he does not recognize (“new”/ 
new)? 

The affect data also provide a check on 
two hypotheses tested in the past: (2) the 
general hypothesis that affect varies with 
familiarity—subjects should prefer stimuli 
seen six times to those seen three times, 
which in turn should be preferred to those 
never seen before; (5) the second half of the 
response competition hypothesis—stimuli in 
the low response competition condition 
should be preferred to stimuli in the high 
response competition condition. 


METHOD 
Material 


Nonsense word stimuli were formed in the follow- 
ing manner. Each of the 18 words began with a 
different consonant, thereby utilizing all consonants 
except Q and X. Vowels (V) and consonants (C) were 
written on slips of paper and placed in separate piles. 
Letters were then drawn in the order CVCCVCV, 
thus forming seven-letter, three-syllable nonsense 
words. Words were rejected if they could be pro- 
nounced in only two syllables (e.g, WANMIZE) or 
pronunciation was ambiguous (e.g., LuTHoBA). Eight- 
een other words, to be used as novel stimuli in the 
recognition test, were formed in a similar manner, 
with each word matched to a stimulus word begin- 
ning with the same letter, 

Stimuli were underlined to indicate which syllable 
was to be accentuated, In the high response compe- 
tition condition, each of the three syllables was 
accentuated equally often throughout presentation 
(eg, NANSOMA, NANSOMA, NANSOMA). In 
the low response competition condition, only one 
syllable was accentuated in the course of the experi- 
ment; for each stimulus, the syllable to be accented 
was chosen randomly (e.g., WUTMIJO). In a third 
response competition condition, stimuli appeared 
with no underlining to indicate pronunciation, and 
subjects were free to determine the accentuation for 
a word (eg., FUBVIZA). It was felt that this con- 
dition would approximate most closely the natural 
exposure situation. Frequency of exposure was also 
varied: subjects pronounced each word either three 
or six times. All nonsense words rotated through all 
three response competition levels and both frequency 

levels, resulting in six different stimulus conditions. 
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Equipment 


Words for familiarization trainin typed on 
3 X 5 inch index cards. The order of cards in the 
deck was randomly determined, taki: into account 
the frequency with which individus! stimuli were te 
appear. For recognition testing, words were again 
typed on index cards. None of the w had under- 
lined syllables. Finally, two-page bootlets listing all 
words, followed by 7-point scales going from "good" 
to “bad,” were distributed for affect judgment. 
Procedure 

Subjects were told that they were to pronounce 
nonsense words. Pretraining consisted o! pronouncing 
13 nonsense words: 4 with the first syllable under- 
lined, 4 with the second syllable, 4 with the third 
syllable, and 1 with no underlining. The experimental 
stimuli were then presented. When the incorrect 
syllable was accented or the pronunciation wat 
otherwise incorrect, the experime said “try K 
again.” Incorrect pronunciations v not corrected | 
more than once in succession. Subjects were ther 
told that they would be shown n ise stimuli, 
some of which they had pronounced before and 
some of which they had never seen. They were to 
respond “old” or “new” to each w Using a hid- 
den stopwatch, the experimenter recorded the latency 
from the time the index card was ed over, Te 
vealing the nonsense word, to the time of the sub- 
ject’s response. Finally, subjects were >d to rate 
the words on the basis of how much they liked them. 
Subjects 

Subjects were 48 naive students in introductory 
psychology at Wagner College. Four men and 4 


women were assigned to each of the six stimulus 
conditions. 


RESULTS AND DISCUSSION 
Pronunciation Period 


First, descriptive data from the pronuncia 
tion period are considered. Inspection of the 
accentuation of all free choice stimuli It | 
vealed that subjects showed an overwhelming 
preference for emphasizing the second syllable. 
Averaging across all subjects, 12.3% of the 
words received primary stress on the first syl- 
lable, 83.496 on the second, and 4.3% on the 
third. These results confirm the importance 0 
including representatives of all three accenti 
ation patterns among the low response compe 
tition stimuli. Subjects were quite accurate M 
their pronunciation of the stimuli for whi 
accentuation was specified. The average SY 
ject made 2.6 errors in accentuation and o 
7 other pronunciation errors (e.g, JIPZP- 


I 


> pronunciation of 54 


, Frequency, and 


hypothesized that stimuli in the high 
competition condition would not be 
zed as well stimuli in the low re- 
competiti: the free choice condi- 
Three measures of recognition were 
red from the data: accuracy of recogni- 


nition, and false alarms 
nuli beginning with the 


cognition. The number of 
failed to recognize in a 
vsed as a gauge of lack of 

. Subjects were generally 

ir recognition of the stim- 
stion of items in the low 
esponse competition condi- 


z scores were converted by the 
X + 2) before the data 
alysis of variance indi- 
' was a significant varia- 

dj = 1/210, p < 01). As ex- 
, creasing the frequency of exposure 
es the probability of recognition. Re- 
competition was also significant (F = 
bdf = 2/210, p < .05). A Newman-Keuls 
sis (Winer, 1962) indicated that the high 
e competition stimuli were recognized 
often than words in either the low re- 
competition or free choice condition 
X 01), although the latter two conditions 
"hot differ significantly from one another. 
Only significant interaction was that 
equency X Response Competition (F = 
E ,di— 2/210, p < 01). A Newman-Keuls 
alysis showed that the average number of 


} TABLE 1 
_ AVERAGE NuwmrR or RECOGNITION ERRORS 


Response competition condition 


High Low 
response response Free choice 
competition competition 
A7 23 47 
1.02 48 50 


Number of possible errors = 3. 
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TABLE 2 
AVERAGE Latency or RECOGNITION (in SECONDS) 


Response competition condition 


Fry High Low 
response response Free choice 
competition competition 
6 1.7 1.6 1.5 
3 2.0 1.7 1.7 


errors in the low frequency, high response 
competition condition differed at the .01 level 
from all other groups. (Although some other 
comparisons showed borderline significance, 
only the comparison of low frequency, low 
response competition versus high frequency, 
free choice reached the .05 level of signifi- 
cance.) This indicates that stimuli eliciting 
high response competition may not be recog- 
nized following a low frequency of exposure. 
With repetition, however, they are recognized 
just as well as stimuli in the other conditions. 
At a high frequency of exposure, then, a ceil- 
ing effect seems to cancel the differences 
among the response competition conditions. 
Latency of recognition. Table 2 indicates 
the average latency of recognition. A latency 
was included in the average only if the stimu- 
lus had been recognized by the subject. La- 
tency scores were transformed by the formula 
100/x prior to analysis. When analysis of 
variance was performed, frequency was a sig- 
nificant variable (F = 15.73, df = 1/210, p 
< 01). Not only are frequently encountered 
stimuli recognized better, but they are recog- 
nized faster. Response competition was also 
significant (F = 14.25, df = 2/210, p < 01). 
Newman-Keuls analysis showed that high 
response competition stimuli are not recog- 
nized as fast as stimuli in the other groups 
(low response competition, p< .05; free 
choice, p < .01), but again these latter two 
groups did not differ from one another. Un- 
like in the case of the error analysis, the Fre- 
quency X Response Competition interaction 
was not significant; it seems that the latency 
measure is more sensitive to subtle differences 
among the response competition conditions, 
even when the stimuli have been encountered 
six times. The Frequency X Stimuli interac- 
tion was significant (F — 3.08, df — 5/210, 
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< .05), as was the Response Competition X 
Stimuli interaction (F — 4.21, df — 10/210, 
p < 01). In neither case did an examination 
of the stimuli in each condition reveal a sys- 
tematic pattern or obvious explanation for the 
interaction. 

False alarms. The data on false alarms were 
also analyzed. It will be recalled that each of 
the novel words in the recognition test was 
matched to a stimulus word by virtue of be- 
ginning with the same letter of the alphabet. 
The central hypothesis is that stimuli in the 
high response competition condition will not 
be as easily recognized as stimuli in the other 
conditions. Possibly, confusion in the identi- 
fication of a stimulus increases the probabil- 
ity of false recognition of the novel stimulus 
beginning with the same letter. The incidence 
of false alarms should therefore be greatest 
among stimuli resembling the low frequency, 
high response competition words. On this 
premise, each false alarm was classified for 
every subject according to the frequency and 
response competition conditions of the stim- 
ulus with which it could be confused because 
of the same initial letter. It is readily apparent 
from the data (Table 3) that no frequency or 
response competition effects exist. 

In summary, one index of recognition (rec- 
ognition accuracy) indicates that response 
competition level affects recognition only at 
low levels of exposure, a second index (recog- 
nition latency) indicates that it affects recog- 
nition at all levels of exposure, and a third 
(false alarms) indicates that it does not 
affect recognition. 


Response Competition, Frequency, 
and Affect 

Following the recognition test, subjects 
rated all stimuli (Table 4). An analysis of 


TABLE 3 
AVERAGE NUMBER OF FALSE ALARMS 


Response competition condition 


Frequency 


High Low 
response response Free choice 
competition competition 
6 29 41 | 19 
3 25 | 27 | 27 


| 


Note.—Number of possible false alarms = 3, 
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TABLE 4 
AVERAGE RATINGS OF STUCULI 


Response competit i ondition 
Eeay High Low 
response respon Free choice 
competition competiti 
6 3.3 2.8 2.8 
3 3.7 | 3.2 3.2 


acd = good; 7 = bad. Average rati 


variance demonstrated that ! frequency 
and response competition were highly signifi- 
cant (frequency, F = 21.9, dj /210, p< 
.01; response competition, F = 10.1, df =2/ 
210, p < .01). A Newman-Keu!s analysis in 


dicated that high response comp 
uli were rated significantly 

than stimuli in the other two conditions (f. 
< 05), although these latter two did m 
differ from each other. The Stimuli X Fre 
quency interaction was also significant (F= 
3.3, df = 5/210, p < .05), although no reason 
for the interaction could be derived from 
examination of the data. The stimuli X Re 
sponse Competition interaction reached bor- 
derline significance (F = 2.5, dj = 10/210, f, 


ition stim: 
re negative 


4.1 (where 1 = good and 7 = bad), while the 
average rating of the three-exposure stimuli 
was 3.4, The difference between these groups 
was significant (t = 3.89, df = 47, p < Ol): 
(In actuality, of course, the novel stimuli I 

been seen once during recognition testing 
during which the three-exposure stimuli wert 
also given an additional exposure.) Thus 
novel stimuli are liked less than low-frequen 
stimuli, even when the average rating for low 
frequency stimuli includes the relatively W 
popular high response competition words. 


« .10). i 
The average rating of the novel stimuli was 
Recognition and Affect 


Accuracy of recognition, The affect data 
are also useful in analyzing the relationship 
between recognition and liking. Four com- 
parisons are possible (not every subject Y 
a recognition error or a false alarm, S0 an 
parisons are based on only partial popu“ 
tions): (a) “old”/old versus old” /nef* 
Does affect depend on whether the subject 


| 


ao 
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fact seen the stimulus, or are all stimuli he 
considers old judged the same? For each of 
the 27 subjects * made at least one false 
alarm, average a scores were compared 
for false alarms for correct recognitions 
(based on the low juency, free choice con- 
dition). A 7 test wed that subjects do in 
fact prefer the s li they have truly seen 
before (t= 2.44, df — 26, p<.02), even 
though they cannot distinguish between the 
two sets of stimuli in a recognition test. (b) 
“New” /new versus ^new"/old: Similarly, are 
all stimuli considered new judged the same, 
even though the subject may actually have 
seen some of the siòmuli before? For each sub- 
ject who made a ist One recognition error, 
mean affect for nition error stimuli was 
compared with m: affect for all new stim- 
uli. There was a icant difference between 
the two groups | , df = 42, p< .02). 
Subjects prefer stimuli they have truly seen 
before, even if they cannot recall having seen 
them. Comparisons 1 and 2 therefore indi- 
cate that affect varies with the objective his- 
tory of exposure, even when the subjective 
impressions of fz arity are identical. The 
two other comparisons involve a different ques- 
tion: When the objective history of exposure 
is identical, do ratings depend upon the sub- 
jective impression of familiarity? (c) *New"/ 


Old versus “old” /old: When subjects have in 
fact seen all the stimuli, do they prefer the 
Stimuli they recall having seen? Each sub- 
Ject’s mean rating for recognition errors was 
Compared with the mean rating for the words 
in the same frequency and response competi- 


tion conditions that were correctly recog- 
nized. The difference was significant (4 = 
Eu, df = 42, p < 02). (d) “Old”/new ver- 
E new"/new: When all stimuli are novel, 
0 subjects prefer the stimuli they think they 
E before? Here, the difference was 
i Significant and was in fact in the opposite 
E m from the prediction (t < 1.00, df = 
E^ 0). Subjective impressions of fa- 
Niliarity therefore seem to be important only 


NI 
n the case of words that have been seen 


fore. 
— of recognition, It is also possible to 
ES recognition in terms of latency, which 
: found to be a very sensitive measure in 
analysis of response competition and rec- 
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ognition. Are stimuli that are recognized 
quickly preferred to stimuli recognized more 
slowly? For each subject, average affect for 
the three correctly recognized stimuli having 
the shortest latencies was compared with the 
average of the three having longest latencies. 
A t test revealed only borderline significance 
(£1.90, df — 47, p< 10). It therefore 
seems that the only dimension of recognition 
that is relevant to affect is whether the sub- 
ject recognized the stimulus or not. Given 
that he has correctly recognized it, affect does 
not depend on the speed of recognition. 


CONCLUSION 


The data clearly support the general hy- 
pothesis that frequency and affect are related, 
because words seen six times were preferred 
to words seen three times, which were in turn 
preferred to novel stimuli. Furthermore, the 
inverse relationship between response compe- 
tition and liking was again demonstrated. Tt 
is significant that the data confirm the results 
of a previous study (Matlin, 1970) even 
though different subjects, different stimuli, 
and different numbers of exposures were used. 

The constancy of the correlation between 
response competition and affect underlines the 
importance of investigating the relationship 
between response competition and other psy- 
chological variables. It is possible that re- 
sponse competition is so closely correlated 
with an established variable, such as stimulus 
recognition, as to render its introduction 
superfluous. On the other hand, partial cor- 
relation with an established variable would 
help in the delineation of the concept of re- 
sponse competition, which previously has 
been defined only operationally. 

In investigating the relationship between 
response competition and recognition, only 
one index, latency of recognition, was found 
to be sensitive to the manipulation of re- 
sponse competition at both frequency levels. 
Thus, response competition and recognition 
are related, but they are not identical. It 
would be interesting to repeat this experiment 
using the tachistoscope to provide a more 
sensitive index of recognition speed and, in 
addition, to determine whether the effect 
would hold true for stimuli at even higher 
frequencies. 


eg ee et aS 
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It is interesting to note that latency of free 
association had been used as an index of re- 
sponse competition in previous studies (Har- 
rison, 1968; Matlin, 1970), and now la- 
tency of recognition also appears to be corre- 
lated with response competition. Just as a 
long latency in a free association test indi- 
cates conflict over what a stimulus reminds 
the subject of, long latency in a recognition 
test may indicate conflict over whether a 
stimulus is familiar. If a subject does not 
know what a stimulus reminds him of, he is 
also unsure whether it is familiar or not, 

To the extent that stimulus recognition is 
related to response competition, it should be 
related to affect. Statistical analyses showed 
that affect depended on whether or not the 
subject had truly seen the stimulus, but it 
was less clear if affect depended on whether 
he said he had seen the stimulus. In the case 
of novel stimuli, affect was no more positive 
for stimuli believed to be familiar. On the 
other hand, in the case of familiar stimuli, 
affect did depend upon subjective familiarity. 
However, a further analysis revealed that 
affect for familiar stimuli was unrelated to 
recognition latency. Clearly, stimulus recog- 
nition and affect are not related in a simple 
manner. It is possible that the subjective fa- 
miliarity is in fact related to affect, but a 
second factor is responsible for these incon- 
sistent results. At least in some cases, the 
subject presumably is aware that he has made 
a mistake in recognition. The negative affect 
from the mistake becomes attached to the 
stimulus. If stimuli on which subjects make 


more mistakes are liked less, th: 
and "old" /new are rated more n: 
ratings of *new"/old stimuli a 
tive, the difference in affect | 
and “old”/old words is increa 


'new" /old 
'ative, Since 
nore nega. 
"een them 
eyond its 


natural level (which is revealed ` the border- 
line significance of the recog» on latency 
data) and the comparison is ly signifi. 
cant. On the other hand, sin atings of 
“old”/new words are also more tive, they 


are brought closer in affect to t new 
stimuli, accounting for the sin 
two groups. The preceding expl: 
upon two assumptions: that 
aware when they are uncertain 
certainty is unpleasant. The fi: 
could be tested by obtaining 
certainty from the subject after h 
“old” or “new.” That uncertainty : 
ant, or that competition among :-* response 
“old” and “new” is unpleasant, is- in fact, a 
rewording of one hypothesis central to this 
experiment. 
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hologists view attitudes as 
ned evaluativ: onses that are acquired 
formity with the principles of learning 
Fishbein, 1967; Rhine, 1958; Staats & 
j 1958). Accordingly, when a neutral 
us (attitude object) is paired contigu- 
with other stimuli that regularly evoke 
ative responses, the evaluations attached 
other stimuli become associated with 
Previously neutral stimulus. The degree of 
ion acquired by the attitude object, 
is a function of the consistency of rein- 
Bent or strength of association, and 
ance to extinction or change is inversely 
to consistency of reinforcement during 
sition, 

€ predicted effects of partial reinforce- 
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REINFORCEMENT EFFECTS IN ATTITUDE 
ISITION AND COUNTERCONDITIONING ' 
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theory principles of acquisition, partial reinforcement, and counter- 
vere applied to the development of an attitude. Four reinforcement 
: O6, 80%, 70%, and 50%, were used to establish a positive at- 
lowing attitude acquisition, the subjects either had their attitudes 
| by a questionnaire or they experienced a counterconditioning procedure 
to change their acquired attitudes. Polarity of the acquired attitude 
irectly with the percentage of reinforcement during acquisition, and 
to counterconditioning was found to vary inversely with consistency 
rcement. Attitude assessment results indicated that the attitude object 
acquired evaluative meaning. It is concluded that partial reinforce- 
cts in attitude acquisition and change may be obtained in con- 
with learning theory principles. 


» 


ment on attitude acquisition have been rea- 
sonably established in both classical condi- 
tioning (Insko & Oakes, 1966; Staats, Staats, 
& Heard, 1960) and concept-formation para- 
digms (Rhine & Silun, 1958). However, the 
inverse relationship between consistency of 
reinforcement and extinction or countercondi- 
tioning has not been clearly substantiated. 
Insko and Oakes (1966) failed to obtain any 
extinction even when subjects may have been 
aware of reinforcement contingencies and the 
demand characteristics of the experimenter. 
Insko and Oakes suggested that during the 
course of conditioning, the stimuli had ac- 
quired symbolic meaning that could only be 
changed by counterconditioning. Rhine and 
Silun (1958) attempted to change attitudes 
acquired under various partial schedules by 
a counterconditioning procedure. Although 
Rhine and Silun found that attitudes acquired 
under a 100% schedule changed most readily, 
the data from the partial reinforcement 
groups did not show the predicted ordering 
in counterconditioning. 

The present experiment was designed to 
extend the previous work on partial and con- 
tinuous schedules on attitude acquisition and 
to document their effects on countercondition- 
ing. The study used the concept-formation 
paradigm introduced by Rhine and Silun. On 
each trial the subject is presented with an 
evaluative trait and is required to indicate 
whether or not the trait is characteristic of 
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the people. As in probability learning situa- 
tions, the degree of association is manipulated 
by varying the proportion of traits that are 
reinforced “characteristic.” 

According to the Rhine (1958) and Fish- 
bein (1967) formulations, the traits evoke 
evaluative mediating responses that become 
attached by conditioning to the attitude ob- 
ject. The subjects learn the concept that the 
group is good (or bad), and the strength of 
the concept or the degree of attitude is a 
function of the consistency of reinforcement. 
Responses on subsequent trials are mediated 
by the degree of concept learning. Attitude 
concept reversal by counterconditioning was 
predicted to be inversely related to prob- 
ability of reinforcement in acquisition. 

The measure Rhine and Silun (1958) used 
to assess attitude strength involved the same 
association task as in acquisition but without 
feedback. This measure indicates only that 
the subjects learn to associate traits of a 
particular evaluation with the attitude object, 
but not that the concept itself has acquired 
meaning and is capable of eliciting an evalu- 
ative response. The present study, therefore, 
attempted to assess the subjects’ attitudes 
toward the concept more directly through an 
evaluative scale traditionally used to assess 
attitudes. 


METHOD 
Experimental Design 


Three independent variables were manipulated in 
the present study: type of adjective, percentage of 
reinforcement consistency, and postacquisition experi- 
ence, Two types of adjectives were presented: half 
of the subjects were exposed only to positive adjec- 
tives, and half were exposed to both positive and 
negative adjectives. Within each adjective presenta- 
tion condition, the subjects were equally divided 
among four percentages of reinforcement: either 
100%, 80%, 70%, or 50%. Following the acquisition 
phase, the subjects were divided into either a 
counterconditioning group or an attitude assessment 
group. 


Subjects 


The subjects were 160 students enrolled in the 
introductory psychology course at the University 
of Massachusetts, who volunteered for the experi- 
ment to obtain required experiment participation 
credit. Ten subjects were randomly assigned to each 
of the 16 cells of the experimental design. The 
subjects were run in groups ranging in size from 
one to eight. 
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Stimuli 


The stimuli were adjectives selec m Ander 
son’s (1968) list of 555 personality t adjectives, 
in which the normative rating oí likableness 
of a person described by each tra viven. The 


positive and negative adjectives w^ 
the 150 most favorably rated an 
unfavorably rated traits in the list 


iken from 
75 most 


Procedure 

Each subject was told that a lit: wn group 
of people had been studied by »pologists, 
missionaries, physicians, and others that thes 
visitors had agreed upon a numb: personality 
traits that characterized these peop! n each trial, 
the experimenter would read a trait, ead the sub- 
ject’s task was to predict whether ‘he trait was 
“characteristic” or “not characteris of this un- 
named group of people. After the t made à 
response, the experimenter informed the subject as t 
whether the trait was or was not : cteristic of 
the group of people. 

Reinforcement schedules during ition. Four 
reinforcement schedules were used stablish & 
positive attitude during 100 acquisiti: ils: 100%, 
80%, 70%, and 50%. Subjects in t ll-positive- 
stimulus presentation group receive jO positive 
adjectives and were reinforced with “ch: vacteristic” of 
“not characteristic’ according to appropriate 
schedule. Subjects in the mixed-stimuius group Te- 
ceived 50 positive and 50 negative adjectives. The 
positive adjectives were reinforced “cl iracteristic" 
and the negative adjectives were reinforced “not 
characteristic" according to the appropriate schedule. 
For example, the subjects in the 80* nforcement 


s and 10 
c," and 10 
tives rein- 


condition received 40 positive a 
negative adjectives reinforced “cha: 
positive adjectives and 40 negative a 
forced “not characteristic.” Adjective: randomly 
sampled from the adjective pools tw and wert 
ordered in two random sequences. Two random 
sequences of reinforcement were used for each 
schedule of reinforcement. 

Counterconditioning. Half of the subjects served 
in the counterconditioning phase of the study d 
signed to change the previously acquired positive 
attitude to a negative attitude. These subjects ™ 
ceived, without interruption, 50 additional adjectives 
Subjects in the all-positive-stimulus groups rece! 
50 positive adjectives that were reinforced 100% 
of the time as “not characteristic.” Subjects 1, 
mixed-stimulus groups received 25 positive adjectives 
reinforced “not characteristic” and 25 negative adject- 
tives reinforced “characteristic” on 100% of the tna™ 
Two samplings of adjectives, orders of presentation 
and sequences of reinforcement were used. 

Attitude assessment. The subjects who We 
counterconditioned had the strength of their att, 
toward the hypothetical group of people assessed 11 
stead. Upon completion of the acquisition phase; te 
subjects were given a 17-item Thurstone-type ae 
scale. The scale measures attitudes toward any 8? 


re not 
ttitude | 


d 


„åpna " 
aa apt 
~., - 
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, an 11-point scale (Remmers, 1967). An example 
Ma highly favorable item is “Tend to improve 
NW Broup with which they come in contact.” The 
ects were forced to choose the three items with 
Pach they most agreed. 


RESULTS AND DISCUSSION 


The major dependent variable in the study 
E number of "characteristic" and “not 
Fe teristic” responses appropriate to a posi- 
attitude (ie., positive adjectives called 
B and negative adjectives called 
aracteristic"), The data were analyzed 
Ocks of 10 trials. Since there were no 
ciple differences between the two types 
i adjective conditions, the data were pooled 
T 8 factor. 
3 IBure 1 presents the mean number of re- 
55 appropriate to a positive attitude 
T all subjects during the 100 acquisition 


ATTITUDE ACQUISITION AND COUNTERCONDITIONING 


$—e9 100s Grour 
&----& = 808 GROUP 
Gere 708 GROUP 
Pane m O——O0 — sos GROUP 


COUNTERCONDITI ONING 


BLOCKS OF TEN TRIALS 


Fic. 1. Learning and counterconditioning curves for the four percentage 
of reinforcement groups. 


trials and for the half of these subjects who 
received 50 counterconditioning trials. The 
analysis of variance performed on the data 
confirmed the picture provided by Figure 1. 
There was a general increase in learning over 
trials (F = 12.25, df = 9/1368), the different 
schedules of reinforcement produced differ- 
ences in response strength (F = 144.22, dj = 
3/152), and schedules of reinforcement inter- 
acted with trial blocks (F = 3.56, df = 27/ 
1368). In the last block of trials, the mean 
number of appropriate responses was 9.95 
for the 100% group, 8.05 for the 80% group, 
7.65 for the 70% group, and 5.57 for the 
50% reinforcement group. 

The data in Figure 1 show that the greater 
the reinforcement consistency or strength of 
association, the more likely the subject was 
to say that a positive trait was associated and 
that a negative trait was not associated with 
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the attitude object. The data further show 
the event matching expected in a probability 
learning situation. 

More importantly, the data from the sub- 
jects who completed the attitude scale demon- 
strated that an attitude toward the object was 
acquired and that the intensity of the atti- 
tude was related to consistency of reinforce- 
ment. Mean attitude scores on the Thurstone 
scale were 8.45 for the 100% group, 7.45 for 
the 80% group, 7.30 for the 70% group, and 
6.20 for the 50% reinforcement group. 
Scheffé (1959) tests indicated that all differ- 
ences between groups were significant (df = 
1/72, p < .01), except for the 80% versus 
70% reinforcement groups comparison. Thus, 
through an instrumental learning process, the 
attitude object had acquired evaluative mean- 
ing and was capable of eliciting an attitudinal 
response. Attitudinal intensity varied with 
consistency of reinforcement, 

Figure 1 displays the predicted partial rein- 
forcement effects on the counterconditioning 
data. There was a general decline in the num- 
ber of responses appropriate to a positive atti- 
tude over counterconditioning trials (F = 
68.99, df — 4/288), and mean level of re- 
sponse was inversely related to consistency of 
reinforcement during acquisition (F = 8.27, 
dj — 3/12). Differences in trend over the last 
block of acquisition and the five blocks of 
counterconditioning were obtained, which, in 
general, indicated that the 100% reinforce- 
ment group learned the new attitude more 
rapidly than the partial reinforcement groups. 

A large difference remained between the 
continuous reinforcement group and the 
partial reinforcement groups even after 50 
counterconditioning trials, as shown in 
Figure 1. This result suggests that resistance 
to attitude change may be produced from 
even small amounts of partial reinforcement. 
Since attitudes in everyday life are most likely 
acquired via a partial reinforcement schedule, 
they therefore would be expected to be fairly 
resistant to change but less polarized than at- 
titudes formed under a continuous schedule. 
As Staats et al. (1960) have suggested, dif- 
ferences in continuous and partial reinforce- 
ment schedules may be related to the effects 

of one-sided and two-sided communications, 
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Continuous reinforcement and one-sided com- 
munications may be analogous, with both of- 
fering only one kind of information which 
may result in more polarized attitudes that 
are not very resistant to change. Partial re 
inforcement and two-sided : 
both offering two kinds of inf 
result in attitudes that are we: 
resistant to counterpropagancs. Lumsdaine 
and Janis (1953) have found that attitudes 
formed by two-sided communications were, in 
fact, more resistant to counlerpropaganda 
than were attitudes formed by one-sided 
communications. 

This experiment showed the «applicability 
of phenomena in the area of learning to atti- 
tude formation and change. The obtained 
results agree with a number of mediational 
theory formulations of the attitude learning 
process (Fishbein, 1967; Rhine, 1958; Staats 
& Staats, 1958). However, exp lanations based 
on cognitive expectancy theory (Iiumphreys, 
19393, 1939b) and other theor: ical | formula; 
tions (cf. Kimble, 1961) also « 
for the present data. The prese! 
was not designed to distinguish betwe 
explanations, if indeed such a test is possible. 

A cognitive interpretation, however, quite ; 
naturally raises questions A it awareness of 
reinforcement contingencies anc (hat the data 
merely reflect demand characteristics of the 


situation, These questions have stirred cot 
siderable controversy in the atti tude learning 
area (Insko & Oakes, 1966; E ler, Collins, 
& Miller, 1969; Page, 1969; Staats, 1969): 
Two points are appropriate bete First, 1 


though the subjects were no doubt aware 4 
the reinforcement contingencies, demand | 
characteristics could hardly produce the e 
fects obtained under counterconditioning 
Second, the present experimental situation 
embodies certain typical features of real life 
where the demands of influencing agents alio 
are present, Parents who teach attitudes 
groups exert demands on the child, too. 
search directed at ascertaining the d 
to which the established attitudes ke 
internalized and are retained is clearly need 


E 
Here, theoretical notions of social influen 
1959; Kala 


o 


and power (French & Raven, 


1960; Kelman, 1961) are germane. 
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EFFECT OF INCREASED RESPONSIBILITY ON 
BYSTANDER INTERVENTION: 


THE PRESENCE OF CHILDREN * 
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Scarborough College, University of Toronto 


To examine the effects of focusing responsibility on bystander interventi: 
college students encountered one of two emergencies either alone, with t 
nonresponding student confederates, or with two nonresponding child cx 
federates. The prediction that intervention would be maximal when child 
were present was not supported. Subjects responded significantly more rapi 
to the emergency when alone than when any confederate, whether child 
adult, was present (p< .025). Subjects did, however, respond more rapi 
when the children were present than when the adults were present (p < .0 
The type of emergency did not result in significant differences in intervention 
Explanations to account for these results are discussed. 


The question of why people fail to help in 
an emergency has been raised and studied by 
a number of investigators (see Krebs, 1970). 
Of these, Darley and Latané (1968), Latané 
and Darley (1968), and Latané and Rodin 
(1969) have found that the larger the group 
present when an emergency occurs, the less 
likely any one person is to intervene. Maximal 
intervention seems to occur when a person is 
faced with an emergency with no one else 
present. This finding holds for internal emer- 
gencies, that is, one that threatens those 
present, such as smoke entering a room 
(Latané & Darley, 1968), as well as for 
external emergencies, such as an accident in 
an adjoining room (Darley & Latané, 1968; 
Latané & Rodin, 1969). 

The question that remains to be asked and 
answered is, How can intervention be in- 
creased in time of emergency? The above in- 
vestigators have hypothesized that the pres- 
ence of others plays a dual role, the others 
supply cues as to appropriate behavior in the 
face of novel stimuli, and at the same time 
they allow for a diffusion of responsibility, 
such that no one person can be blamed for 
not having intervened. If we accept this dual 


1 This research was supported by Canada Council 
Grant 69-0510 to the author. The author is grateful 
to V. Simmelhag for her invaluable assistance in 
conducting the experiment. 

2 Requests for reprints should be sent to the 
author, Department of Psychology, Scarborough 
College, University of Toronto, Toronto 5, Ontario. 


role hypothesis, it suggests thet by focusing 
responsibility on one person ev hough oth- 
ers are present and minimizing ‘he oppor- 
tunity for comparison with the others, we 
should be able to maximize inte: vention. One 
experiment (Korte, 1969), which manipulated 


focus of responsibility by limiting its dif- 


fusion to others present at the time of an 
emergency, found that for an external emer 
gency, a greater percentage oí subjects re- 
sponded when responsibility was not diffused 


than when it was. However, Korte did not 
try to increase responsibility above that 
faced by an individual who is alone 

If, in addition to focusing re ibility, we 
were to increase it by having dependent oth- - 
ers who were not suitable comparison figures, 
we should find subjects, when faced with an 
internal emergency, intervening even more 
than when alone. This hypothesis is based 
partially on intuition and partially on the 
research that Berkowitz and his associates 
have conducted (see, e.g., Berkowitz & Dat 
iels, 1963, 1964), in which they have found 
that increased dependency leads to a greater 
helping. 

In the present experiment, college stude 
were placed in a room with children and face 
with an emergency, either external or internal 
It was assumed that children would (a) 1 
crease responsibility by focusing it on the 
one adult present and (5) fail to serve as 
cue sources as to what the appropriate I 
sponse should be. The specific predictions 


nts 
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stervention would be found, 
emergency, when the 

vith two, nonresponding 

ın intermediate amount 

ı the subject was alone; 
rvention when the sub- 
children who did not re- 

y. No predictions were 
e effect of type of emergency on 


cx 


iETHOD 


) first-year male students at 
the University of Toronto 
experiment as part of an 
course requirement. 


he subjects w 
thorough Col 


imental conditions: the sub- 
of two emergencies—smoke 
ı or someone getting hurt in 
ihe emergencies occurred either 
with two children present, or 
sent, When the subject was to 
ger alone or with the children 
Uone before being shown into 
m, alone. When the subjects 
xperimental condition in which 
e emergency in the presence of 
adult confederates, they met the 
" ng as subjects, in the waiting 
experimental rooms before being 
Drought into the experimental room and seated 
round a table together 
experimenter introduced herself and ex- 
d that she was not with the psychology de- 
tment but with Industrial Research Ltd., and 
it the psychology department had lent their 
ties for this experiment, The experimenter indi- 
a tape recorder and told the subjects that the 
mictions for the experiment were all on tape; she 
plained that she would not remain but would 
Ve and return in about 45 minutes, by which 
me they should be finished, The subjects were told 
j at the purpose of the experiment was to determine 
E ther groups or individuals came up with better 
Solutions to problems. Their task was to read two 
oblems set out in front of them and then attempt 
5 Solve them. At this point, in the alone and adult 
fonfederates conditions, the experimenter left. In the 
d confederates condition, two children, ages 4 
6, were brought into the room by another ex- 
rimenter before the first experimenter left. This 
Cond experimenter explained that she had been told 
SY the staff developmental psychologist to have the 
i dren wait in this room until he needed them for 
«P research, The first experimenter expressed dis- 
asure at this arrangement, but ultimately acqui- 
ted. Both experimenters left the room, leaving the 


adjoining room ; 
they were a? 
two adults 
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subject alone with the children. During the course 
of the experiment, the children played with coloring 
books and did not speak to each other or the 
subject. 

After the experimenter left the room, the subjects 
turned on the tape recorder and heard that they 
were to be in the individual solution condition; 
their task was to read the problems over and then 
write down any thoughts that they might have and 
any solutions they came to. They were instructed to 
work silently and not communicate with one an- 
other. 

In all conditions approximately 20 minutes after 
the subjects were first ushered into the experimental 
room, they were subjected to one of the two emer- 
gencies (see "Emergencies"). Each subject was timed 
from the start of the emergency to the time they 
left the room or 5 minutes had elapsed, whichever 
occurred first. The confederates in both the child 
confederates and the adult confederates conditions 
completely ignored the emergency. On termination 
of the experiment, all subjects were thoroughly de- 
briefed. 

Emergencies. Each group was subjected to one of 
two “emergency” treatments: (a) Internal emer- 
gency—dry ice was placed in an immersion heater, 
and the resulting heavy white smoke poured into 
the experimental room from a crack under a table; 
(b) external emergency—subjects were led to be- 
lieve that there was a workman working in the area. 
This was done at the beginning of the experiment 
by having the experimenter ask the subjects to wait 
a little longer in the hall, where they were seated, 
since there was a workman making alterations in 
the room they were to use. Shortly after leaving the 
subjects in the room, the experimenter started a 
high fidelity stereo tape recorder (in the room ad- 
joining the experimental room), on which there 
Were occasional sounds of hammering and a power 
saw, followed by the emergency in which there was 
first the sound of the power saw, followed by the 
scream and a crash, then a moan, and “Oh, my leg!” 
and more moans. 


RESULTS 


In this experiment it was decided, a priori, 
to include all subjects, regardless of suspicion. 
As Darley (1967) has pointed out, a situa- 
tion must first be defined as an emergency 
before a decision about whether or not to 
intervene can be made. Outside the labora- 
tory, unambiguous emergency situations rarely 
exist; for example, a man lying in the gutter 
may have had a heart attack or he may be 
drunk. In the laboratory, many of these par- 
ticular ambiguities may not exist, but there 
are others that do, for example, Is this an 
emergency or is it part of a psychology ex- 
periment? Whether the situation is defined by 
the subject as an experiment or an emergency 
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TABLE 1 


PERCENTAGE or Groups LEAVING IN RESPONSE 
TO An EMERGENCY 


| Confederates 


federates’ 
Condition —— | confederates 
| Adult | Child (alone) 

| I 
Smoke 16 50 100 
Scream 16 50 83 


Note.—Percentage based on N = 6. 


may determine his response. In addition to 
this, the suggestion by Latané (1967), that 
suspicion voiced at the end of the experiment 
may be a rational for nonintervention, 
prompted the decision not to exclude any sub- 
jects who expressed suspicion. If greater sus- 
picion were to be found among those who 
did not respond to the emergency by inter- 
vention, Latané's suggestion would receive 
some support. 

To analyze suspicion, the subjects’ re- 
sponses were divided into two categories: (a) 
correct suspicion about the hypotheses and 
(5) other suspicion, which could be about the 
authenticity of the emergency (it did not 
seem like real smoke) and incorrect suspicion 
about the hypotheses (you wanted to see if 
the noise would keep us from solving the 
problem). Every subject in the experiment 
voiced some suspicion which fell into one of 
these two categories. Only 10 subjects enter- 
tained hypotheses that fit into Category a, 
and these were evenly divided among the 
six experimental conditions; of these 10, 5 
intervened and 5 did not. As all the subjects, 
regardless of whether or not they intervened, 
expressed some suspicion, it does not seem 
that suspicion was a rationalization for non- 
responding as Latané (1967) has reported. 


TABLE 2 


Mean Reaction Time (tn SEcoNps) to RESPOND TO 
AN EMERGENCY 


Apranam S. Ross 


There are two dependent messures which 


can be considered as indications of interven. 
tion. The first, the proportior of subjects 
leaving the room when the emerzency occurs, 
is presented in Table 1. Analysis of the data, 
after using an arc-sine transfor: ation, indi- 
cates that there was no main -'ect due to 
type of emergency (F < 1) nor was there a 
significant interaction (F <1). However, 
whether or not there are confec crates present 
and the type of confederate s have an 
effect on whether or not people leave (F= 
10.68, df = 2/0, p< .005). 

The second dependent measure that may 
be considered is the reaction time of the sub- 
jects. If all subjects who did sot leave are 
assigned a reaction time of 300 seconds (the 
time at which the emergency ; terminated 
if no one responded), then v rive at the 
means presented in Table 2. 1 nalysis of 
these means should and does agree with that 
of the proportion of subjects !cxving. There 
was no significant main effect of type of 
emergency (F < 1) nor was there a signif- 
cant interaction (F < 1). The only significant 
result is a main effect due to presence and 


type of confederate (F = 9.77, @ = 2/30, ? 
<.025).2 A set of orthogonal contrasts 
(Winer, 1962) when applied to these three 
means indicates that those who are alone 
leave significantly faster than those in either 
of the other two conditions (F = 13.61, df= 
1/30, p < .025) and that subjects in the adult 
confederates condition left more slowly than 
3 Since the reaction time scores were truncated at 
300 seconds, the distribution of scores is obviously 
skewed. Taking this into account, the probably 
levels reported are more conservative than 


tabled values for those degrees of freedom, for © 
ample, a tabled value of .025 is reported as .05. 


TABLE 3 


, ` To. 
Mean Reaction Times or Tuose SUBJECTS W 
RESPONDED TO THE EMERGENCY 


Confederates No Confederates No $ 
Condition confederates Condition -rna cones 
Adult Child (alone) Adut | Ch — 

Smoke 268.00 | 19168 78.08 Smok 175.00 (1) | 99.33 (3) | 8133 
Scream 269.44 | 16256 | 126.56 cream | 184.00 it) 3866 (3) | 47.75 W 
SEE mber of 


Note.—N = 6, Maximum time for any individual was 300 
seconds. 


Note.—The number in parenthesis refers to the n" 
subjects responding. 


child confederates condi- 
1/30, p < .05). 
he reaction times reveals 
; have found (e.g., Korte, 
are going to leave, leave 
eave being 2 subjects in 
ites condition who left at 
minutes, Except for these 2 sub- 
of 16 subjects left within 2 min- 
)f the emergency. An analy- 
times of only those sub- 
ble 3) indicates a significant 
> type of emergency (F = 
, $ < .01), which is probably 
the length of time it took 
^ a noticeable level. There 
t main effect due to presence 
lerate (F = 21.15, df = 2/ 
vain using orthogonal con- 
ad that the 2 subjects in the 
condition who left, left sig- 
slowly than did the 16 sub- 
xer two conditions (F = 5.27, 
5). There was no difference 
mes of subjects leaving in the 
confederates conditions (F < 


DISCUSSION 


es of the present experiment 
irmed. The prediction that 
paired with nonresponding 
erates would show the least 
amount of intervention was confirmed. Con- 
ary to prediction, however, the subjects 
ed with nonresponding child confederates 
ed an intermediate amount of interven- 
n, and those who were alone showed the 
Breatest amount of intervention. The results 
Of the alone and adult confederates for the 
internal emergency conditions replicate the 
Tesults of Latané and Darley (1968), and 
the Tesults for the external emergency condi- 
Hons replicate the results of Latané and 
Rodin (1969), 
q The results of the child confederates condi- 
“on can be accounted for if we drop the as- 
Sumption, mentioned in the introduction, that 
dren will not serve as cue sources for 
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increasing the number of subjects who inter- 
vene (when compared to the adult confed- 
erates condition), but at the same time they 
May serve as models of nonintervention and 
decrease the number of people who intervene 
(when compared to the alone condition). 

The results of the child confederates condi- 
tion could also be explained in terms of a 
conflict that the subjects may have found 
themselves in. On the one hand, the subject 
may feel that investigating the emergency or 
going for help is the appropriate response, but 
on the other hand he may feel that it is unwise 
to leave the children in the face of the poten- 
tial hazard. Nonexperimental reasons may 
have inclined some of the subjects to resolve 
the conflict in one direction and some to 
resolve it in the other. Although there is 
some question as to the utility of reaction 
time as an indicant of conflict (see Berlyne, 
1966), if we do consider length of time to 
respond to the emergency as an indicant of 
conflict, the insignificant difference (F < 1) 
in reaction time between only those subjects 
who responded in the alone condition and 
only those who responded in the child con- 
federates condition would indicate no greater 
conflict in one condition than the other. In- 
deed, it appears that judging by their reac- 
tion times, the subjects in the adult confed- 
erates condition experienced the greatest con- 
flict. Thus, it does not appear that conflict 
can account for the results in the child con- 
federates condition. 

One question raised by this study which 
deserves further investigation is the character- 
istics of effective models for increasing or 
reducing intervention. Bandura and his asso- 
ciates (see, e.g., Bandura, Ross, & Ross, 
1963) have investigated the effects of dif- 
ferent models on aggressive behavior, but to 
date no one has done this for helping behav- 
ior. Krebs (1970) reported a number of 
studies on the effect of modeling on helping 
behavior, but none compare the character- 
istics of different models, for example, chil- 
dren versus adults and males versus females. 
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c effective. 


suggestion refers to a 
| a hypnotized person to 
| manner after the termi- 

{ Barber, 1962, p. 321].” 
son (1941) have described 
or as being involuntary, 
and compulsive. Barber 
nced the above view and 
> induction (and presumed 
state") is not necessary to 
th a suggestion to perform 
ed acts, pos!-experimentally [p. 205 |." 
hypotheses can be derived from the 
empirical generalization. The first hy- 
stated by Barber (1969) was: “It 
an be hypothesized that hypnotic and non- 
Otic subjects will not comply with sug- 
ms to carry out post-experimental acts 
are led to believe that the experimenter 
Not know whether or not they complied 
05]." A study by Orne, Sheehan, and 
(1968) claims to have demonstrated 
the posthypnotic behavior of excellent 
ic subjects is independent of the knowl- 
or presence of the experimenter. In the 
et al. study, the posthypnotic test was 
istered by the experimenter's secretary. 
e, the subjects had reason to believe 
She would report their responses to 
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OF *HYPNOSIS" AND TASK MOTIVATIONAL 
CTIONS IN ATTEMPTING TO INFLUENCE 
iE "VOLUNTARY" SELF-DEPRIVATION 

OF MONEY 


IAN WICKRAMASEKERA * 
nial Health Clinic and University of Illinois Medical School, Peoria 


hypnotic subjects, similarly selected and treated, were given a 
ic suggestion under two apparently different conditions. Subjects in 
nental condition were administered the suggestion after a standard- 
notic” induction. Subjects in the control condition received the 
gestion after a set of task motivational instructions. The suggestion 
ithin the context of the procedure labeled "hypnosis" appeared to 


the experimenter, It appears therefore alli 
Barber's (1969) hypothesis was not ade- 
quately tested by the Orne et al. study. 

A second hypothesis is that there will be 
no statistically significant difference between 
hypnotized and nonhypnotized excellent hyp- 
notic subjects on a postexperimental act if 
they are similarly selected and treated. The 
present study tested this second hypothesis. 


METHOD 
Subjects 


The subjects in this study were undergraduate 
college students, white females, between the ages 
of 18 and 20, who agreed to participate in a 
“hypnosis-imagination emotional arousal study” for 
the payment of $5. Subjects were told specifically 
that they would not be paid for the group screening 
on the Harvard scale (used as a preliminary test to 
select subjects with superior hypnotic ability) and 
that the author intended to measure with the gal- 
vanic skin response the degree of emotional arousal 
when certain events were imagined. From the large 
pool of subjects who offered themselves for group 
screening, the first 20 subjects who scored between 
10 and 12 on the Harvard scale were selected for 
the study. The 20 subjects were randomly assigned 
equally to control and experimental groups. All 20 
subjects were told: “You will be paid $5 for your 
participation in this emotional arousal study, im- 
mediately after your individual session with me.” All 
procedures were administered by the author. 

The subjects assigned to the hypnosis group were 
"induced" individually with the Barber (1969) hyp- 
notic induction procedure and then given the follow- 
ing suggestions from the Stanford Hypnotic Sus- 
ceptibility Scale (Form C and Profile Scale) and the 
Barber Suggestibility Scale: (a) thirst hallucination 
(1 minute), (b) taste hallucination (24 minutes), 
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TABLE 1 


Number or Hours THAT ELAPSED SINCE DELIVERY OF 
THE SUGGESTION AND REQUEST FOR THE PAYMENT 
CHECK IN THE Hypnosis AND TASK 
MOTIVATIONAL GROUPS 


Subject Hypnosis group | Task xou tioni? 
1 312 22 
2 172 200 
3 196 29 
4 213 30 
5 288 46 
6 129 38 
7 330 112 
8 312 143 
9 240 21 

10 360 6 


(c) hand analgesia (14 minutes), (d) heat hallucina- 
tion (2 minutes), and (e) a posthypnotic suggestion. 

Subjects in the control group were individually 
administered the Barber task motivation instruc- 
tions (see Barber, 1969, p. 46) and told: "Please 
close your eyes, and imagine what I ask you to, 
without being hypnotized. It is important that you 
do not become hypnotized.” Control subjects were 
asked to imagine the following scenes: (a) drinking 
their favorite beverage (1 minute), (b) eating their 
favorite food (23 minutes), (c) putting on their 
favorite clothes (13 minutes), (d) taking a warm 
shower (2 minutes), and (e) posthypnotic suggestion. 

The following posthypnotic suggestion was read to 
both the control and experimental subjects: 


Please pay close attention to what I am going to 
say to you now. In a few minutes I will ask you 
to open your eyes. You will then have the oppor- 
tunity to collect $5 for your participation in this 
study. After you open your eyes you will start 
to feel something very interesting. You will notice 
no desire to accept payment for your participation 
in this pleasant and unusual study. In fact, you 
will even feel a certain pride in giving freely of 
your time for the advancement of science. You will 
not want to cheapen what you have done today 
by placing a price on it. You will find it much 
easier and more personally satisfying to ignore 
entirely the fact that you previously agreed to take 
money for the privilege of participating in this 
study. You will feel a certain pride in your inner 
freedom to refuse money for your participation in 
this test of your imagination. 


All subjects were connected to a galvanic skin 
response machine (LaFayette Model 760C) before 
the induction procedure. The following instructions 
were read to both the control and experimental 
subjects after they had opened their eyes: 


Thank you for your participation in this study. 
Your $5 is in an envelope with your name on it 
on the table in the next room. You may pick it up 
on your way out or you can have it mailed to 


IAN WICKRAMASEKERA 


you by calling this number (th 
the experimenter's telephone nun 


vere handed 
t his home) 


any time of the night or day ou have not 
called me within 15 days, I will ne that you 
do not want the money, and I lispose of it. 
RESULTS 
The data analyzed in this study consisted 
of the number of hours elapsed between the 
delivery of the posthypnotic suggestion and 
the receipt of the telephone call from the sub- 
ject requesting that her check be mailed (see 


Table 1). This type of data can be regarded 
as interval scale data, but to avoid the restric- 
tive assumptions associated ) parametric 
statistical procedures and because of the very 
small size of the present sample, the data 
were analyzed with the Wilcoxon matched- 
pairs signed-ranks test. The test indicated a 
statistically significant difference between the 
data for the hypnotic group and the “task 
motivational” group. Since the direction oí 
the difference was predicted beferehand, the 
null hypothesis was rejected at the .005 level 
for a one-tailed test. No checks were picked up 
on the way out, immediately after the experi- 
mental or control procedures. One subject 
did not request payment at the end of the 
experimental period and refused payment even 
afterward. 


DISCUSSION 


Currently, there appears to be no objective 
independent means of inferring the existence 
of an altered hypnotic state of consciousness: 
A small number of imaginative experiments 
(Bower & Pribram; ? Deckert, 1964; Parrish, 
Lundy, & Leibowitz, 1969) suggest a direc 
tion from which this kind of independent 
specification may come. It appears, howevel, 
that recent reviews do not mention thes 
studies in the above specific context. 

From the point of view of controlling and 
predicting the behavior of the subjects Jh 
this study, the determination of whether or 
not they were hypnotized does not see" 
important, but what appears to be demon 
strated is that there is a difference in the 
degree of subject compliance with verb? 


*G. Bower and K. Pribam. Unpublished er 
script. Cited by E. Hilgard, Hypnotic Susceptib 
New York: Harcourt, Brace & World, 1965. 


instructions are given 
a procedure labeled 
i with the same instruc- 


ons given within the context of a procedure 
not labeled hypnosis. 

Tt may be argued that this study simply 
Wlustrates compliant behavior. This is spe- 


ly what the author was attempting to 
onstrate and measure. Future studies can 
the strength of similarly developed com- 
nt behaviors by manipulating upward the 
ty of money or the quantity of time 
he verbal suggestions (i.e., 
y deadline, etc.). The 
jluntary-involuntary dimension formulation, 
ough an intellectually interesting one, 
not seem to have much relevance to the 
tical problems of the stimulus control of 
or. 
‘Tt may also be argued that the procedure 
d with control subjects reduced their moti- 
i to comply with the posthypnotic sug- 
tion. For example, screening the controls 
h a hypnotic procedure, but failing to test 
em with it, may have been a “letdown” for 
m. On the other hand, it may be argued 
t it made the experiment more interesting 
) them because they were exposed to a vari- 
ety of procedures (hypnotic instructions plus 
task motivational instructions). If the experi- 
ence of the control subject was more inter- 
esting, it was better fitted to the common 
ationale (e.g., since this was an interesting 
experience, you should not accept payment 
for participating in it) which the subjects 
1 Were offered for complying with the post- 
hypnotic suggestion. Hence, it may be argued 
that rather than being let down, they had a 
motivational advantage over the hypnotic 
jects. 
Tt has been empirically demonstrated 
(Barber, 1969) that task motivational in- 
structions can elicit a degree of compliance 
qual to hypnotic instructions. In the present 
Study, such task motivational instructions 
Were used to elicit posthypnotic compliance. 
Mence, the control subjects were treated with 
4 procedure that has appeared to be equally 
Powerful. The speculation that changing 
Motivational procedures (hypnotic screening 
to task motivational instructions) reduces the 
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probability of compliance remains to be 
empirically demonstrated. 

It may also be argued that since the control 
subjects were not hypnotized, they were not 
led to believe that the experimenter expected 
them to comply with the posthypnotic sug- 
gestion. For example, telling the control sub- 
jects not to become hypnotized may have 
implied that they were not expected to com- 
ply with the posthypnotic suggestion. Also, the 
control subjects were given tests that were 
similar to but different from those that were 
given to the hypnotic subjects. It may be 
argued that these tests were less pleasant than 
those that were given to the hypnotic subjects. 
If they were experienced as less pleasant, this 
may have implied to the control subjects that 
the rationale for complying with the post- 
hypnotic suggestion was not intended to ap- 
ply to them. From the above rationale, it 
may be inferred that the experimenter was 
covertly telling the control subjects in at least 
two ways that he did not expect them to 
comply with his posthypnotic suggestion. 

First, it is a matter of empirical fact that 
all of the control subjects complied to some 
degree with the posthypnotic suggestion. 
Second, it is also a fact that three of the 
control subjects complied almost as well as 
those exposed to hypnotic instructions. Hence, 
the data suggest that all control subjects 
knew that they were expected to comply with 
the posthypnotic suggestion. 

Third, an attempt was made to disguise the 
purpose of this experiment by presenting it 
as a test of “hypnosis, imagination, and emo- 
tional arousal.” But there is no indication of 
how effective a disguise it was. When in- 
formally questioned after the 15-day deadline, 
only one subject (in the experimental group) 
stated that she believed that the experiment 
was primarily intended to test the strength 
of a posthypnotic suggestion. The posthyp- 
notic suggestion was carefully worded to avoid 
any implication that compliance with it was 
contingent on the experience of hypnosis. 
Compliance was related implicitly to the 
satisfaction derived from participation in a 
pleasant and interesting experience, and not 
necessarily a hypnotic one. But there appear 
to be no independent objective means of 
determining how the subjects interpreted 
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these instructions. Apart from all the above 
possible inferences and speculations, the em- 
pirical observation was that the verbal sug- 
gestion given within the context of a procedure 
labeled hypnosis appears to have been more 
effective than within the context of the 
control procedure. 

This study suggests that it may be em- 
pirically more productive to think of hypnosis 
as a socially learned type of cognitive set 
about hypnotic behavior, and that this cogni- 
tive set can be the starting point for some 
subjects for profound psychophysiological and 
motivational alterations. Hence, even if the 
belief in hypnosis as an “altered state” turns 
out to be a delusion (e.g. like certain reli- 
gious and scientific beliefs), it may be a useful 
delusion, in that it may be used to provide 
some people with a rationale for extending 
themselves in ways in which they would not 
usually. Beliefs that can have profound physi- 
cal and motivational consequences are clearly 
powerful tools for behavioral engineers and 
clinicians. 

It would appear that the major role of 
money in the posthypnotic suggestion gives 
the suggestion a peculiarly relevant and “life- 
like” quality insofar as the behavior of the 
college undergraduate is concerned. Request- 
ing subjects to “voluntarily” deprive them- 
selves of a generalized reinforcer (Skinner, 
1953), in this case, money, makes provision 
for the demonstration of the effectiveness of 
verbal stimulus control over a wide range of 
alternative behaviors and motivational con- 
tingencies. The apparent demonstration of the 
continued effectiveness of verbal instructions 
after the subject had returned to his natural 
habitat and the artifacts and props of the 
experimental situation were “faded” seems 
to suggest that verbal instructions given 
within the context of an operation labeled 
hypnosis become peculiarly effective. 

A very important result of this study is 
that, with one exception, the desire to be 
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paid eventually became stronger tham 
desire to comply with the pos'hypnotie 
gestion, This result illustrates the limits 
single cognitive structuring procedure @ 
ducted within the context of vader p 
cedure labeled hypnosis. It also «uggests wi 
(e.g., individualized rationales reinfo 
ment schedules) in which more ible co 


tive sets may be programmed sur 
objectively after the individual ha: returned 
his natural habitat. It would ! 
to know how long such “sel 
or “self-control” can be 


privatior 
maintained 


individualized reinforcement schedules * 
boosters. : 

After the termination of the experime 
period, a check was mailed to ‘he subj 
who had not received paymen partic 
tion. The check was returned ^: ‘he exp 
mental subject and had to be mailed tol 
several times before she finally accepted it 
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sent research is an investigation of the aspiration responses of 120 
rade male and female students including Negro and white subjects of 
cioeconomic status and white subjects of middle to high socioeconomic 
rhe Rotter Level of Aspiration Board served as the behavioral task, 
í aspiration was related to sex, with lower-class Negro and middle-class 
rales giving higher aspiration estimates than their female counterparts, 

^y of shifts in expectancy following performance was different across 

with lower-class Negro females giving the highest and middle-class 
y nales the least number of shifts. Unusual shifts were related to race, 
‘ hite subjects giving less erratic responses than the Negro subjects. 
I tions of results are discussed in regard to minority group membership. 


xpectancy of success. 
Beginning with Dembo in 1931, the con- 
cept of level of aspiration has a rich theo- 
retical and empirical history (Lewin, Dembo, 
Festinger, & Scars, 1944; Rotter, 1942a, 
1942b, 1943, 1945, 1954; Sears, 1940, 1941). 
However, little specific level-of-aspiration re- 
search is readily available within contempo- 
rary literature, probably because a number of 
tisk-taking and goal-setting experiments have 
been couched within the theoretical frame- 
work of need for achievement (Atkinson, 
1958; McClelland, Atchinson, Clark, & Lowell, 
1953). Among the few experiments bridging 
the need-achievement-level-of-aspiration re- 
search, Hancock and Teevan (1964) found 
that subjects with high fear of failure chose 
more difficult odds on the initial trial of a 
tisk task and made more “irrational” moves 
3 odds after success or failure than did 
hope of success" subjects. Birney, Burdick, 
and Teevan (1969) also reported a series of 
studies demonstrating that subjects with high 
fear of failure give aspiration estimates that 
are initially discrepant from their perform- 
ance and come to reflect actual performance 
very slowly, if at all. On the other hand, sub- 
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jects with hope-of-success motivation seem 
quite skilled at making aspiration estimates 
that are close to performance scores. These 
authors also wrote that their review of the 
level-of-aspiration literature and their use of 
the technique *have convinced us that the 
role of aspiration and its dependence on moti- 
vation are not well understood [p. 190]." 

In view of the continued interest in level 
of aspiration and particularly the relevance 
that this concept may hold for a number of 
social phenomena occurring today, continued 
investigation of aspiration estimates and a 
clearer delineation of demographic and be- 
havioral correlates are most appropriate. Spe- 
cifically, level-of-aspiration research may be 
relevant in coming to understand more defini- 
tively the rising expectations and often sub- 
sequent frustrations of minority group mem- 
bers. If individuals are unrealistic in the 
appraisal of their probabilities of future suc- 
cess, two likely behaviors can be predicted. 
When estimates of success are unrealistically 
high, individuals may well be disappointed 
with their subsequent performance. When ex- 
pectancies are quite low, they may fail to 
perform at levels which would bring the de- 
sired accomplishment. Additionally, discrep- 
ant predictions may well serve a defensive 
function. Clearly, a further understanding of 
the ways in which expectancy estimates are 
related to a behavioral history and the im- 
plications of the association of aspiration to 
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race, sex, and social class variables is indi- 
cated. 

Lefcourt and Ladwig (1965) found that 
young Negro adult prison inmates were sig- 
nificantly less likely to be consistent in their 
aspiration estimates following a behavioral 
skill task than were their white counterparts. 
In contrast, in a gambling situation, Lefcourt 
(1965) found that Negro subjects performed 
in a more cautious and less erratic fashion 
than whites. Lefcourt suggested that the Ne- 
gro seems motivated to avoid failure in skill 
situations and to achieve success in chance 
situations. He interpreted his findings in the 
light of previous research on internal versus 
external control of reinforcement (Battle & 
Rotter, 1963; Lefcourt & Ladwig, 1965; Rot- 
ter, Seeman, & Liverant, 1962). This early 
research on generalized expectancies of locus 
of control, as well as the research of Cole- 
man, Campbell, Hobson, McPartland, Mood, 
Weinfeld, and York (1966) and Zytkoskee, 
Strickland, and Watson (1971), suggests that 
members of a number of minority groups, 
particularly Negroes, are more externally ori- 
ented than whites; that is, they see the events 
that happen to them as a result of luck or 
powers outside their control, as opposed to 
these events following from or being contin- 
gent on their own behavior. 

One must be cautious, however, in attrib- 
uting characteristics of minority group popu- 
lations to race differences without considering 
the powerful impact of socioeconomic status. 
The reviews of Shuey (1958, 1966), covering 
a number of research efforts, reported school 
achievement and intelligence test differences 
among racial groups. Bloom, Whiteman, and 
Deutsch (1963), however, suggested that the 
association of environmental conditions with 
social class placement is stronger than with 
race. They implied that social class member- 
ship may be the variable most responsible for 
the reported racial differences on intelligence 
and achievement test scores. Additionally, 
Tulkin (1968), controlling for socioeconomic 
status, found no racial differences among the 
upper socioeconomic group, although race 
differences remained significant among the 
lower socioeconomic subjects. A number of 
questions regarding race and social class re- 
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main unanswered, and the present study is an 


attempt to consider further these variables in 
relation to aspiration expectancies, 

The present experiment includes controls 
for socioeconomic level and extends aspira- 
tion research to an adolescent population, 
Based on previous findings, it was hypothe- 
sized that Negro subjects will be less con- 
sistent and realistic in their expectancy re- 
sponses following performance than will white 
subjects. Additionally, with a control group 
for socioeconomic status, the relative contri- 
butions of race and social class to expectancy 


responses were examined. Finally, the data 
were analyzed across male and female sub- 
jects so that the relationship of sex to aspira- 


tion estimates could also be considered. 
METHOD 

Subjects 

One hundred and twenty ninth: students, age 
14-16, from three urban public high schools served 
as subjects. The low-socioeconomic subjects were 
obtained from two schools serving inner-city dis- 
tricts classified by the school ı as poverty 
areas. These schools enrolled c! of parents 
whose median income falls at $3,000 or below per 
year, Although integrated, one of ‘hese school 
was predominantly black and one predominantly 
white. Twenty male and female Negro students 
from the predominantly black s | served as à 


low socioeconomic Negro group. Twenty male and 


female white students from the predominantly white 
school served as a low socioeconomic white group. 
The middle to high socioeconomic group was com- 
prised of 20 male and 20 female students of 3 
school, predominantly white, which was located in 
a middle- to upper-class suburb. 
Experimenters 

Experimenters were four white undergraduate psy” 
chology majors, two males and two females. Pri? 
to the experiment proper, experimenters were traine 
in administering the level-of-aspiration task. The 


female experimenters ran female subjects, and m 
experimenters ran male subjects 


Procedure 
" M jon 
The research was accomplished in coope 
with the public school systems over a perio 


z 9. 
approximately [ 196 


2 months in 
the experimental testing was from 2 p 
utes. The Rotter Level of Aspiration Board (Rot 
ter, 1942b) served as the behavioral task- 

board is designed with a groove leading to 3 sei 
of numbers, The subject hits a stecl ball along 


rics 


- shifts—are available 
- reported median J 


OOO U————————————S 


to obtain as high a score as 
ictice hits, the subject com- 
hits each. Before each trial, 
he score he expects to make 
‘ive hits. The relatively high 
lows for reliable measures of 
well as affording opportunity 
shifts in expectancy following 
id any unusual shifts (giving a 
failure or a lower estimate 
ormative data (W — 205) for 
i indexes—aspiration level (D 
shifts, and number of unusual 
for adults (Rotter, 1954). The 
score is 1.99, with a standard 
he mean frequency of shifts is 
1 deviation of 4.70. No unusual 
more than half the subjects, 
had one or more, Rotter reported 


groove and attemp! 
he can. After several n 
pletes 20 trials of 

the subject estimat 
on the total of th 
number of trials 

aspiration level, 
to note frequency 
success or failure : 
higher estimate 
following suc 
three level-of 
score), frequency « 


deviation of 2.79 
9.53, with a ste 
shifts appeared 
while about 45% 


an odd-even reliability for frequency of shifts to 
be 83, and for D s to be .76. 

These three i served as the measure of 
aspiration performance in the present study. The D 
score is the mean of differences between each per- 


formance score and the following estimate and is the 
numerical index of aspiration level—the higher the 
D score, the more aspiring the subject. Frequency 
of shifts in estimate has been thought to relate to 
general stabilit; self-confidence, with a high 
number of shiíis terizing individuals who do 
not use previou rience to establish consistent 
estimates of their rerformances, Unusual shifts like- 
Wise suggest fai of the subject to establish reliable 
estimates and an unusual, inconsistent, and 
somewhat erratic 


response, 
TABLE 1 
Group MEANS AND STANDARD DEVIATIONS FOR 
.. THE LEVEL OF ASPIRATION INDEXES 
Fre- Un- 
Groups D score| quency | usual 
of shifts | shifts 
E = 
Negro males: Low SEL 
3.69 | 16 5.55 
ED 251 | 3.88 | 2.09 
Negro females: Low SEL 
145 | 1721. | 44 
SD 
j 3.73 2.27 | 2.65 
White males: Low SEL 
x 240 | 15.55 | 2.15 
we 3.17 3.72 | 2.28 
nite females: Low SEL 
d 2,4 | 15.60 | 2.10 
we? 2.52 3.08 | 1.68 
Rt males: Middle SEL 
i 3.30 | 12.95 | 1.20 
Wee 2.15 | 4.90 | 1.24 
hite females: Middle SEL | 
x 14 14.75 1.50 
SD 2.14 | 3.32 | 1.40 
Note, SEL * socioeconomic level. : 


0 for all groups, 
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TABLE 2 


ANALYSIS OF VARIANCE FOR ASPIRATION RESPONSE 
ACROSS SUBJECT GROUPS 


Index SS df MS F 

D score 

Between groups 23415 | 5]46.83| 6.13** 

Within groups 870.40 | 114 | 7.63 
Frequency of shifts 

Between groups 194.08 5|38.82| 2.96* 

Within groups 1,494.25 | 114 | 13.10 
Unusual shifts 

Between groups 228.17 5 | 45.63 | 11.045*** 

Within groups 435.30 | 114 | 3.82 

* 
MGE 
> c 001. 


A modification of Rotter’s instructions was used 
for the experimental task. Briefly, each subject was 
told to try to obtain as high a total score as possi- 
ble and to estimate each time what he thought his 
score would be on the upcoming trial. 


RESULTS 


Means and standard deviations for the 
three aspiration indexes for each subject 
group are shown in Table 1. Results of a 
one-way analysis across groups for each of 
the aspiration indexes are shown in Table 2. 

The analysis of variance for D scores gave 
an F of 6.13 (p< .001) between groups. 
Further computation with the Newman-Keuls 
statistic suggested that this group difference 
was due primarily to a response difference 
between sexes. Negro males and white males 
of the higher socioeconomic class gave sig- 
nificantly higher D scores (p < .01) than did 
their female counterparts. The white male and 
female subjects of the lower socioeconomic 
group appeared quite similar in their level of 
aspiration responses and fell between the 
other male and female groupings. 

Results of the analysis of variance com- 
puted for frequency of shifts gave a between- 
group F of 2.96 (p < .05). Looking at Table 
1, there is an obvious difference in frequency 
of shifts, with the highest frequency found 
among the Negro subjects, followed by the 
lower-class whites, with the least number 
given by the middle-class whites. According 
to the Newman-Keuls statistic, the significant 
F was due to the marked differences (p < 
01) between the middle-class white males and 
the Negro females. 
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Considering unusual shifts, the analysis of 
variance across groups yielded an F of 11.95 
(p < .001). Again, by means of the Newman- 
Keuls statistic, it was possible to see which 
groups were contributing to this difference. 
Both Negro males and females were signifi- 
cantly different (p< .01) from the four 
white groups and were not significantly dif- 
ferent from each other, Thus, this difference 
appears to be primarily related to race, with 
Negro subjects giving a larger number of 
unusual shifts than did the white subjects. 


Discussion 


Results clearly indicate that aspiration re- 
sponses following behavioral performance are 
related to the race and sex of subject. Fre- 
quency of shifts was significantly different 
across groups, with Negro females and white 
middle-class males being most extreme. The 
Negro students, both male and female, also 
demonstrated a more erratic and unusual 
pattern of estimates than did the white sub- 
jects, as evidenced in their significantly higher 
number of unusual shifts. These results raise 
the possibility that Negro secondary students 
of this age are not incorporating immediate 
past performance into their aspiration esti- 
mates in the same way as their white counter- 
parts, Similar to the Lefcourt and Ladwig 
(1965) findings, the Negro subjects appear 
to be responding in a more chance-oriented 
or risky fashion than the whites, as if their 
expectancy for future success is only mini- 
mally, if at all, related to past behaviors on 
their part. This expectancy may well reflect 
the reinforcement history of members of mi- 
nority or disadvantaged groups who often find 
that their behaviors have little to do with the 
subsequent events that happen to them. These 
discrepant scores are also similar to those 
found by Hancock and Teevan (1964) and 
Birney et al. (1969) among fear-of-failure 
subjects and raise an interesting theoretical 
question. To what extent is a belief in ex- 
ternal locus of control related to fear of 
failure? Birney et al. reported a finding by 
Teevan and Hartsough of a .36 correlation (n 
= 49, p < .05) between a belief in external 
control as measured by the James-Phares 
scale and fear of failure as assessed via the 
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scoring of hostile press imagery. Continued 
research linking these two vari would be 
particularly useful in clarifyin: ‘he concep 
tual underpinnings of need f ievement, 


locus of control, and aspiration arch, 

If these results generalize to other behav- 
ioral activities, such as acad: achieve 
ment and work success, it ma that the 
Negro individual is less likely t the white 
to believe that his efñcacy mpetence 
will bring him future rewards his is the 
case, it is not surprising to expec: that many 
minority group individuals m nave par 
ticular difficulty in acquiring the behavioral 
response contingencies in an academic situa 
tion that would allow for most efective moti- 
vation and performance. Perh: e acquir- 
ing of an expectancy that beha loes make 
a difference is a precursor to : ore tradi- 
tional educational methods. 

A serious methodological criticsm may be 
leveled against the present research in that 
only white experimenters were utilized, and 
race of experimenter is known to affect the 
performance of Negro student: differentially 
(Dreger & Miller, 1968; Katz 4; Shuey, 
1958, 1966; Trent, 1964). It could well be 
that the responses of the Necro students 


would have been different in response to à 
Negro experimenter. Using the Rotter board, 
Eppes (1969) has collected aspiration T€ 
sponse data on Negro and white seventh grad- 
ers and found that while Negro students do 
respond differentially according to race of 
experimenter, the Negro students give more 
unusual shifts for the Negro experimenter 
than for the white. Consequently, it would 
seem that the strength of the findings of the 
present study would have likely been em 
hanced by the use of a Negro experimentet, 
but not lessened. In line with the present T€ 
search, Eppes found that regardless 0 the 
race of the experimenter, the Negro student 
gave significantly more unusual shifts than | 
the white students, but were not significantly 
different as regards frequency of shifts 0 
level of aspiration. Eppes did not, howevel; 
find a significant difference across sexes as 
was found in the present research. His use 0 
only male experimenters and lower-class sub- 
jects may have contributed to this discrep- 
ancy. 


tions were made about 
piration responses to 

g to learn that in gen- 
were more cautious and 
responses than were the 

: to note that these sex 

st pronounced for the 
ver socioeconomic status 
whites, but were not 

or the white subjects of 

it does appear that both 
| and middle-class white 

to be considerably more 

he males. These results 
learned behavior on the 
that their ambitions and 

ot be as readily fulfilled as 
for males. Consequently, 
female is more protective 
iot set her sights as high 
ihe other hand, perhaps the 
red to achieve beyond his 


aspiration estimates are 

ace and sex suggests a number of 
h avenues, including continued investi- 
of developmental antecedents of as- 
, delineation of specific relationships 
aspiration and achievement, and 
isideration of attempts to modify aspira- 
ôn and perhaps subsequent behavioral re- 
mses. Certainly the finding that Negro 
ts are more discrepant and erratic in 
ration estimates following performance 
are their white counterparts when socio- 
homic level is controlled suggests that in- 
d concern with finding more effective 
ional methods for reaching minority 
foup individuals is necessary. 
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IN OF FEELING, BELIEF, AND INTENTION TO ACT 
EE COMPONENTS OF ATTITUDE AND THEIR 
BUTION TO PREDICTION OF CONTRACEPTIVE 


BEHAVIOR ' 


VIRUPAKSHA KOTHANDAPANI * 
University of North Carclina 


and 36 nonusers of contraceptives among low-income Negro women. 
tration of these 12 scales on 50 users and $0 nonusers yielded a 12 X 12 


Most studi f attitude have failed to 


obtain a signi t relationship between atti- 
tade and behavior (e.g, Festinger, 1964; 
Fishbein, 1967; LaPiere, 1934; Mann, 1959). 


Some of the possible reasons for such failures 
Ar as follows: (a) Definition of attitude 
solely in terms of affect may be inadequate. 
Affect may not be the sole predictor of be- 
havior, or affect may facilitate behavior but 
Pot initiate it; (ò) attitude toward an inap- 
Propriate stimulus object may have been 
measured: (c) the instruments for measuring 
fither the attitudes or the behaviors under 


` This article is based in part upon a dissertation 
submitted to the University of North Carolina, 
Chapel Hill The author is indebted to Vaida D. 
Chairman of his dissertation committee, 
for her invaluable help in the direction of the 
He also thanks John Thibaut, Chester Insko, 
- Woligang Stroeba, Forrest Young, and Lawrence 
Snes for their pertinent suggestions. The valuable 
Comments of Michael Ross in the preparation of this 
Judy are greatly appreciated. The author is deeply 
indebted to Moye Freymann, Director, 
Population Center, University of North Carolina, 
Hill, for financing and inspiring this research 


ate Tests for reprints should be sent to the 
reel who is now at the Department of Psychology, 
lin College, Franklin, Indiana 46131. 


ind intention-to-act components supported a mcd hypothesis: The 
neto-act was a better predictor actual 
pede 


ior attitude structure and educational programs are considered. 


An attitude is generally defined as a learned 
to respond to an object or class 
favorable or un- 


out that measuring attitude in terms of affect 
does not lead to the prediction of a specific 
behavior because there may be a number of 
behaviors that are equally appropriate for an 
individual occupying a particular position 
along the affect continuum. Chein (1948), 
following Doob, proposed a multicomponent 
definition of attitude. He suggested that two 
people may be equally favorable toward an 
object but may have different cognitions or 
beliefs about the object. In accord with Chein, 
Fishbein (1965) has suggested that it is not 
necessarily “inconsistent” for an individual 
(a) to have a favorable attitude toward some 
“object” and (5) to believe that the “object” 
has some negative characteristics, qualities, 
and attributes. 
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Recent research in the domain of attitude 
(Harding, Kutner, Proshansky, & Chein, 
1954; Insko & Schopler, 1967; Katz & Stot- 
land, 1959; Rosenberg & Hovland, 1960) has 
identified three classes of evaluative re- 
sponses—affective (feeling), cognitive (be- 
lief), and behavioral—as components of atti- 
tude. However, Woodmansee and Cook 
(1967), in a factor-analytic study, did not 
find evidence for the tripartite division. Fish- 
bein (1967) questioned the usefulness of tak- 
ing three component measures on an indi- 
vidual to predict behavior. On thé other hand, 
Ostrom (1969) demonstrated the superiority 
of verbal measures of the behavioral com- 
ponent (items relating to past and present 
behaviors and behavioral intentions) in pre- 
dicting overt behavior. However, he only 
found minimal support for the construct va- 
lidity of the tripartite classification of the 
evaluative responses. Ostrom obtained a high 
intercorrelation among the three components, 
with the uniqueness of each component. con- 
tributing very little additional variance. He 
suggested that the low unique variance of the 
attitude components could be due to the 
homogeneous nature of the attitude issue 
(church) and the relatively homogeneous 
population (college students) he employed in 
his study. 

The present study, following closely upon 
the design of Ostrom’s work (1969), at- 
tempts to establish the convergent and dis- 
criminant validities of the three components 
of the attitude, namely, feeling, belief, and 
intention to act. Campbell and Fiske (1959) 
pointed out that validation is typically con- 
vergent, a confirmation of a trait (com- 
ponent) by independent measurement pro- 
cedures. Validation is also discriminant, in 
that one trait should not correlate highly 
with another trait with which it is intended 
to differ. The most convenient way to obtain 
both convergent and discriminant validities is 
to derive a multitrait-multimethod matrix 
that presents all intercorrelations resulting 
when each of the several relatively indepen- 
dent traits (e.g., feeling, belief, and intention- 
to-act components) is measured by each of 
several measuring techniques (e.g, Thur- 
stone, Likert, Guttman, and Guilford tech- 
niques). 
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In the present research, while the 
and belief components wer: ical to 
and cognitive components, the scope of 
behavioral component was narrowed to 


tion to act or behavioral inter: 
and present behavior statem: 
interpreted as statements of fact, 


or disagreement with them likely to | 
misleading as an index of attitude, On ti 
other hand, intentions to act ike into ex 


sideration the behaviors in hypot 
ations. This would include the sit 
variables emphasized by La 


than Ostrom’s global behavioral 
measures. Furthermore, in the present s 
the limiting features of Ostrom’s study- 
church as an attitudinal topic 4 
as respondents—were eliminated 
ing a controversial topic, namely 
trol, and by employing op: 
users and nonusers of birth con 
as respondents. T 

The first of the hypotheses may be state 
in the context of the mulütrait-multim 
matrix approach of Campbell and Fiski 
When the three components of attitude (fee 
ing, belief, and intention to act) are, eae 
measured by four techniques (Thurston 
Likert, Guttman, and Guilford), a compo 
should correlate higher with itself when mea 
sured by two methods than with anoth 
component when measured by the same tw 
methods. This hypothesis is based on th 
assumption that each component has a uniq 
set of antecedents, distinct from those for t 
other two. A 

Of the three components of attitude, 
tention to act is assumed to be most hi 
predictive of behavior, since behavior 
intention to act presumably share commi 
determinants. The second hypothesis stat 
that the intention-to-act scale, when © 
pared to feeling or belief scales, should P 
a better predictor of overt behavior. 


METHOD 


Four verbal measures of each attitude com 
(feeling, belief, and intention to act) were 
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dently constructed «ne the techniques of equal- households were selected until a qualified respondent 
appearing interv rhurstone & Chave, 1929), was found (Kish, 1965). Both respondents and 
summated. ratings cert, 1932), scalogram analysis interviewers were paid for their participation. By 
(Guttman, 1944) self-ratings (Guilford, 1954). ^ assigning different interviewers to conduct a second 
The administratio: of the 12 scales (three com- interview on a randomly selected group of re- 
ponents measured "ur methods) yielded a 12 X  spondents, we were able to obtain a simultaneous 
12 multitrait-mul':^c!hod correlational matrix that check of respondent and interviewer reliability 
facilitated the t ; of discriminant and con- (Rosenthal, 1966). A total of 468 interviews was 
vergent validation smpbell & Fiske, 1959) of the initiated. Of these, 16 had to be discarded due to 
tripartite classifi Factor-analytic techniques termination of the interview by the respondent. 
were also used in ting the validation hypothesis. a M 3 2 
An index of contraceptive behavior, for obvious Collection and Classification of Attitude 
reasons, was obt | by the self-report of the Statements 
respondent. Relat ontribution of the three com- The first step involved preparing and classifying 
ponents (feeling, belief, and intention to act) of statements into each of the three components, 
attitude in the prediction of use of birth control Thirty-two judges, most of whom were psychology 
devices was assessed by determining discriminant graduate students, were requested to write a total of 
funetion weights. Using a stepwise discriminant six statements that would represent the € com- 
analysis procedure, the best predictor of behavior ponents and would express both “pro” an n" 
was determined for cach technique of measurement. views about the attitude referent, birth control, 
according to the component definitions offered by 
Respondents and Interviewers Rosenberg and Hovland (1960) and Ostrom (1969). 


To facilitate their task, they were provided with 
The respondents were low-income married Negro written definitions and some examples of each com- 
Women who were residents of public housing proj- ponent. (The definitions and examples are similar to 
ts in North Caroline. Women meeting the follow- those in the display presented below.) Additional 
ing criteria were < ered eligible for the study: items were composed by the author with the aid of 
(o) married, (b) ro, (c) age between 15 and colleagues and a research of the literature. Alto- 
35 (d) living with the husband, (e) family income gether, 264 statements were gathered and edited. 
kss than $6,000, (j) currently not pregnant, and (g) Two types of judges—experts (21 psychology 
@rrently not in the postpartum period. To avoid graduate students) and nonexperts (37 introductory 
race and sex, trained female Negro psychology students)—were requested to classify 264 
cmployed to collect the data. attitude statements into the three component cate- 
Every third household in the housing project was gories according to the following specific instruc- 
flected for the s ^ with the first household tions, which contained detailed explanations of the 
flected by a ranc process. If the selected re- three components with examples for each com- 
PPondent did not qualify for the study, the adjoining ^ ponent: 


j 


Please read the following instructions carefully. 

Attitudes can be characterized as an evaluation of the attitude object on a pro to con con- 
tinuum. Statements expressing feelings, beliefs statements, and statements expressing intention to 
act are all attitude statements. The global concept of attitude can be analyzed into three com- 
ponents: Feeling (F), Belief (B), and Intention to act (I). 1 3 
. Vour job is to classify the accompanying list of statements of "attitude toward birth control 
into the above three component categories. " , s 2 

| To avoid any ambiguity as to the meaning of the words "feeling," "belief," and "intention act, 
please read carefully the definitions of these three components. Also read carefully the examples 
of each component, 


Feeling Component (F) 


The statements in this component represent favorable to unfavorable feelimgs toward birth 
control. These are also called affective statements. They should express like or dislike, emotional 
and physiological reactions, and feelings best exemplified by the phrase “gut reaction.” These 
usually express emotional reaction to birth control. 

Examples of feeling statements concerning birth control: 


I am happy to learn about the benefits of birth control. 
The very thought of birth control disgusts me. 


Belief Component (B) 


The statements in this component reflect values, attributes assigned to the attitude object. 
ey include beliefs and opinions about birth control, characteristics of birth control, and the 
relationships of birth control to other values and objects, which may include oneself. 
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Examples of belief statements concerning birth control: 


Birth control will help me postpone childbirth as long as I want. 
I believe that birth control causes many birth defects. 


Intention to Act Component (I) 


This component includes statements representing supportive to hostile actions wit 
birth control. The items reflect intentions to act, The statements can be of future inten 
or predicted behavior in hypothetical situations. The item should contain behavior, into 
behave, resolution to act, or a preferred course of action. 

Examples of intention to act statements concerning birth control: 


I would volunteer to speak about the merits of birth control. 
I would walk a mile to get my birth control supplies. 


In the response sheet you will find numbers 1, 2, and 3 to the right of each item 
Mark under No. 1 if you think the item expresses feeling; under No. 2 if you believ 
expresses belief; under No, 3 if you believe the item expresses an intention to act. If 


any questions please ask the experimenter. 


Since, by chance alone, it would be expected that a 
statement would be assigned to each component 
one-third of the time, the proportion of obtained 
agreement was tested against the null hypothesis of 
.33. Only if a statement reached the 001 level of 
significance for the nonexpert group was it re- 
tained as reflecting primarily that component. The 
final item pool for scale construction was composed 
of 249 items, each one of which was similarly classi- 
fied by more than 70% of the nonexpert judges. 

The entire study was conducted in two phases. The 
first phase involved the construction of 12 scales 
(three components, each one measured by four tech- 
niques), and the second phase involved the admin- 
istration of these 12 scales. Independent samples 
were used in the two phases. Moreover, since the 
entire study was to be conducted on a Negro popu- 
lation, it was imperative for external validity of 
measurement that the scales should be constructed 
and standardized using a representative sample from 
the same population. Therefore, throughout this 
study, both in the phase of scale construction and 
in the phase of hypotheses testing, only Negro 
women were used. 


Construction of the Thurstone Scale (Method 
of Equal-Appearing Intervals) 


Thurstone scale construction was accomplished by 
utilizing samples of 32 respondents drawn from two 
different populations—those who were in favor of 
and those who were against birth control. These 
groups are referred to as pro and con groups or 
“users” and “nonusers” throughout this study. Pro 
and con female Negro groups were used in obtaining 
the scale values of items because ratings are affected 
by raters’ attitudes (Hovland & Sherif, 1952). Only 
those items that had a minimum difference (less 
than .5 scale unit) between the scale values as- 
signed by the pro and con groups and a Q value of 
less than 2.00 were used in the final Thurstone 
feeling, belief, and intention-to-act scales. 

The ratings of the items used in the Thurstone 
scales were obtained as follows. The respondent was 
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approached by a female Negro 
carried with her a cardboard folc 
when spread out, contained 11 c 
each cell appropriately labeled so 
cells had “very, very favorable" “very, very 
unfavorable” written at the top. The interviewer 
read aloud the written instructions, which explained 
in detail how the statements were io be sorted 
according to the degree of favorablene:s toward the 
attitudinal topic, birth control per se Each state- 
ment was presented on a 3X6 inch index card. 
Since many respondents were not proficient in read- 
ing, the interviewer always read aloud the item on 
the card and gave it to the respondent, who then 
placed it in 1 of the 11 compartments. Before each 
sorting operation, the index-card deck v shuffled so 
that the items were presented in a random order. 
However, within the first 10 cards, at least one ex 
treme positive and one extreme negative statement 
were presented so that the respondent was sensitized 
to the range of items. After the interview, the re- 
spondent was paid a nominal sum of money ($1) as 
a token of appreciation for her cooperation. The 
Thurstone feeling, belief, and intention-to-act scales 
each had 16 items made up of statements with 
minimum evaluative ambiguity (Q < 2.00) and cow 
ering the entire range based on medians derived. 
from the combined pro and con groups. À check on 
the equivalence of the internal validity of the three 
scales was also made in the second phase of 
study. 


Construction of the Likert Scale ( Method 0 
Summated Ratings) 


The construction of the Likert summated ratings 
scale required selecting three pairs of criteri 
groups: high and low groups for feeling items, 
and low groups for belief items, and high and s 
groups for intention-to-act items. After eliminate 
the statements that composed the Thurstone 5C% 
from the original pool of items, there remained © 
substantial number of statements in each category 
65 in feeling, 66 in belief, and 70 in intention-to-a! 


-- 


tems, drawn randomly from 
stered to a sample of 72 
(36 pro and 36 con) by 

, who recorded the re- 
cement or disagreement on 

cale was labeled "Strongly 
Know," "Disagree," and 

> Fach respondent’s responses to 
sc were summed so that Cate- 
sented a  pro-birth-control 
the criterion groups for the 
18 judges were selected as 


pro-birth-con feeling-criterion group, and 

Be lowest 18 judges were selected as the anti-birth- 
tontrol feeling-criterion group. Multiple criteria were 
‘sed in item selection. Items that differentiated the 
0 criterion grou; nally (ie, those with the 
ected, Among the equal t 


Tigest ¢ values) 5 
; with larger standardized dis- 
ficients were chosen for in- 
scale, Also, an overall multi- 
uted in order to ascertain 
t items taken together differentiated 
criterion groug The Likert feeling scale con- 
ted of 16 feclinz items that had high univariate 
$ values (#2 values) and appreciable discriminant 
Munction coefficients, which collectively differentiated 
two criterion groups. A similar procedure was 
allowed in the ction of Likert belief and 
Mention-to-act scales. Each scale had 16 items. 


onstruction of ihe Guttman Scales 
(Scalogram Analysis) 


Since all the items in the original item pool in 
the present study were already assigned scale values, 
1 Was assurance of item relevance with regard 
t the attitude in question. Thirty items (10 in each 
E were picked at random for the three Gutt- 
E. scales. These items were selected at the same 

E as those for the Likert scales from the item pool 
at remained after eliminating the items that 
E the Thurstone feeling, belief, and inten- 
35 Sm scales. These 30 items were given to a 
in ple of 72 respondents (36 in a pro group and 36 
han anti group) who indicated their agreement or 
Each ug on dichotomous response categories. 
Orph responses to the 10 items in each 
E "E were scored so that Category 2 always 
ire nted a pro-birth-control weighting. The scales 
TN produced in two stages. First, the 10-item 
"He Was subjected to scalogram analysis using a 
Ee analytic procedure (Dixon, 1964). The 
Eua of this analysis was to obtain (d) errors 
E scale and for each item and (b) frequency 
5 currence of "agree" and “disagree” responses to 
20 item, There are a number of ways to select a 
combination from the original 10 items. One 
ch is to select those 6 items that have the 


ne those 6 items that best represent six differ- 
7 pese of occurrence of agree or disagree, 
ese frequencies evenly distributed across the 
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entire spectrum. Such an approach reduces the possi- 
bility of obtaining a spuriously high coefficient of 
reproducibility. A third approach is a judicious com- 
bination of the first and second methods, which 
achieves the advantages of both methods. All three 
methods were utilized for each component scale. The 
combination that yielded the maximum coefficient 
of reproducibility with minimal marginal reproduci- 
bility was retained in the final scalogram analysis. 
The coefficients of reproducibility for all scales were 
fairly high (.97, .93, and .93 for feeling, belief, and 
intention-to-act scales, respectively), especially 
when compared to the low coeficients of minimal 
marginal reproducibility (.64, .61, and .65 for feel- 
ing, belief, and intention-to-act scales, respectively). 


Construction of the Guilford Selj-Rating Scale 


The scale developed here was a typical numerical 
scale with a sequence of nine defined alternatives. 
Respondents were asked to check an alternative 
that came closest to their own reaction. (Guilford, 
1954). To achieve greater equality of psychological 
intervals between categories, intervals were numbered 
1-9, such that 1 denoted the "extremely unfavor- 
able" end, and 9, the “extremely favorable" end. 
Hunt and Volkman (1937) have demonstrated that 
the addition of extreme terminal categories (eg. 
very much desirable, very much undesirable on the 
belief scale) serves to spread the entire distribution 
of ratings in the direction of that category. Greater 
dispersion is thus achieved. Bryan and Wilke (1941) 
have found that the ordinary rater likes to think 
of the good qualities first; therefore, in all scales 
used here, the favorable end was at the top. The 
feeling scale was labeled “I like birth control very 
much" at one end and “T dislike birth control very 
much" at the other end, with seven categories be- 
tween these two anchors. The belief scale contained 
three questions. The first one asked what the re- 
spondent's opinion was about birth control. The 


three suggested alternatives were (a) “Birth control 
has desirable qualities"; (b) “birth control has un- 
desirable qualities" ; and (c) “I don’t know.” The 
second question asked how desirable birth control 


was, and four alternatives were suggested: "very 
much,” “pretty much,” “somewhat,” and “a little.” 
Numerical scale values of 9, 8, 7, and 6 corresponded 
with these alternatives, respectively. The third ques- 
tion asked how undesirable birth control was, and 
four alternative answers were suggested: “very 
much” (undesirable), “pretty much,” “somewhat,” 
and “a little.” Numerical scale values of 1, 2, 3, and 
4 corresponded with these alternatives, respectively. 
The respondent answered Question 2 or Question 3, 
or none of them, depending on how she answered 
Question 1. A similar three-question scale format 
was adapted in the construction of the intention-to- 
act scale. Each one of the component scales was 
placed on a single page, along with instructions for 
jtem endorsement. The scales were given to a sample 
of 72 respondents (36 in a pro group and 36 in a 
con group) who indicated their self-ratings on feel- 
ing, belief, and intention-to-act components. The 
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TABLE 1 


TWELVE SCALES CONSTRUCTED AND USED IN THE PRESENT STUDY 


Technique Scale source Content of the scale 

ual-appearing intervals Thurstone & Chave Feeling 
Papag intervals Thurstone & Chave Belief 
Equal-appearing intervals Thurstone & Chave Intention to act 
Summated ratings Likert Feeling 
Summated ratings Likert Belief _ 
Summated ratings Likert Intention to act 
Scalogram analysis Guttman Feeling 
Scalogram analysis Guttman Belief | 
Scalogram analysis Guttman Intention to act 
Self-rating Guilford — Feeling 
Selí-rating Guilford — - .Belief 
Self-rating Guilford Intention to act 


mean responses for pro and con groups were com- 
pared in order to verify the concurrent validity of 
the scales, All three Guilford self-rating scales dif- 
ferentiated the two groups. 

By introducing new techniques in item selection 
(e.g, using pro and con groups in the Thurstone 
scaling, multiple criteria in the Likert scaling, and 
error and frequency criteria in the Guttman scaling), 
it was possible to develop highly reliable measuring 
instruments, even with pcople who had only cle- 
mentary education. 


Administration of the Twelve Scales 


The 12 scales constructed in the first phase of the 
study are shown in Table 1. No attitude statement 
appeared on more than one scale. This assured 
maximum independence among scales. The final in- 
strument consisted of two sections, The first section 
had a series of questions that obtained data on the 
respondent's age, number of children, income, 
whether she practiced birth control, etc. The second 
section contained the 12 scales. All attitude scales 
and biodata were combined into one booklet. Each 
of the attitude scales was placed on a separate sheet 
along with instructions for item endorsement. The 12 
scales were randomly ordered. No scale contained 
more than 10 items on a single page. 

The instructions for item endorsement were the 
same, regardless of the contents (feeling, belief, or 
intention to act) of the scale. Instructions in both 
the Thurstone and Guttman scales were as follows: 
“Below you will find a list of statements concerning 
birth control. Please indicate whether you agree or 
disagree by checking the appropriate response.” The 
Likert-scale instructions were: “A number of state- 
ments regarding birth control are presented below. 
Five possible reactions, strongly agree, agree, unde- 
cided, disagree, and strongly disagree, are listed un- 
der each statement. Check the alternative which 
comes closest to your personal reactions to the state- 
ment.” The Guilford-scale instructions were also 
similarly worded. Thus, all scales measured attitude 
toward birth control per se. Only the contents of the 
items constituting the scales differed; that is, the 
feeling scale contained feeling items, the belief scale 


contained belief items, and the intention-to-act scale 
contained intention-to-act items. The exper 
design called for differential contents of the 
The feeling, belief, and intention-to-act items 
similar to the ones shown above, Hence, it sho 
be reiterated that irrespective of the scale conten! 
the attitudinal topic was always birth control. 

On first meeting the respondent, after a friend!) 
greeting, the interviewer read aloud from the in- 
troductory sheet, which contained a brief description 
of the survey. It made clear that the respondent's 
participation was entirely voluntary and that her 
responses were purely for research purposes and 
would be kept confidential. Respondents were also 
assured that there would be no attempt to change 
their opinions or attitudes. They were informed that 
their cooperation would help the researchers (the 
Population Center and the Department of Psychol- 
ogy at the University of North Carolina) in under- 
standing how people thought, felt, and acted with 
regard to birth control. 

The interview lasted 1-14 hours. It consisted 
mainly of the reading aloud to the respondent the 
attitude statements and the multiple-choice answers 
and the recording of the responses by the inter- 
viewer (by checking the appropriate alternatives). 
The interview was always conducted in the respon- 
dent's home. The interviewer was cautioned against 
suggesting answers to the respondents by word, by 
gesture, or by facial or any other expression (Rosen- 
thal, 1966). Prior to standardizing the scales, some 
of the wordings of the statements were modified 
with the help of several interviewers so that they 
would be comprehensible to the entire respondent 
population. Since the interviewers came from the 
same ethnic group, communicative problems were 
nonexistent. After the interview, the respondent was 
thanked for her cooperation and was paid $1 as a 
token of appreciation. 

In this phase of the study, two samples of 50 in 
each (ie. 100 respondents in all) were used. The 
number of respondents who participated in the con- 
struction of scales and the final testing of the 
hypotheses (ie. the administration of the 12 scales) 
is shown in Table 2. No respondent who participated 
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Method and scale 


L in the first phase was approached in the second 
phase of the study. When all the attitude scales 
"were scored, cach respondent had obtained 12 scores 
— relating to her feeling, belief, and intention to act, 
measured by four methods—Thurstone, Likert, 
© Guttman, and Guilford. The scoring was done ac- 
C cording to the particular technique of scaling. 
— Differential scores on the feeling, belief, and inten- 
fion-to-act scales could be due to the content of 
^ the scales and/or due to the scales having differential 
"internal validity values. Hence, a check on the 
rnal validity of the feeling, belief, and intention- 
to Thurstone scales was made by examining the 
mumber oí items that met Thurstone's criterion of 
irrelevance, All items in every one of the scales met 
‘this criterion. The three scales were equivalent in 
their internal validity. Further, the operating char- 
acteristics of the items in the three scales showed a 
Similar pattern: The pro items showed an increasing 
-monotonic effect, while the con items showed a de- 
- creasing monotonic effect, irrespective of the scale. 
"These checks confirm that the three Thurstone scales 
ere equivalent in terms of internal validity and con- 
sistency. The differential scores on the three scales 
would, therefore, be accounted for by the nature of 
the contents of the individual scales rather than by 
- the scaling inequalities. 


Index oj Overt Behavior 


Obviously, direct observation of the behavior 
(contraceptive usage) is completely ruled out. Àn 
index of contraceptive behavior was obtained by the 
self-report of the respondent. However, a reasonable 
check was made by verifying the self-reports through 
reliable peer reports and information about receipt. 
of free birth control supplies. 


RESULTS 


EA 12x12 multitrait-multimethod matrix 
- "was computed (see Table 3) from scores on 
— the 12 verbal scales. The convergent and dis- 
‘criminant validities of feeling, belief, and in- 
- tention-to-act components were tested against 
the criteria established by Campbell and 
i Fiske (1959). To establish convergent valid- 


TABLE 2 


Nvxpxs or RESPONDENTS CONSTITUTING THE DIFFERENT SAMPLES IN THE Construction 
OF SCALES AND IN THE HYPOTHESES 


VOR es 
I. Construction of scales ‘Thurstone: feeling, belief, and intention 32 32 
Likert: feeling, belief, and intention 36 36 
Guttman; feeling, belief, and intention 36 36 
Guilford: feeling, belief, and intention 36 36 
II. Testing of the hypotheses Administration of scales 50 80 


ity, it has to be shown that the values in the 
validity diagonal are significantly different 
from zero. An examination of the 18 validity 
diagonal values shows that every one of them 
differed significantly from zero (p < 01). A 
separate analysis was done for each com- 
ponent. Each component was measured by 
six method pairs; this yielded six monotrait- 
heteromethod coefficients for that component. 
The mean correlations for feeling, belief, and 
intention-to-act components (computed by z 
transformation) were .55, .46, and .50, re- 
spectively. Each correlation was significantly 
different from zero (p < .01); this supports 
the convergent validity hypothesis that differ- 
ent methods did measure the same component. 
The second consideration is the discrimi- 
nant validity of the components: One re- 
quirement of discriminant validity is that a 
validity diagonal value (monotrait-hetero- 
method value) should be higher than the 
correlations obtained between that variable 
and any other variable having neither trait 
nor method in common (heterotrait-hetero- 
method values). Each of the 18 monotrait- 
heteromethod values was compared with the 
corresponding four heterotrait-heteromethod 
values. Compared to the .50 proportion ex- 
pected by chance alone, the observed propor- 
tion (i.e., the proportion of times the mono- 
trait-heteromethod value was greater than the 
corresponding four heterotrait-heteromethod 
values) of 1 was significant well beyond the 
.001 level. It may be concluded that reliably 
different evaluative responses were made to 
the three classes of attitude statements. 
A more crucial test of the discriminant 
validity of the tripartite distinction was made 
by employing two factor-analytic techniques. 
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TABLE 3 
MULTITRAIT-MULTIMETHOD MATRIX OF CORRELATIONS FOR THREE COMPONENTS (FEELING, 
BELIEF, AND Intention TO Act) Measukep BY Four METHODS (Tuuxstone, 
LIKERT, GUTTMAN, AND GUILFORD) 
Thurstone Likert Guttman Guilford 
Measure ————————— CST 
F | B 1 F | B | 1 F | B | 1 F | B 
Thurstone 
Feeling (89) 
Belief 26. (93) 
Intention to act | — 19. EJ (94) 
Likert 
Feeling (S8) 34 os | (95) 
Belief 2 (9 2 | 6 op 
Intention to act | —03 08 31. 65 (95) 
Guttman 
Feeling 06  -06 20 0 | (85) 
Belief 20 o |a @ a|s 6) 
Intention toact| 18  —05 os o (0,2 32 0) 
Guilford 
Feeling 2. 02/9) a 35 30 13 | (82) 
Belief 2. (9 23 @ sms 13 | 65 (83) 
Intention toact | — 17 12 15 34 14k 72 55 61 (87) 


Note.—The decimal points have been omitted. All correlations are uncorrected for attenuation. Monotrait-monomethod values 
areenclosed in parentheses ; heterotrait-monomethod values are underlined ; monotrait-heteromethod values are circled ; heterotrait- 


heteromethod values are unmarked. N = 100. 


TABLE 4 


FACTOR PATTERN MATRIX OBTAINED BY ROTATING 
PRINCIPAL AXES FACTORS COMPUTED FROM 
TABLE 3 TO ORTHOGONAL SIMPLE STRUC- 
TURE USING THE NORMALIZED VARIMAX 
METHOD (Kaiser, 1958) 


Factor 


Measure 
I | I | HI 
Thurstone 
I. Feeling —.64* .03 —42 
II. Belief —4i 405 —q1 
III. Intention to act 08 .64* —.26 
Likert 
I. Feeling —.76* .00 — 44s 
Il. Belief —31 28 —.77 
III. Intention to act —45 46 — —.30 
Guttman 
I. Feeling —,80^ .03 09 
II. Belief -Ah 22 —.21 
III. Intention to act —.22 67° 26 
Guilford 
I. Feeling —605.. 120. —.26 
II. Belief —.28 45> = 49 
III. Intention to act —.20 19e —43 
% total variance 21 21 16 


a High loadings of feeling measure. 
b High loadings of belief measure. 
* High loadings of intention-to-act measure. 


In both of the analyses, factoring was done 
using the 12 12 intercorrelation matrix 
(Table 3) as the input, and the initial com- 
munality estimates were squared multiple 
correlations. In the first analysis the factors 
were restricted to be orthogonal, and a Vari- 
max rotation (Kaiser, 1958) was performed. 
The three orthogonal factors extracted in this 
analysis accounted for approximately 5895 of 
the total variance. The three factors were 
feeling, intention to act, and belief. The first 
two factors were distinct trait factors for all 
methods, while the third trait factor (belief) 
was determined by belief measures in all 
methods except the Guttman method. The 
factor loadings and the variance accounted 
for by the factors are presented in Table 4. 
A second rotation was done in order to de- 
termine oblique simple loadings (Jennrich & 
Sampson, 1966) and also to assess the corre- 
lation among the oblique factors. The results 
of this analysis were very similar to those ob- 
tained in the orthogonal solution with a re- 
markably similar pattern of loadings. The 
factor correlation matrix and the simple 


3 re factor pattern loadings are presented 
in Tables 5 and 6. Since the intercorrelations 
among the factors were fairly low and the 
- factor loadings were very similar to those in 

the first analysis, the variances accounted for 
by the factors could reasonably be assumed to 
be close to those obtained in the first analysis. 
An examination of the factors in both analy- 
ses showed that the belief measures not only 
loaded on their own factor but showed low 
to moderate loadings on the other two factors. 
In a similar fashion, the Guttman intention- 
- to-act measure was moderately loaded on the 
belief factor. However, in spite of these minor 
deviations, the three factors (feeling, belief, 
“and intention to act) could be easily identi- 
- fied, and the variance accounted for by each 
factor was substantial (21%, 16%, and 21%, 
respectively). 

How sensitive is each measuring technique 
‘in differentiating the feeling, belief, and in- 
_ tention-to-act components? For a method to 
be sensitive, the requirement is that a com- 
‘ponent should correlate more highly with 
“itself when measured by two different meth- 


TABLE 5 


FACTOR PATTERN MATRIX OBTAINED BY ROTATING 
PRINCIPAL Axes Factors COMPUTED FROM 
TABLE 3 TO OBLIQUE SIMPLE STRUCTURE 
USING THE METHOD OF JENNRICH 

Anp Sampson (1966) 


PREDICTION OF CONTRACEPTIVE BEHAVIOR 


Factor 
Measure 
I | il | ul 
"Thurstone 
I. Feeling —.66* —.06 —.25 
TI. Belief —.07 .00 —. 10^ 
— IIL. Intention to act 49 67e | —48 
Likert 
I, Feeling Io ehh —.34 
II. Belief —.24 .22 —.69^ 
— T. Intention to act | —.06* 79°  —J8* 
Guttman 
I. Feeling —85* —.07 .20 
II. Belief — 46^ 16 —.11 
III. Intention to act -A7 .68* A07 
. Guilford 
l. Feeling —.64^ .21 —.13 
IL. Belief —.21 41b — —.39b 
TIT. Intention to act | —.11 79° .03 
% of total variance 
(approximate) 21 21 16 


* High loadings of feeling measure. 
» High loadings of belief measure. 
High loadings of intention-to-act measure. 
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TABLE 6 


FACTOR CORRELATION MATRIX OBTAINED 
BY RoraTING PrixciraL Axes FACTORS 
COMPUTED FROM TABLE 3 To OBLIQUE 

SIMPLE STRUCTURE USING THE 
METHOD OF JENNRICH AND 
Sampson (1966) 


TEZTUETOU 
Factor | I Hu LT 


I. Feeling | 100 
IL. Belief | —.27 1.00 
III. Intention to act | .22 —.23 1.00 
| 


ods than with another component measured 
by the same method. For a given component, 
this involves comparing its value in the valid- 
ity diagonals with its values in the hetero- 
trait-monomethod triangles. For each com- 
ponent measured by (a) the method pair with 
Thurstone as one of the methods and by (^) 
the method pair with Guttman as one of the 
methods, this requirement was met. The pro- 
portion of times the monotrait-heteromethod 
values exceeded the corresponding hetero- 
trait-monomethod values was .78 (p< 01) 
for the Thurstone and Guttman methods. For 
the other two methods, the Likert and the 
Guilford, the observed proportions (.33 and 
.11) did not reach the chance level of .50. We 
may conclude from these results that the 
Thurstone and the Guttman methods dif- 
ferentiate the three components better than 
the Likert and the Guilford methods. An ex- 
amination of the trait-pair correlations across 
methods revealed that feeling and inten- 
tion to act were least correlated (r — .39), 
while both belief and intention to act (r= 
.52) and feeling and belief (r = .54) were 
highly correlated. However, all the mean 
trait-pair correlations were significantly dif- 
ferent from zero (p < .01), which meant that 
all the components were significantly corre- 
lated. Thus, although the three components 
of attitude are correlated, it has been shown 
that they possess convergent and discriminant 
validities, as predicted by the first hypothesis. 

The amount of variance contributed by the 
unique determinants was assessed by an ap- 
proximation method, since all the entries in 
the correlation matrix were derived from the 
same sample. This was obtained by assessing 
the difference between variances accounted 
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TABLE 7 


POINT-BISERIAL CORRELATIONS BETWEEN EACH 
ATTITUDE COMPONENT AND THE 
U SE OF CONTRACEPTIVES 


Feeling and | Belief and keai to 


Technique | rA act and use 
Thurstone es] 66 .68 8i 
Likert .60 45 82 
Guttman 36 54 81 
Guilford A 66 79 
for by the monotrait-heteromethod mean 


value and the heterotrait-heteromethod mean 
value (Ostrom, 1969). The unique variance 
was found to be .21 (approximately), which 
was of a considerable magnitude, further sup- 
porting the first hypothesis. 

The second hypothesis predicted that verbal 
measures of intention to act would be better 
predictors of behavior than would either the 
feeling or the belief verbal measures. The 
behavioral variable was dichotomized into 
two categories: use and nonuse of contracep- 
tives. Accordingly, respondents were classified 
into two groups: users and nonusers. The two 
groups, with 50 in each group, provided the 
criterion groups for testing the second hy- 
pothesis. A very simple approach to the prob- 
lem of assessment of the relative importance 
of attitude components was to compare the 
point-biserial correlations (Guilford, 1954) 
between the dichotomous variable, use-non- 
use, among the feeling, belief, and intention- 
to-act components. An examination of these 
correlations (see Table 7) showed that the 
intention-to-act component, regardless of the 
measuring technique, was the highest among 
the correlations. This finding, prima facie, 
lent support to the second hypothesis. 

A more crucial test was performed to 
answer the question, How important is each 
one of these components when taken together 
with respect to the use of contraceptives? The 
relative weights of each component within a 
given method were assessed by computing 
the standardized discriminant function co- 
efficients for each method individually. To 
facilitate comparisons among these coeffi- 
cients, they are presented together in Table 
8. It is evident from the comparisons in 
Table 8 that whatever the method of mea- 
surement employed, intention to act was 
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TABLE 8 
STANDARDIZED DISCRIMINANT FUNCTION COEFFICIENT 
FOR FEELING, BELIEF, AND INTENTION COMPONENT 
MEASURED BY v Four DIFFERENT TrcuNiQU ES 
| Technique of measuremen 
| ae ee 
Thur: | Likert | Guttman | Guilford 


Feeling 
Belief 
Intention to act 


45 


weighted the most in discriminating the two 
groups, thus supporting the second IA 
sis. 

A stepwise multiple discriminant analysis 

was performed to assess the relative contribu- 
tion of feeling, belief, and intention-to-act 
measures in discriminating the users from 
nonusers. In a stepwise discriminant analysis, 
one variable is entered at each step into the 
set of discriminating variables. Each variable 
is selected by the following equivalent cri- 
teria: (a) the variable with the largest F 
value; (5) the variable that when partialed 
on the previously entered variables, has thc 
highest multiple correlation with the groups; 
(c) the variable that gives the greatest de- 
crease in the ratio of within to total gen- 
eralized variances. A variable is deleted if its 
F value becomes too low. 

The results of the stepwise multiple dis- | 
criminant analysis for the three variables 
(feeling, belief, and intention to act), mea- 
sured by four methods, are presented in Table 
9. Irrespective of the method, the first varia- 
ble entered in the stepwise discriminant anal- 
ysis was the intention-to-act measure. Though 
the initial F values were very high for all 
three measures under each method, after the 
stepwise discriminant analysis, the F values 
for the feeling and belief measures were re- 
duced to a considerable extent. The reduced 
F values for the feeling and belief measures l 
indicate that after entering the intention-to- M 
act measure, the feeling and belief measures 
did not contribute to discriminating the two | 
groups. This is shown by the error pattern i 
within each method. After the intention-to- | 
act measure was entered (i.e. after the first 
step), there was almost no reduction in the — 
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TABLE 9 
STEPWISE DISCRIMIN ANALYSIS ON THREE VARIABLES MEASURED BY Four TrcuNiQUES 
Technique ps Variable entered Initial Final te es A Ar e^ 
ifa! F value |  Pyslue Hor M Parr 
1 Intention to act 188.07 188.07 1 or”. 23 
Thurstone 2 Feeling 75.74 13.34 2 9 
3 Belief 85.08 | 1.90 3 10 
k 1 Intention to act 214.68 | 214.68 1 9 
Likert 2 Feeling 57.97 322 2 9 
3 | Belief 131.69 ETI | 3 9 
1 | Intention to act 187,42 187.42 | 1 | 10 
Guttman 2 Belief 41.02 1.57 2 10 
3 Feeling 15.32 29 | 3 10 
$ | a Intention to act 162.89 | 10289 lectio my al 
Guilford M eee Belief 77.59 pn] AAT 
3 Feeling 48.58 20 | 3 | 13 
| 


Note.—Analysis was for two groups, users versus nonusers (n = 50 for each group). 


number of errors due to misclassification. 
This finding fully supports the second hy- 
pothesis. The verbal responses of intention to 
act were better predictors of behavior than 
were either the feeling or the belief verbal 
measures. 


Discussion 


The hypothesized convergent and discrimi- 
nant validities of the tripartite classification 
of attitude into feeling, belief, and intention- 
to-act components were confirmed in this 
study. The results of the multitrait-multi- 
method analysis provided support for the 
hypothesis that evaluative responses within 
each component share a set of determinants 
distinct from those for other components. 
Similar results were obtained by Ostrom 
(1969), but the unique variance in his study 
was of a very low magnitude. Ostrom pro- 
posed that the low variance in his study could 
be due to the homogeneous nature of the 
attitude issue (church) and the relatively 
homogeneous population (students) that he 
employed in his study. In the present study, 
these limitations were overcome by using a 
controversial topic (contraceptive usage) with 
a heterogeneous population (users and non- 
users). Moreover, the behavior in the present 
study can be identified without amy diffi- 
culty: either a respondent uses contraceptives 
or does not use them. With samples drawn 
from opposite groups (users and nonusers), 
the possibility of intercomponent inconsis- 
tency is increased. This was reflected in the 


high unique variance obtained in the present 
study. . 

An ideal test of the second hypothesis 
would measure the actual contraceptive usage. 
Obviously, direct observation of actual be- 
havior is completely ruled out. Since the issue 
of birth control is rather involving, and the 
people are quite sure of what they practice, 
the self-reports may be considered reliable 
measures of contraceptive behavior. Also, a 
reasonable check was made by verifying the 
self-reports through reliable peer reports and 
information about receipt of free birth control 
supplies. The intention-to-act measure, irre- 
spective of the measuring method, was found 
to be a better predictor of behavior than a 
verbal feeling or a verbal belief measure. 

That the intention-to-act measure is the 
best predictor of behavior appears to explain 
why the earlier attitude studies failed to pre- 
dict behavior. Most of the attitude scales in 
the past employed cognitive (belief) items. 
Cartwright (1949), in discussing the sale of 
United States war bonds in World War I, 
pointed out that it is possible for a person to 
have a given cognitive and motivational struc- 
ture for a long period of time without the 
structure ever actually gaining control of 
behavior. In case of war bond sales, Cart- 
wright suggested that placing a person ina 
situation where he must decide whether or 
not he will buy a bond resulted in a commit- 
ment or decision to buy a bond. The simi- 
larity between the intention-to-act state- 
ments in the present study and the commit- 
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ment to action emphasized by Cartwright 
should be noted. When a person agrees with 
an intention-to-act statement, he, in effect, 
is committed to act in a specific manner. Such 
a commitment to action cannot be confidently 
inferred from a person's agreement with be- 
lief and feeling statements. This seems to 
explain the high predictive power of inten- 
tion-to-act measures, 

The above finding has implications for pre- 
diction as well as control of behavior. It was 
shown that the intention measure has the 
maximum discriminative power in classifying 
people into users and nonusers. A corollary 
of this is that by suitably manipulating the 
intention-to-act variable, with due emphasis 
on the situational variables, an effective 
change in behavior can be brought about. 
This has enormous relevance for educational 
programs and persuasive communications that 
normally stress the belief (cognitive) com- 
ponent. 

The results of this study have also shown 
that there are differences among the methods 
of measurement. The Thurstone method was 
very sensitive in differentiating the three 
components of attitude, while the Guilford 
was the least sensitive of all. These results 
suggest that the Thurstone method would be 
the most sensitive in differentiating the three 
components. The sensitivity of a method in 
discriminating the components of attitude 
thus seemed to be dependent on the sophisti- 
cation involved in scale construction. The 
Thurstone approach orders respondents with 
respect to the underlying variable. In addi- 
tion, however, a metric is fitted to the under- 
lying continuum, and both the attitude items 
and the respondents are located along this 
continuum. The set of scale values assigned 
to the items and to the respondents fulfills 
the requirements of an interval scale as de- 
fined by Stevens (1951). That is, there is a 
consistent unit of measurement throughout 
the scale. The zero point is arbitrary, how- 
ever. The Thurstone method, when applied 
to measure the three components, produces 
three almost independent scales, each scale 
with its own units of measurement derived 
from its item pool. This would explain the 
superiority of the Thurstone method in dis- 
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criminating the feeling, belief, and intention 
to-act components, 


The format of Guilford scaling, on the 


other hand, is uniform across all component 
In each component scale there are nine de 
fined alternatives. Respondents are asked t 
check the alternatives that come closest 
their own reactions on each scale. Since t 
scales are identical in format, the possibilit 
of checking the same alternative in all scale 
is heightened. Hence, it is not surprising thai 
the Guilford method was less sensitive i: 
discriminating the components. The Like: 
and Guttman methods are moderately "sensi 
tive in discriminating the feeling, belief, and 
intention-to-act components. 

One important contribution of this study i 
that it demonstrates the feasibility of a con 
ceptual separation of the conditioned stimu- 
lus (feeling) and discriminant stimulus (be- 
lief and intention to act) functions of attitude 
objects. Thus, providing evidence for the tri- 
partite division facilitates the transition from 
the conceptual language of attitudes to the 
stimulus-response language of behaviors 
(Greenwald, 1968). Performance of an act 
may be considered a function of learned in- 
tentions, beliefs, and feelings in combination 
with current stimulus conditions. The feeling 
component stimulates or inhibits performance. 
The belief component aids in building 
up stimulus-response-reinforcer relationships. 
The intention component functions as an 
organizer, coordinating the feeling and belief 
input with the behavioral output. This func- 
tional approach to tripartite classification also 
accounts for the superior predictive power of 
the intention component. 
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INFLUENCE OF FUTURE CHOICE IMPORTANCE AND 
AROUSAL UPON THE HALO EFFECT ' 


EDGAR O'NEAL °? 


Tulane University 


Two determinants of the magnitude of the influence of anticipated choice on 
the halo effect are investigated: importance of future choice and arousal. 
Male subjects ranked photographs on positive personality traits. Half of the 
photographs were of persons about which they anticipated making a choice. 
In confirmation of the hypotheses, (a) the intercorrelation of the rankings was 
greater for the choice photographs than for the nonchoice photographs when 
the subjects anticipated an important choice, but not when they anticipated 
an unimportant choice; and (b) the intercorrelation was greater for the choice 
photographs than for the nonchoice photographs when the subjects were ^ ~ 


when they were informed as to the source of their arousal or not aroused. 


aroused by caffeine and uninformed as to the source of their arousal, but not (^ 
= 


This study was designed to investigate two 
possible determinants of the halo effect *: the 
importance of future choice and arousal. The 
hypotheses were advanced within the context 
of choice certainty proposed by Mills (1968). 

According to choice certainty theory, be- 
fore a person takes an action, he wants to be 
certain that it is better than the alternative 
actions. The theory assumes that certainty 
that an action is best will be a function of 
the proportion of total cognitions favoring 
that action; the higher the proportion, the 
greater the certainty. An individual faced 
with a choice will attempt to increase his cer- 
tainty regarding which action is best and will 
avoid decreasing his certainty. This theory is 
similar to Jones and Gerard's (1967) concep- 
tualization of the unequivocal behavioral ori- 
entation. It differs from Janis’ (Mann, Janis, 
& Chaplin, 1969) conflict position, since it 


1 This study was conducted during the tenure of a 
National Institutes of Health Predoctoral Research 
Fellowship and was facilitated by a grant from the 
National Science Foundation to the University of 
Missouri, Judson Mills, principal investigator. 
Thanks are due to Judson Mills for advice and sug- 
gestions and to Russell Geen and Jefferson L. Sulzer 
for comments on an earlier version, 

2 Requests for reprints should be sent to the 
author, Department of Psychology, Newcomb Col- 
lege, Tulane University, New Orleans, Louisiana 
70118, 

3 The halo effect is defined by Jones and Gerard 
(1967) as "The tendency for trait ratings to show 
a higher intercorrelation than would be revealed by 
more objective measurement [p. 7131." 


does not invoke anticipation of postdecisions! 
regret as motivational construct. 

In a recent study, the influence of futur: 
choice upon the halo effect was investigated 
(O'Neal & Mills, 1969). It was reasoned that 
persons uncertain about a future choice re- 
garding other persons may attempt to in- 
crease their certainty by changing their cog- 
nitions about the other persons so that (a) 
more of them favor the preferred persons and 
(b) fewer of them favor the nonpreferred 
persons. The effect of this should be to in- 
crease the intercorrelation of the traits at- 
tributed to the other persons. In the O’Neal 
and Mills study, male college students ranked 
female photographs of two different sizes on 
a number of desirable traits after they had 
been told that half of the photographs of each 
size were of promiscuous girls and that in a 
later experiment they would choose among the 
photographs of one size (the choice set) 
which were the promiscuous girls. It was ex- 
pected that choice photo rankings would be 
more highly intercorrelated than the rankings 
of the other photographs because there would 
be a greater tendency for subjects to rank 
uniformly high those choice photos they favor 
and as uniformly low those choice photos 
they do not favor. As predicted, the intercor- 
relation of the ranking of the photographs in 
the choice set was higher than the intercor- 
relation of rankings of the other photographs. 

The first purpose of the present study was 
to determine the influence of the importance 
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re 


iticipated choice upon the halo effect. 

not explicitly stated in choice cer- 

y (Mills, 1968), it seems reason- 

if a person anticipates making a 

is uncertain, the more important 

the greater the desire to achieve 

i rhe consequences of an unfavorable 

ould be more severe in the case of 

aportant anticipated choice, and so 

to be certain about the choice 
greater. 

greater desire for certainty, an in- 

anticipating an important choice 

abou! which he is uncertain could be expected 

to change even more of his cognitions so that 


they port the favored alternative and do 
not support the nonfavored alternative. This 
shov'd produce a still greater tendency to- 
wat e halo effect for the individual antici- 
pating an important choice involving other 


persons. The first hypothesis of the present 
was as follows: The anticipation of 
:z important choices about other per- 
|l produce a greater tendency toward 
, effect than will the anticipation of 

unimportant choices about other 
persons. 

‘The second purpose of the study involved 
the influence of physiological arousal and 
anticipated choice upon the halo effect. To be 
sure, the concept of arousal is not an entirely 
clear one. Early theorists (e.g-, Malmo, 1958) 
tended to view arousal as a unitary concept, 
hence the ubiquitous term “general ” arousal. 
But some recent, critics (Lacey, 1967; Rout- 
tenberg, 1968; Taylor & Epstein, 1967) 
presented evidence that general arousal is pro- 
duced by the activity of a number of physio- 
logical subsystems. Despite these recent criti- 
cisms, it still seems to make sense to speak 
of arousal in a general sense, since the pro- 
posed subsystems are usually correlated 
(Lacey, 1967). For the present purpose, 
therefore, no attempt was made to distin- 
guish among possible subsystems producing 
arousal. 

Recent research in cognitive dissonance 
suggests that arousal might accompany lack 
of certainty regarding a choice. Although 
choice certainty theory differs from disso- 
nance theory, in that it deals with behavior 
prior to commitment, and dissonance theory 
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does not (Festinger, 1957, 1964), both make 
predictions of changes in cognitive organiza- 
tion. Gerard (1964) argued that cognitive 
dissonance affects arousal. Buckhout (1966) 
recorded heart rate data while subjects were 
verbally reinforced for choosing to read aloud 
statements opposite to their initial attitude. 
Buckhout reported that subjects whose atti- 
tude change was in the direction of the rein- 
forced position showed a higher degree of 
initial heart rate increase and a greater sub- 
sequent decrease than did subjects whose at- 
titude change was not in the reinforced direc- 
tion. 

In a study more directly related to arousal 
prior to choice, Berlyne (1961, Experiment 
II) read words to subjects and asked the sub- 
jects to respond with the first word that came 
to mind. The stimulus words had previously 
been rated by Laffal (1955) for their response 
uncertainty (entropy, H) value: those that 
evoked a wide variety of free association re- 
sponses were rated high in response uncer- 
tainty, and those which produced a narrower 
range of responses were rated low in response 
uncertainty. Berlyne found that subjects’ gal- 
vanic skin response amplitude was greater fol- 
lowing presentation of words high in response 
uncertainty value than words low in response 
uncertainty value. 

Mann et al. (1969) had subjects choose 
between two unpleasant physiological stimu- 
lations and reported that prior to announc- 
ing their choice, the subjects’ heart rates in- 
creased. However, the heart rate increase 
could have been due simply to anticipation 
of the unpleasant stimuli. Despite this reser- 
vation regarding the results of this latter 
study, it does not seem unreasonable to posit 
that an individual who is uncertain about a 
prospective choice will be aroused; and more- 
over, that the less certainty the individual 
has about his anticipated choice, the more 
his arousal. 

If it is true that lack of certainty about an 
anticipated choice is accompanied by arousal, 
then when an individual is aroused, under 
certain conditions he may interpret his arousal 
as that accompanying lack of certainty. Pro- 
viding that the arousal occurs in a situation 
similar to those in the past in which the indi- 
vidual experienced lack of certainty, he may 
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use his level of arousal as an indication of the 
degree of his lack of certainty. If the indi- 
vidual were aroused for another reason—say, 
from a cup of coffee—then an anticipated 
choice provides cues that he may use to at- 
tribute his arousal to that ordinarily accom- 
panying lack of certainty. However, this 
should not occur if the individual were aware 
that a portion of his arousal was due to an- 
other source, because he could “discount” a 
portion of his arousal as being not due to 
lack of certainty (cf. Nisbett & Schachter, 
1966). 

It would seem to follow from this line of 
reasoning that if he is not aware of the source 
of his arousal, the more aroused the indi- 
vidual who is uncertain about an anticipated 
choice, the greater will be his desire to in- 
crease his certainty. With a greater desire to 
increase certainty, he might therefore be ex- 
pected to change more of his cognitions so 
that they support the favored alternative and 
do not support the nonfavored alternative, 
than if he were either less aroused or able to 
discount a portion of the arousal. This should 
result in a greater intercorrelation of the 
traits that he attributes to persons involved 
in his anticipated choice. The second hypothe- 
sis, therefore, was as follows: The anticipa- 
tion of making choices about other persons 
will produce a greater tendency toward the 
halo effect when the individual is more highly 
aroused and unaware that a portion of his 
arousal is from a source other than lack of 
certainty. 


METHOD 
Overview 


Under the guise of a “space-flight drug study,” 
male subjects were told that they would later have 
to choose from among eight photographs the photo- 
graphs of four men they would most like to work 
with in a group task. Then they ranked these photo- 
graphs (choice photos) and eight other (control) 
photos on a number of positive attributes. 

Ten subjects were randomly assigned to each of 
six experimental conditions: aroused-misinformed, 
high importance; aroused-misinformed, low impor- 
tance; aroused-informed, high importance; aroused- 
informed, low importance; not aroused, high im- 
portance; and not aroused, low importance. In the 
high-importance conditions, the subjects were told 
that their choice would definitely determine who 
their partners would be in the group task; and, in 
the low-importance conditions, the subjects were 
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told that there was only a small chance that it 
would determine who their partners would be. In 
the aroused-misinformed conditions, the subject: 
were given caffeine and told that they had received 
a vitamin; in the aroused-informed conditions, they 
were given caffeine and told that it was a stimulant ; 
and, in the not-aroused conditions, they were given 
a placebo and were told that it was a vitamin. For 
half of the subjects in each condition, the choi 
photos were one set of eight photographs; and for 
half of the subjects in each condition, the choice 
photos were the other set of eight photographs 


Procedure 


Male undergraduates in introductory psychology 
were screened by a telephone interview to exclude 
as subjects those students with contraindications for 
caffeine. In the telephone interview, the experi- 
menter introduced himself as a researcher conducting 
a space-flight drug study, who had been given the 
student’s name and telephone number by a member 
of the psychology department. The student was 
asked if he was willing to participate in both of two 
50-minute sessions and if he had any objections to 
taking drugs. Two students who were contacted re- 
fused to participate because the study involved the 
use of drugs. The remainder were asked a number of 
questions, some intended to disguise the true nature 
of the experiment, and others intended to identify 
students for whom taking caffeine would be detri- 
mental to their health. Twelve prospective subjects 
were excluded because they indicated that they had 
suffered from either heart, kidney, or stomach ail- 
ments, Appointments were made with the others, and 
they were directed to the experimental room, in the 
University of Missouri Student Health Center.* 

At the experimental room, the experimenter intro- 
duced himself and stated that the purpose of the 
experiment was concerned with the effect of certain 
drugs upon the group performance of standard 
space-flight tasks: during the first session some base- 
line recordings would be made of the subject’s 
physiological reactions to a drug, and at the second 
session the subject would be given the same drug 
and would work on some space-flight problems with 
four other undergraduates. 

The subject’s pulse was taken at the wrist for a 
full-minute interval by the male experimenter. Then 
the subject was asked to read a description of the 
effects of four drugs identified by the letters A, B, 
C, and D and to sign a statement indicating his 
consent to take one dose of one of them. Drug A 
was described as a vitamin that might result in 
slightly increased appetite and thirst. Drug B was 
described as a stimulant that might result in slightly 
increased heart rate. The drug-description sheet in- 


4 Appreciation must be expressed to William R. 
Galeota, Director of Student Health Services and 
Professor of Psychology at the University of Mis- 
souri, who supervised the administration of caffeine 
e provided research space in the Student Health 

enter. 


duded a statement that experimental considera- 

"tions might not permit informing the subject as to 
"which one of the four drugs he would be given. 

The subject was seated at a table on which were 


“Drug A,” “Drug B,” etc, corre- 
he letter designations of the drugs on 
t sheet the subject had read. Each of 
ined either caffeine in size #00 cap- 
igar placebo capsules of the same 
ne capsules contained 200 milligrams 
amount in 13-2 cups of coffee) ob- 


"tained by crushing two tablets of NoDoz (Grove 

Laboratories Division of Bristol-Myers), plus lac- 
tose. NoDoz is available without prescription, and 
the dosage of 200 milligrams is a therapeutic amount 
recommended on the box for adults. 


Before giving the subject the capsule, the experi- 
menter mentioned that the drug received would be 
the stimulant/vitamin and asked if the subject had 
read the effects of the stimulant/vitamin on the 
sheet. Der upon the condition to which the 
subject had been assigned, the experimenter took 
out a bottle labeled either “stimulant” or “vitamin” 
and gave the subject either the caffeine or placebo 

T capsule. Just before giving the subject the capsule, 

the experimenter said again that the drug he had 

deen assigned to take was the stimulant/vitamin. 

— Then the subject was told that it would take 30 
minutes for the drug to take effect and that the 
experimenter would return in a few minutes. At this 

point the experimenter left the room for 12 minutes. 
— Upon returning, the experimenter introduced the 
M anticipated choice by stating that a second purpose 
of the study was to investigate the effect of crew 
congeniality upon the success of space flights. It 
was explained to the subject that one way to maxi- 
mize the chance of congeniality might be to allow 
the captain of a mission to select his crew members. 
The experimenter went on to say that in order to 
simulate such a selection process, he was using 
photographs of all participants in the study and 
that he would take a Polaroid snapshot of the sub- 
ject before the conclusion of the day’s session. 
Each subject was told that later he would choose 
the photographs of the four men with whom he 
would most like to work in the second session. The 
experimenter told each subject in the high-importance 
condition that he had been randomly designated 
captain and that his choices would therefore deter- 
mine the makeup of his space-fight group in the 
second session. It was explained to each subject in 
the low-imporlance condition that there was only a 
small chance—about 1 in 10—of his randomly being 
designated captain and therefore little likelihood of 
his choice actually determining the makeup of his 
space-flight group. 
. The experimenter provided the subject with a 
rationale for the ranking task by saying that there 
was need for a check on the influence of first-im- 
pression factors in the space-flight group's success, 
and that the check would involve the subject's rank- 
ing photos of the other participants—those cur- 
rently participating and those who participated last 
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semester—according to a number of personality 
traits. It was explained to the subject that he would 
first rank the photographs and then choose the 
photographs of the four men with whom he would 
most like to work. 

Then the experimenter arranged the 16 photos of 
male students at another university in a uniform 
order in front of the subject. The 8 control photos 
each had a small slip of paper attached marked 
"assigned." After the photographs were displayed 
for the subject, the experimenter said: 


You'll notice that some of the photos are tagged, 
and some are not. The tagged ones—let's see, 
there are eight of them—are of men who par- 
ticipated in the study last semester. The photos 
without tags are of current participants, and it 
is from among these that you will later select 
four as being of men you would like to work 
with next time. 


As a check on the effects of the arousal manipu- 
lation, the pulse rate was taken for a second time. 
At this point approximately 30 minutes had elapsed 
since administration of the capsule, The experi- 
menter’s excuse for taking pulse rate again was that 
he had neglected to write down the pulse rate 
earlier and that he had forgotten what it was. 

The subject then ranked the 16 photographs on 
each of eight traits: considerate, energetic, fearless, 
patient, resourceful, responsible, sincere, and studious. 
At the completion of the rankings, the experimenter 
mentioned to the subject that there was something 
about the experiment that he had not told him and 
asked the subject what he thought it could be. None 
of the subjects’ responses indicated that they had 
been aware of the true purpose of the experiment. 
Finally, the experimenter revealed the true nature of 
the experiment to the subject, including the name, 
dosage, and effects of the drug he had taken, and 
asked him not to discuss the experiment with anyone. 


RESULTS 
Effectiveness of the Arousal Manipulation 


Before considering the effects of the experi- 
mental conditions upon the dependent varia- 
ble, it is appropriate to present the results of 
a check on the manipulation of arousal^ The 
mean change in pulse rate in beats per minute 
for subjects in each condition, from prior to 
administration of the capsule to approxi- 
mately 20 minutes after administering the 


5 There were no significant differences by analysis 
of variance among the conditions in initial pulse 
rate, The mean initial pulse rates, in beats per 
minute, were as follows: high importance-misin- 
formed, 76.9; high importance-informed, 77.45 high 
importance-not aroused, 78.1; low importance-mis- 
informed, 77.3; low importance-informed, 78.3; low 
importance-not aroused, 77.5. 
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TABLE 1 
MEAN Purse RATE CHANGE (Ge Brats per MINUTE) 


| Arousal 
|| 
Importance |^ m Pie ET L its i 
d- Aroused- N 

Eerad |! infor ted | aroused | Total 
High 6.2, 6.9, | —2.9 34 
Low 6.7. 3.5, -32. 2.3 

"Total 6.5 | 5.2 | -34 


Note.—^» = 10, Cells not having a subscript in common are 
significantly different at the .05 level by the Duncan multiple- 
range test. 


capsule, is presented in Table 1. Positive val- 
ues in the table represent an increase in mean 
pulse, and negative values, a decrease. 

It may be noted that there was an increase 
in mean pulse rate for the subjects in the 
four aroused conditions and a decrease for the 
subjects in the two not-aroused conditions. 
This latter decrease may have been due to 
the fact that the pulse rate was elevated ini- 
tially as a result of climbing the two flights 
of stairs to the experimental room, and subse- 
quently depressed by a few minutes of rest. 

An analysis of variance of the mean change 
in pulse rate yielded a highly significant ef- 
fect (F = 21.58, df — 2/54, p< .001) for 
the arousal factor. There were no differences 
in mean pulse rate change significant at the 
.05 level by the Duncan multiple-range test 
among individual cells in the aroused condi- 
tions or between the two cells in the not- 
aroused conditions. However, each cell in the 
aroused conditions differed significantly (p < 
.05) from each cell in the not-aroused condi- 


Epcar O'NEAL 


tions. It would appear from these data that 
caffeine had the effect of increasing ph 
logical arousal. Moreover, the data indica 
that this occurred regardless of whether the 
subjects were informed or misinformed abo 
the source of their caffeine-induced arousa) 


Effects of the Importance of Anticipated 
Choice 


Each subject’s ranking of the photograph 
on each trait was converted into two revised 
rankings, one for the choice photos and one 
for the control photos. These revised rankings 
are based on the order, within its set, of the 
subject’s ranking of each photograph. This 
conversion yielded a 1-8 ranking for the 
choice photos and a 1-8 ranking for the con- 
trol photos on each of the traits. Coefficients 
of concordance, Kendall’s W, were computed 
for the revised rankings of the choice photos 
(choice W) and also for the revised rankings 
of the control photos (control W). Kendall’s 
W bears a linear relationship to the average 
Spearman rank-order correlation between all 
possible pairs of the rankings. The means of 
the coefficients of concordance are presented 
in Table 2. 

According to the first hypothesis, the an- 
ticipation of making important choices about 
other persons will produce a greater tendency 
toward the halo effect than will the anticipa- 
tion of making unimportant choices about 
other persons. It was expected from the first 
hypothesis that the choice W should exceed 
the control W by a larger margin in the high- 


TABLE 2 
MEANS OF THE COEFFICIENTS OF CONCORDANCE (W) 
Arousal 
Importance 
Choice-no choice ipee EPSEN Not aroused Total 
Choice E .48 42 48 
High Control 27 ES 36 35 
Choice minus control .26 .07 .06 13 
Choice .50 ER 33 .42 
Low Control 33 A3 AL 39 
Choice minus control 16 .00 —.07 03 
Choice .52 EN 38 45 
Total Control .30 42 38 .37 
Choice minus control 21 .03 —.01 .08 


Note,—n = 10. 


importance conditions than in the low-im- 
portance conditions. The choice W minus 


control W differences are presented in Table 


L2. It will be noted that this difference is 
greater for those subjects in the high-impor- 
tance conditions (.13) than for those in the 
low-importance conditions (.03). 

A repeated-measures analysis of variance 
(Table 3) yielded significance for the Im- 
portance X Choice-No Choice interaction (F 
= 7.61, df = 1/54, p<.01). According to 
prediction, the choice W minus control W dif- 
ferences are significantly greater in the high- 
importance conditions than in the low-im- 
portance conditions. The results of this test 
lend strong support to the first hypothesis. 
Subjects who anticipated an important choice 
regarding other persons demonstrated a 
greater tendency toward the halo effect in the 
traits attributed to these other persons than 
did subjects who anticipated an unimportant 
choice regarding the other persons. 


Effects of Arousal 


The second hypothesis stated that the an- 
ticipation of ng choices about other per- 
sons would produce a greater tendency toward 
the halo effect when the individual is more 
highly aroused and unaware that a portion of 
his arousal is from a source other than lack 
of certainty, From this second hypothesis it 
would be expected that the choice W minus 
control W difference should be greater in 
the aroused-misinformed conditions than 
in either the aroused-informed or not- 
aroused conditions. A potential obstacle to 
the interpretation of the choice W minus 
control W differences would exist if there 
were differences among the conditions in 
control W. Indeed, the control W appears (see 
Table 2) to be lowest in the aroused-misin- 
formed conditions. However, an analysis of 
variance of the control Ws revealed that the 
effects of the arousal factor upon control Ws 
is not significant (F = 2.55, df = 2/54). 
Therefore, there appears to be no compelling 
reason why the choice W minus control W 
differences could not provide evidence for the 
second hypothesis. 

The arousal factor produced significant 
effects upon the choice W minus control W 
differences, these effects being evident in the 
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TABLE 3 


SuwwARY OF REPEATED-MEASURES. ANALYSIS OF 
VARIANCE OF MEANS OF COEFFICIENTS 
or Concoxpance (W) 


| ar ieee [ee 


Source 
Between | 59 
Importance (A) aE 9.64 09 
Arousal (B) 2 86.48 | .79 
AXB 2| 928 08 
Error 54 | 109.31 
Within 60 
Choice-no choice (C) | 1 524.82 18.22* 
AXC 1| 219.01 1.61* 
BXC | 2| 364.34 12.65* 
AXBXC | 2 6.84 24 
Error 54 28.79 
*p <0 


significant Arousal x Choice-No Choice in- 
teraction (F = 12.65, df = 1/54, p < 01) in 
the repeated-measures analysis of variance 
(Table 3). Comparisons were made of the 
difference among the three levels of the 
arousal factor in the choice W minus control 
W difference. The difference between the 
aroused-misinformed conditions and the 
aroused-informed conditions was significant 
beyond the .01 level (t= 3.47, df = 38). 
Also, the difference between the aroused-mis- 
informed and unaroused levels reached the 
same significance (¢ = 3.62, df = 38, p< 
01). No significance was obtained for the 
difference between the aroused-informed and 
the not-aroused levels of the arousal factor. 
In confirmation of the second hypothesis, 
arousal increased the influence of choice on 
the halo effect when the person aroused was 
unaware of the source of his artificially in- 
duced arousal. 


DISCUSSION 


While the results strongly support the two 
hypotheses, one other aspect of the data 
deserves consideration. Although not explic- 
itly hypothesized in the present study, it 
would be expected that the rankings of the 
photographs involved in the future choice 
would be more highly intercorrelated than the 
rankings of the other photos, that is, that 
the choice W would exceed the control W for 
the subjects in all conditions. 'This outcome 
would seem to be in harmony with those re- 
sults reported by O'Neal and Mills (1969). 
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The significant main effect obtained for 
the choice-no choice factor in the analysis of 
variance (Table 3) does not permit an un- 
qualified conclusion, however, because of the 
interactions between choice-no choice and 
arousal, and choice—no choice and importance. 
The choice W was significantly greater than 
the control W only in the two aroused-mis- 
informed conditions and in the aroused-in- 
formed, high-importance condition. The evi- 
dence indicates that anticipated choice in- 
creases the halo effect only when the choice is 
important and the individual is aroused. 

This result is not necessarily in conflict 
with the O’Neal and Mills (1969) results. 
They reported a higher intercorrelation in the 
rankings of choice photos by male subjects 
who anticipated choosing which photos were 
of “promiscuous girls.” It can plausibly be 
argued that the situation confronted by the 
subjects in the O’Neal and Mills study re- 
sembles most closely that faced by the sub- 
jects in the aroused, high-importance condi- 
tions in the present experiment. Choices re- 
garding photographs of promiscuous girls 
could well have produced arousal. Also, 
choices regarding the promiscuity of females 
would seem to be at least as important for 
male subjects as a choice of preferred part- 
ners in a group task. The results of the 
present study replicate the study by O’Neal 
and Mills if it is assumed that the choice in 
the earlier study was important and that the 
subjects were somewhat aroused. 

The results, together with the foregoing ex- 
planation of the situation faced by subjects in 
the O’Neal and Mills (1969) study, make 
possible tentative conclusions about two as- 
sumptions implicit in choice certainty. It 
would seem that the predicted effects of lack 
of certainty regarding an anticipated choice 
occur only if the choice is a fairly important 
one, While it is not possible to say precisely 
how important an anticipated choice must be 
before the effects of lack of certainty are 
produced, it seems clear that lack of cer- 
tainty regarding a trivial choice (perhaps such 
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as the one faced by the subjects in the 
importance condition) does not produce th: 
Finally, it would appear that arousal i 
important component of the motivatic: 
increase certainty regarding an anticipa 
choice. 
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OBSERVER PERSEVERANCE AS RELATED TO A 


MODEL'S 


SUCCESS: 


A SOCIAL COMPARISON ANALYSIS * 


SEYMOUR M. BERGER °? 


University of Massachusetts 


Lewis and Duncan demonstrated that an observer's perseverance on à task 


could be influenced by a model's success, 
directly rewarded. Subsequent studies, however, 
of the observer was not necessary. Berger and 


but only when the observer was 
indicated that direct reward 
Johansson suggested that this 


inconsistency could be resolved in terms of social comparison theory. Lewis 


and Duncan used the exprimenter as 


a model, while subsequent studies used 


a peer. It was argued that a peer provided a relevant basis for comparison, 
while the experimenter did not. To test this analysis, observers were exposed 


io either a peer model or 
success or 75% success schedules. As 


an experimenter's assistant 


model under either 25% 
predicted, differential effects in observer 


perseverance were found with the peer model but not with the assistant 


model, although demand characteristics in the 
however, supported the social comparison 


to these effects. Additional research, 


situation might have contributed 


hypothesis, The model's schedules had a differential effect on observer per- 


severance when the model and observer held similar opinions, 


ential effect did not occur when they 


It is well established that subjects who are 
partially reinforced (e.g., rewarded on 25% 
of the trials) during acquisition subsequently 
show greater resistance to extinction than 
subjects who are continually reinforced 
(100% reward) on all trials during acquisi- 
tion. Partial reinforcement phenomena are of 
considerable importance to social psychology, 
particularly to social learning, because it is 
believed that partial reinforcement is more 
typical of actual social learning situations 
than continuous reinforcement (e.g, Bandura 
& Walters, 1963). While most of the research 
in partial reinforcement has focused on indi- 
vidual learning, there are some studies that 
have developed an analog involving an obser- 


1 This research was supported by National Science 
Foundation, Research Grants GS 1702 and GS 2746. 
The author wishes to express his appreciation to 
Susan Palmer and Mary Blehar for their assistance 
as experimenter and model, respectively, in Experi- 
ment I, and to Adrienne Cozzolino and Betty Zalis 
for their assistance as experimenter and model, re- 
spectively, in Experiment II. In addition, the author 
is grateful to Ivan Steiner for helpful comments 
regarding an earlier draft of this study. Experiment 
I was completed while the author was a faculty 
member at Indiana University. 

2 Requests for reprints should be sent to the 
author, Department of Psychology, University of 
Massachusetts, Amherst, Massachusetts 01002. 


but this differ- 
held opposite opinions. 


vational learning situation. The observational 
learning analog involves two people. One per- 
son (model) participates in the acquisition 
phase, while the second person (observer) 
participates in the extinction phase. The ob- 
server watches the model during acquisition, 
but the observer does not participate in the 
acquisition phase directly. 

The term analog is used advisedly here to 
emphasize that the studies of partial rein- 
forcement effects through observation de- 
scribed below involve experimental procedures 
that are not identical with the partial rein- 
forcement paradigm that is typically em- 
ployed in individual learning situations— 
especially the operant conditioning situation. 
In the individual learning situation, there is 
usually no break between the acquisition and 
extinction phases of the paradigm, but in the 
observational learning analog a break occurs 
after acquisition, when the observer takes the 
place of the model. In the individual learning 
paradigm, the measure of extinction usually 
involves a gradual decrement in response 
strength over trials, but in the examples of 
the observational learning analog to be pre- 
sented, the measure of extinction in two of 
the studies (Berger & Johansson, 1968; Lewis 
& Duncan, 1958) was the number of trials 
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until the subject stopped the experiment (i.e., 
a sudden decrement); and in a third study 
(Rosenbaum & Bruning, 1966), extinction 
was not measured at all, only frustration ef- 
fects. Finally, individual learning studies of 
partial reinforcement effects require the sub- 
ject to make the same specific response during 
acquisition and extinction, but in one of the 
studies. (Berger & Johansson, 1968), the 
model and observers used the same response 
categories but did not perform the same 
specific acts; furthermore, the model and 
observers responded to somewhat different 
stimuli, 

Although the studies reported here have 
clear foundations in the partial reinforcement 
literature, the theoretical implications of the 
variations that have been introduced in the 
observational learning analog are not clear. 
Consequently, it may be misleading to imply 
that the results obtained with the observa- 
tional learning analog are strictly comparable 
to those obtained with the individual partial 
reinforcement paradigm. The observational 
learning analog may be a useful descriptive 
framework, but it may lead to unwar- 
ranted theoretical implications drawn from 
traditional partial reinforcement literature. 

In order to minimize possible confusion, a 
different descriptive framework is employed 
in this study. The term success is used instead 
of reward or reinforcement, because it has 
fewer theoretical overtones and yet ade- 
quately describes the outcome of a trial on 
which the subject is correct. When resistance 
to extinction refers to the number of trials 
until the subject voluntarily stops working 
on the task, the term perseverance is sub- 
stituted because it is descriptively appropriate 
and theoretically neutral. 

For purposes of consistency in this study, 
these terms are substituted when suitable for 
the terminology originally used by the authors 
of the studies to be reviewed. 

The initial study of the influence of a 
model's success on an observer's perseverance 
was carried out by Lewis and Duncan (1958). 
They were not able to produce, as they pre- 
dicted, greater perseverance in observers who 
merely watched a model who was successful 
2595 of the time, as opposed to observers 
who merely watched a model who was success- 
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ful 100% of the time. They did find, how 
ever, that observers who watched a : 
who was 25% successful, amd received 
each time the model received a 10¢ reward 
during the acquisition phase, showed greater 
perseverance than observers who watched 
model who was 100% successful and rece: 

a 10¢ reward each time the model received 
10¢ reward during acquisition. Subsequent 
Rosenbaum and Bruning (1966) reported 
that children who were merely exposed 
models who were 100% successful subs 
quently showed more frustration under cond 
tions of extinction (ie. no success) th 
children who were merely exposed to a mod: 
on a 50% success schedule. In other word 
these authors reported that they were ab! 
to produce differential frustration effects ; 
observers without rewarding them during ti 
acquisition phase. More recently, Berger an 
Johansson (1968) demonstrated that ob- 
servers merely exposed to a model who wa: 
25% successful during acquisition showe: 
greater perseverance than observers expose: 
to a model who was 100% successful. Thus 
Rosenbaum and Bruning and Berger and 
Johansson provide evidence that mere ob- 
servation of a model on a partial succes: 
schedule can influence an observer’s subse- 
quent behavior, while Lewis and Duncan 
found it necessary to reward the observer 
directly. 

While all three studies had certain common 
features, they also differed in many details. 
Lewis and Duncan (1958) used a modified 
slot machine task, Rosenbaum and Bruning 
(1966) used a lever-pulling task, and Berger 
and Johansson (1968) used an extrasensory 
perception (ESP) card-guessing task; Rosen- 
baum and Bruning used children as models 
and observers, while the other two studies 
used adults; Lewis and Duncan used the 
experimenter as a model, while the other 
studies used a peer as a model. 

Tn their analysis of the effects of differential 
model success on observer perseverance, 
Berger and Johansson suggested that the 
agreements and discrepancies among these 
studies could be understood in terms of 
Festinger's (1955) social comparison theory. 
The aspect of their analysis that is relevant 
to this research is concerned with the type of 
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node! used in the three studies. Berger and 
wgued that observers judge their 
mance in these relatively ambigu- 
mental situations by comparing 
erformance with the model’s per- 
; a standard. According to Fes- 
1955) social comparison theory, 
The dency to compare oneself with some 
other 


ic person decreases as the differ- 
ence en his opinion or ability and 
one’s increases [p. 150].” Berger and 
Joha suggested that the experimenter, 
who | nably is familiar with the experi- 
mental situation and task, is a less relevant 
basis comparison than a peer. Conse- 
quently, Rosenbaum and Bruning and Berger 
and Johansson were able to demonstrate ef- 
fects of differential model success on observer 
behav without direct reward to the ob- 
serve cause they used a peer as a model, 
while wis and Duncan did not obtain such 
effects because the experimenter was an 
irrelevant model. 
Experiment I was undertaken to see if this 


interpretation could be supported. An ESP 
card-zvessing task, similar to the one used by 
Berger and Johansson (1968), was employed 
in this study. For half of the observers in 
this study, the model was described as a 
peer; for the other half of the observers, the 
model was described as an assistant of the 
experimenter.* Within each of these model 
conditions, half of the observers watched the 
model on a 25% success schedule, while the 
other half watched the model on a 75% 
success schedule." Subsequently, all observers 


3 Lewis and Duncan (1958) also used verbal de- 
scriptions of a model’s performance and did not find 
mere observational effects. It is difficult to assess 
whether these descriptions provided a relevant basis 
for social comparison. 

t Because of the nature of the task in this study, 
it was not feasible to use the experimenter as the 
model. It was felt that the assistant of the experi- 
menter would closely approximate the relevant model 
characteristics, that is, someone who is closely asso- 
ciated with the experiment and who presumably is 
familiar with the task. 

5Tn this study, a 75% success schedule was em- 
ployed in place of the 100% success schedule used 
previously. This change seemed desirable in order to 
make the model's performance appear credible. The 
theoretical considerations that are applicable to the 
100% schedule apply as well to the 75% schedule. 
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participated in the experiment under condi- 
tions where they were never successful. The 
hypothesis was that observers exposed to a 
peer model under 25% success would show 
greater perseverance than observers exposed 
to a peer model under 75% success. Further- 
more, no difference in perseverance would 
be produced by the two success schedules 
when the model was the experimenter's 
assistant. 


EXPERIMENT I 
Method 


The 64 male and female subjects in this experiment 
were recruited from the introductory psychology 
classes at Indiana University, Their participation in 
the study partially fulfilled a course requirement, 

Subjects were run individually and were assigned 
to one of the four experimental treatments in the 
order of their appearance for the experiment. Each 
condition contained 16 subjects. When a subject 
assigned to one of the peer-model conditions ar- 
rived, the model was introduced as “another sub- 
ject.” When the subject was assigned to one of the 
"assistant-model" conditions, the model was intro- 
duced as the experimenter's assistant who would 
help demonstrate the task. The same person played 
both model roles and, of course, was à confederate 
of the experimenter. 

The observer and the model were seated opposite 
each other at a table. The table had a rectangular 
area cut out in front of each participant which was 
covered by a piece of transparent celluloid. The 
celluloid had a triangular-shaped target inscribed on 
it, with the apex of the triangle pointing toward the 
participant. A line from the apex to the base bisected 
the triangle. The area of the triangle on the par- 
ticipant's left was outlined in red, while the area 
on the right was outlined in black. The observer 
could see both targets before being seated, but a 
low opaque partition blocked the observer's view of 
the model's target when he was seated. A television 
camera was mounted below the target area on the 
observer's side. No camera was mounted below the 
target area on the model’s side, but this was not 
apparent to the observer even before he was seated, 
A small plastic pendulum suspended from a chain 
was placed in front of each participant.® 

The experimenter read the instructions, which 
stated that the experiment was a study of ESP that 
employed a new experimental technique. It was ex- 
plained that previous studies required subjects to 
verbalize their thoughts. It was suggested that the 
subjects might receive thought messages that they 
were unable to verbalize and that as a consequence, 
some earlier studies were unable to demonstrate ESP. 
In this study, the subject would hold a pendulum 
that has been shown to be a more sensitive measure 


9'These pendulums are used in Kreskin’s ESP 
game, available from the Milton Bradley Company. 
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TABLE 1 


MEAN AND MEDIAN NUMBER oF TRIALS TO QUITTING 
AS A FUNCTION or MODEL AND Success 
Conprrions: EXPERIMENT I 


M | Mdn 
l 
Model | Success schedule | Success schedule 
| 25% | gs | 25% | 75% 
Peer 40.31 24.00 27.00 16.00 
Assistant 63.25 63.06 39.00 40.50 


of one's thoughts. The experimenter had a deck of 
cards in front of her. The deck was not an ordinary 
deck of cards, since it did not contain an equal 
number of cards from red and black suits. The 
subject would hold the pendulum over the apex of 
the target in front of him. The experimenter would 
then turn up a card and think of the color of the 
suit she saw on the card, The subject was to think 
of a color, red or black, while holding the pendulum. 
The television camera would record the movements 
of the pendulum to determine whether it moved 
toward the red or black area. After a few seconds 
(actually standardized as 10 seconds), the experi- 
menter would ask the subject to tell her what color 
the subject had in mind. The subject was allowed 
to rest his arm between trials. Before beginning, the 
observer and model were asked to practice a bit. 
(This was done to convince the subject that the 
pendulum would move by itself, or at least so it 
would appear. In almost all cases, the pendulum did 
move at least a little while the subject thought of 
one of the colors during this practice period.) 

If the model was introduced as an assistant, the 
experimenter explained that the assistant would 
demonstrate the procedure. If the model was intro- 
duced as a peer, the experimenter explained that the 
“other subject" would go first. Except for the dif- 
ferences in the introduction of the model and these 
instructions, all other aspects of the procedure 
were the same for observers within each of the 
reinforcement conditions. 

Before the model's eight trials began, the experi- 
menter handed the observer a deck of cards and 
asked him to shuffle the deck. (This deck was not 
used, however.) 

Two schedules of success were used. If the model 
was under the 25% success schedule, she gave correct 
judgments on Trials 2 and 8. If the model was under 
the 75% success schedule, she gave correct judgments 
on all trials except 2 and 8. After each guess, the 
experimenter held up the card that she was looking at 
so that it was in full view of the model and observer. 
In this way the model and observer could determine 
whether or not the model was successful. (Actually, 


7 This instruction was given so that the subjects 
would not be able to determine the chance level of 
being correct. 
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the experimenter manipulated the model's success by 
drawing a card from one of the two piles of car 
in front of her—one pile containing red suit card 
the other, black suit cards. The experimenter 
behind a shield so that the observer could not 
the manipulation.) 

After the model's trials were over, the model w 
excused. The observer was told that he would n 
participate in the same experiment, except that th 
was no set of number of trials in his case. He cou! 
continue in the experiment until he wished to stop 
When he wanted to stop, he simply had to say so 
to the experimenter. 

In this phase of the experiment, the experimenter 
drew cards from the card piles in front of her so 
that the observer was never correct. The number oí 
trials until the observer stated that he wanted t 
stop was used as a measure of perseverance. 

The experiment wascexplained to each observer, 
and he was asked not to discus the experimen: 
with anyone. 


Results 


Because of extreme heterogeneity in vari- 
ance between conditions and highly skewed 
distributions of scores, the Mann-Whitney U 
test was used for the statistical analysis of 
the data. The data are summarized through- 
out this study in terms of means and medians 
since the median may be more representative 
of the data than the mean, given the skewed 
distributions. 

Table 1 presents the mean and median 
number of trials to quitting for each condition. 

The observers who were exposed to a peer 
model who received 25% success showed 
greater perseverance than observers who wit- 
nessed the peer model receive 75% success. 
The Mann-Whitney U was 69 (N — 16/16, 
p<.05). No reliable difference was found 
in the observers’ perseverance between the 
two success schedules with the assistant model 
(U — 107, N — 16/16). 

An unanticipated difference appeared be- 
tween the two model conditions. Observers 
in the assistant-model conditions partici- 
pated longer than observers in the peer-model 
conditions (z = 3.11, p< .01). 


Discussion 


In accordance with the hypothesis, no dif- 
ferential effects in observer perseverance were 
found when the assistant model was used, but 
such effects were found when the peer model 
was used. Although the social comparison 
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analy :s account for the results of the 
exper another interpretation of the re- 
sults yt been eliminated. It is possible 
that were responding to subtle 
dem: aracteristics in the experimental 
situa rhaps observers felt some obliga- 
tion t the experimenter obtain evidence 
for ; the situation involving a peer 
mod :e 75% success schedule, observers 
mas ‘elt less obligated to keep trying 


(sin xperimenter had one good subject 


alrea han observers who were exposed to 
the essful (25%) peer model. Conse- 
quen ; the peer-model condition, a pos- 
sible » why observers participated longer 
aíter hing the 25% successful model, as 
oppo to the 7546 successful model, was 
that were trying to help the experimenter 
obtai xme positive evidence of ESP. In 
the tant-model condition, however, the 
mode performance did not provide real 
data, the observers may have felt equally 
obligaic& in both the 25% and 75% success 
conditicns to keep trying to provide positive 
evidence for ESP. Thus, the observers in the 
assistan!-model conditions did not show dif- 
ferentis] perseverance. In addition, the fact 
that observers in the assistant-model 
condi! generally participated longer than 
observers in the peer-model conditions may 


be the result of another sort of demand 
characteristic in the situation. When the peer 
model was used, the observers may not only 
have developed an expectancy for task suc- 
cess, but also may have developed an expec- 
tancy regarding how much time to put in for 
one credit toward fulfilling the course re- 
quirement, since the “other subject” got one 
credit for participating in just eight trials. 
The assistant model did not provide this kind 
of information, since his participation was 
not for credit, Thus, observers in the 
assistant-model conditions may have felt a 
greater obligation to participate for the full 
45 minutes for which the experiment was 
scheduled. 

In order to provide a better test of the 
social comparison hypothesis, à second ex- 
periment was undertaken which eliminated 
some of the possible demand characteristics 
that were present in Experiment E 
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EXPERIMENT II 


In this study, the manipulation of the con- 
ditions for social comparison was accom- 
plished by varying the similarity of the model 
and the observers opinion regarding the 
existence of ESP. Social comparison theory 
states that the tendency to compare oneself 
with another person tends to decrease as the 
difference between his opinion and one's own 
increases (Festinger, 1955). Therefore, if the 
model expresses an opinion regarding ESP 
that differs from that of the observer, the 
observer should be less likely to compare his 
own performance with that of the model than 
the observer who holds an opinion that is 
similar to the model's. 

It was hypothesized that observers exposed 
to a model expressing an opinion regarding 
the existence of ESP that was similar to 
their own would be more likely to show dif- 
ferences in perseverance as à function of 
the degree of the model’s success than the 
observers who were exposed to a model 
expressing an opposite opinion. 


Preliminary Study 


A study was undertaken in the spring of 
1970 to test this hypothesis. Unfortunately, 
because of campus disruptions and the stu- 
dent strike that occurred during that semester, 
the number of subjects obtained was less 
than planned. Nevertheless, the results of 
this investigation are reported because they 
contribute some relevant information. 


Method 


The 48 male and female subjects in this experi- 
ment were recruited from the introductory psychol- 
ogy course at the University of Massachusetts, 
Amherst. The subjects were paid $1.25 for their 
participation. Theit participation did not fulfill a 
course requirement, 

As in the previous study, the subjects were run 
individually and were assigned to the model success 
conditions in advance of their appearance for the 
experiment. The model in this study was a con- 
federate of the experimenter and played the role of 
a peer student who had signed up for the experiment. 
When the model and the observer appeared for the 
experiment, the model was seated at a table opposite 
the experimenter and behind a screen. The observer 
was seated behind and to the right of the experi- 
menter. Before starting, the experimenter asked the 
model and observer (in that order) whether or not 
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TABLE 2 


MEAN AND MEDIAN NUMBER OF TRIALS TO QUITTING 
AS A Function or MODEL AND Success 
CONDITIONS: PRELIMINARY STUDY 


M Mdn 
Model Success schedule | Success schedule 
25% 15% 25% 15% 
Similar opinion 86.40 | 25.92 | 34.00 | 20.00 


Different opinion 39.80 | 45.50 | 22.50 | 23.50 


they believed in ESP. For half of the subjects, the 
model reported that she believed in ESP, and for 
the rest she reported that she did not believe in ESP. 
The experimenter recorded their responses. 

Then the experimenter read the instructions that 
were similar to the ones used in Experiment I. In 
this study, the pendulum and target were not used. 
The instructions simply stated that this was a study 
of ESP. The subject's task was to guess which one 
of the four suits of cards the experimenter had 
picked up from the deck in front of her. The deck 
of cards was described as a special deck that did 
not contain an equal number of cards from each 
suit. The observer was asked to watch and was told 
that his turn would come after the other person 
had finished. The experimenter handed the deck of 
cards to the observer and asked him to shuffle the 
deck, Then the experimenter picked up the top card 
and held it up so that the observer could see it. 
The model could not see the card because of the 
screen that separated her from the experimenter, 
When the experimenter said, “now,” the model 
called out a suit. 

The model could see the experimenter's feet under 
the table. By a prearranged code involving different 
foot positions, the experimenter could signal the 
model as to which suit she was holding. The model 
was given eight trials. 

Two schedules of success were used: 25% and 
75%. If the model was to be correct on 25% of the 
trials, the experimenter signaled the correct suit only 
on Trials 2 and 8; if the model was to be correct 
75% of the time, the experimenter signaled the 
correct suit on all trials except 2 and 8. After each 
guess, the experimenter held up the card that she had 
in her hand so that the model could see it over 
the screen, After the eighth trial, the model was 
excused, and the observer was asked to take her 
position. The observer was told that he would 
participate in the same experiment, except that in 
his case there was no set number of trials. He 
could participate for as long as he wished. When 
he wished to stop, he need only say so to the experi- 
menter. The experimenter handed the deck of cards 
to the observer and asked him to shuffle the deck. 
While the observer was shuffling the deck behind 
the screen, the experimenter removed another deck 
of cards from her drawer that was prestacked, so 
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that she could always have access to a card from a 
suit that was different from the one called out by 
the observer. After receiving the reshuffled deck from 
the observer, the experiment proceeded. In this p 
of the experiment, the observer was never cory 
After the observer's guess on each trial, the exper 
menter held up a card of a different suit from the 
one he had just guessed. The experimenter recorde 
the number of trials until the observer asked her t 
stop. The experimenter did not know the hypothe: 
of the experiment. 

After the last subject was run, the observers wer 
assigned to groups for data analysis. If, at the 
beginning of the experiment, the observer had ex 
pressed an opinion regarding ESP that was the 1c 
as the models, he was considered similar; if his 
opinion was opposite, he was considered differen 
Combined with the two conditions of model reward, 
this arrangement produced four groups: 25% suc 
cess, similar opinion; 75% success, similar opinio 
25% success, different opinion; 75% success, diffe: 
ent opinion. When the experiment was terminated 
there were 15 subjects in the first group, 13 subject 
in the second group, and 10 subjects each in th 
third and fourth groups. 


Results 


The mean and median number of trials to 
quitting for each group are reported i: 
Table 2. Observers exposed to a similary 
opinion model under 25% success showed 
greater perseverance than observers exposed 
to a similar opinion model under 75% success, 
but the Mann-Whitney U did not quite reach 
the .05 level (two-tailed) of significance 
(U = 59.5, N = 15/13, p < 10). The dif- 
ference between the 25% and 75% model 
success conditions with the different opinion 
model was small and did not approach sta- 
tistical significance (U = 210, N = 10/10). 

No difference was found in the observers’ 
perseverance when all observers exposed to 
the similar opinion model were compared with 
all those exposed to the different opinion 
model (z = .04). There were no reliable dif- 
ferences between observers in the two model- 
opinion groups within the 75% success condi- 
tion (U = 50, N = 13/10) or within the 25% 
success condition (U = 63, N = 15/10). 


D 


Complete Study 


As a follow-up to the preliminary study, a 
complete study was run during the summer 
of 1970. This study was essentially the same 
as the preliminary one except for two 
changes: (a) Instead of using a live model, 


a 


à 


OBSERVER PERSEVERANCE TO A Mopet’s Success 


the model was presented on videotape. (5) In 


this stt the similarity manipulation was 
improved having the subject state his 
opinion before the model’s opinion was re- 
vealed. This procedure provided better con- 
trol over the similarity manipulation and 
eliminated the possibility that factors influ- 
encing one’s readihess to agree or disagree 
with another’s opinion would be confounded 
with the similarity manipulation. 


-two students were recruited from the 
psychology course at the University of 
tis, Amherst. Their participation in the 
ially fulfilled a course requirement. 
iructions to the subject were generally the 
; the preliminary study. The subject was 
» would be shown how a previous subject 
ormed on the ESP task before he started. 
venter then asked the subject whether or 
ved in ESP, After the subject responded, 
»enter told him that the subject he would 
videotape had expressed either the same or 


see on 


opposite opinion, depending upon the prearranged 
schedule. Subjects were assigned to one of the four 
conditic (ie. 25% success, similar opinion; 7596 
success, similar opinion; 25% success, different opin- 
ion; 75% success, different opinion) in the order in 
which they appeared for the experiment. There were 
18 subje n each group. 

The subject watched the videotaped performance 
of a confederate model on a 22-inch television 
monitor, The view that the subject had of the model, 
the screen, and the experimenter was from directly 
behind the model, a view that was about the same 
as he had from the chair in which he was seated. 


After the usual eight trials in which the model was 
either 25% or 78% correct, the experimenter gave 
the same instructions regarding the unlimited number 
of trials for the observer and his option to stop 
the experiment whenever he wished. 


Results 


Table 3 contains the means and medians 
for the four conditions of this experiment. 
Again, observers exposed to the similar 
opinion model under 25% success showed 
greater perseverance than observers exposed 
to the similar opinion model under 757% suc- 
cess conditions (U — 91, N = 18/18, P< 
.05). No reliable difference was found be- 
tween observers in the 25% and 75% success 
conditions when a different opinion model 
was used (U — 133, N — 18/18). 

Observers exposed to a similar opinion 
model did not differ from those exposed to 
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TABLE 3 


MEAN AND MEDIAN NuMBER OF TRIALS TO QUITTING 
AS A FUNCTION OF MODEL AND SUCCESS 
CONDITIONS: EXPERIMENT IT 


M Mdn 
Model Success schedule | Success schedule 
25% 15% 25% 15% 
Similar opinion 73.12 | 38.33 | 44.00 | 26.50 
Different opinion 50.78 | 34.00 | 36.50 | 35.00 


a different opinion model with regard to per- 
severance (z = .80). Within the 75% success 
condition, observers in the two model-opinion 
groups did not differ reliably in perseverance 
(U = 161.5, N = 18/18). The same compari- 
son for observers within the 25% success 
condition also produced no reliable differences 
(U = 126, N = 18/18). 


DISCUSSION 


Before offering an interpretation of these 
findings, it is necessary to examine some 
confounding that occurred in the design of 
Experiment II. Seventy-seven percent of the 
observers in the preliminary study and 7596 
of the observers in the completed study re- 
ported that they believed in ESP. Since the 
percentage of observers who reported that 
they believed in ESP was almost exactly the 
same in both of the success conditions and in 
both of the model-opinion conditions, there 
was no confounding of the observer's opinion 
with the success or model-opinion manipula- 
tions. However, the fact that a high per- 
centage of observers believed in ESP led to 
a confounding of the similarity manipulation 
and the models opinion. In the similar 
opinion conditions, most of the observers were 
told that the model believed in ESP because 
most of the observers stated that they be- 
lieved in ESP; in the different opinion condi- 
tion, most observers were told that the model 
did not believe in ESP because most of the 
observers said they believed in ESP. There- 
fore, it is possible that the differences reported 
in perseverance effects for the two opinion 
models were not related to similarity of 
opinion between the model and observer but 
to the model's opinion per se. 
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In order to evaluate this alternate interpre- 
tation, additional comparisons were made. The 
data from the preliminary and completed 
studies were combined to obtain an adequate 
sample size for these analyses. The effect of 
the model’s opinion per se was examined by 
comparing the perseverance of observers ex- 
posed to a model who apparently believed in 
ESP with the same measure for observers 
exposed to a model who apparently did not 
believe in ESP. Within the 75% success con- 
dition, the difference was small and not sta- 
tistically reliable (z = .85, Mann-Whitney U 
test). The same comparison within the 25% 
success condition revealed no reliable differ- 
ence (z= 1.44). Therefore, the model’s 
opinion does not appear to be related to the 
observer’s perseverance within either of the 
success conditions. As a further check on the 
effect of the model’s opinion, comparisons 
were made between the two success conditions 
when the model’s opinion was held constant. 
There was a trend for observers in the 25% 
success condition to show greater perseverance 
than observers in the 75% success condition 
when the model believed in ESP, as well as 
when the model did not believe in ESP 
(z = 1.85, p = .06; z = 1.87, p = .06, respec- 
tively). Again, the model’s opinion per se did 
not seem to be a significant factor. However, 
when the effect of similarity of opinion be- 
tween the model and the observer was exam- 
ined with these combined data, the differential 
effects of degree of model success that were 
reported in Experiment II reappeared. The 
differential effect on observer perseverance 
was found when a similar opinion model was 
used (z = 2.76, p = .006), and no such effect 
was in evidence when a different opinion 
model was used (2 = .53). 

With this alternate interpretation elimi- 
nated, the results reported for Experiment IT 
provide clear support for the social compari- 
son hypothesis. The differential effect on ob- 
server perseverance was found when a similar 
opinion model was used, but not with the 
different opinion model. The fact that the 
completed study replicated the findings of the 
preliminary study provides substantial evi- 
dence of the reliability of these relationships. 

The results of Experiment I cannot be 
interpreted unequivocally as support for the 
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social comparison analysis because of the 
confounding with demand characteristic 
However, the fact that the viability of the 
social comparison analysis is clearly deny 
strated in Experiment II is taken as an i: 
cation that this analysis is a plausible inte 
pretation for the differential effects of t 
type of model on observer perseverance 
Experiment I. 

These studies did not examine possi! 
interpretations for the differential effects 
the model's success conditions on observ 
perseverance. The predictions regarding th: 
effects of a model’s partial success on observ: 
perseverance are generalizations from the e: 
pirical findings in the partial reinforcemen 
literature. There are a number of ways i 
which these predictions could be explained in 
an observational learning setting. 

Although Lewis and Duncan (1958) and 
Rosenbaum and Bruning (1966) offered some 
what different interpretations of their finding: 
that took into account the particular depen- 
dent variables in their studies, their interpre 
tations were derived from the expectancy- 
frustration hypothesis, which has been used 
to account for partial reinforcement effect 
in individual learning situations. According 
to this hypothesis (see Hilgard & Bower, 
1966, pp. 488—490), the greater the anticipa 
tion or expectation of reward that is built up 
during acquisition, the greater the frustration 
that is produced by nonreward during extinc- 
tion. The effect of frustration is to evoke 
avoidance responses that interfere with the 
instrumental response and eventually produce 
extinction of the instrumental response. Thus, 
subjects who were given large percentages of 
reward during acquisition were more frus- 
trated during extinction than subjects who 
received smaller percentages of reward during 
acquisition, Furthermore, smaller percentages 
of reward during acquisition supposedly in- 
crease the subject’s tolerance for frustration 
during extinction because the subject has 
learned (during acquisition) that frustration 
is followed by reward. Consequently, sub- 
jects who received smaller percentages of 
reward during acquisition should extinguish 
more slowly than subjects who received larger 
percentages of reward. 

It is generally assumed that anticipation 
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of rewa r frustration can be conditioned 
to stir that precede the occurrence of 
rewat | frustration. These stimuli may be 
associ vith the instrumental behavior in- 
volve he experimental task as well as 
with stimuli in the situation, including 
elemen f the task itself. 

If t inalysis is applied to the observa- 
tiona rning analog, it is assumed that the 
obser’ quires expectations for reward or 
succe ;sed upon the degree of the model's 
succes the task. When the observer subse- 
quent! wiüicipates in the task without 
succes he degree of frustration produced 
and his tolerance for frustration are related 
to the expectations he developed as a result 
of watching the model. However, the results 
of the present study indicate that this analy- 
sis may only apply when the observer is 
exposed to an appropriate model for social 


comparison (ie. either a peer model or a 
similar opinion model). Presumably, those ob- 
servers who watched a model who was rele- 
vant ior social comparison in the 75% success 
condition developed higher expectancies for 
success and a lower tolerance for frustration 
than observers exposed to a relevant model 
in the 25% success condition; these differen- 
tial expectancies apparently did not develop 
in those observers who watched an irrelevant 
model for social comparison under these 
success schedules. 

The discrimination hypothesis for partial 
reinforcement effects does not seem applicable 
to the studies described here. This hypothesis 
would explain the greater perseverance of ob- 
servers exposed to a 25% successful model as 
compared to those exposed to a 75% success- 
ful model as due to the fact that the ob- 
server’s performance is more similar to the 
model's performance under 25% success than 
under 759% success. In other words, it took 
the observer longer to discriminate that the 
situation had changed when the model was 
25% successful than when the model was 
7595 successful. However, the conditions for 
such discriminations were the same when the 
similar opinion and different opinion models 
were used, yet the differential effect of a 
model's success was only evident with the 
similar opinion model. Unless one wishes to 
argue that the observer was paying attention 


to the performance of the similar opinion 
model and not attending af all to the per- 
formance of the different opinion model 
(which seems unlikely), it seems as though 
the discrimination hypothesis cannot account 
for the results reported here. 

An alternative to the expectancy-frustra- 
tion analysis is the differential-attraction hy- 
pothesis proposed by Festinger (1961). This 
hypothesis states that goals become more at- 
tractive when they are harder to obtain or 
when they require more effort to achieve. 
In the present context, this hypothesis would 
predict that observers who watched a model 
on a 25% success schedule would view success 
as harder to obtain than observers who were 
exposed to a model on a 75% success schedule. 
Therefore, the observers would find the ex- 
perimental task to be more attractive after 
exposure to the 25% success model than after 
exposure to the 7576 success model, and, as a 
consequence, they would work longer to 
achieve success. However, this differential 
attractiveness of the task for the observer as 
a function of model success could depend 
upon the social comparison process. 1f observ- 
ers do not consider the model's performance 
as an appropriate standard for comparison, 
the task may not develop differential attrac- 
tiveness for observers, depending upon the 
conditions of model success. Consequently, 
this would explain why the observers in the 
different opinion-model groups showed no 
reliable differences in perseverance, while the 
observers in the similar opinion-model groups 
did show such differences. Obviously, further 
research is needed to identify which explana- 
tion for the differential effects of model suc- 
cess on observer perseverance is tenable. 

Given the results of these studies, it ap- 
pears that the discrepancy between the Lewis 
and Duncan (1958) results and those of the 
subsequent studies mentioned above is due to 
the type of model employed in these studies. 
More important, however, is the general con- 
clusion that can be drawn from the studies 
reported in this paper. These studies provide 
evidence that the effect of a model’s success 
on an observer’s perseverance is dependent in 
part on a social comparison process inyolving 
the model as a relevant standard for social 


comparison. 
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ence proxemic behavior. 


culation of spatial orientation be- 
ractants has been postulated by 
3) to be the direct result of cul- 
ing. People of different cultures not 
ak different languages, he stated, but, 
more importantly, inhabit different sensory 
worlds. Thus, during ordinary daily conversa- 
tion, Arabs may stand rather close and di- 
rectly with one another and be highly in- 


volved on a sensory level; an American or 
Englishman would blanch at such intimacy. 
Likewise, subcultures within the larger cul- 


ture are thought to differ in such “proxemic” 
relationships, the effect being to consolidate 
each minority group and to isolate it from 
Others, making intergroup communication 
more difficult (Hall, 1966, pp. 2, 149). The 
principle aim of the present study was to 
examine whether such subcultural variation 
is characteristic of lower-class blacks and 
Puerto Ricans and middle-class whites, as 
well as to test Hall’s assumption that dif- 
ferences in patterns of structuring space are 
basic to culture and subculture, by studying 
spacial orientation behavior among young 
school children. 

Are proxemic relationships a potential 
source of misunderstanding in face-to-face 


1 This study is based on a thesis submitted by the 
first author to Queens College of the City Univer- 
sity of New York in partial fulfillment of the re- 
quirements for the Master of Arts degree. 

D Requests for reprints should be sent to John R. 
Aiello, who is now at the Department of Psychol- 
ogy, Michigan State University, East Lansing, Michi- 
gan 48823. 


D STUDY OF THE PROXEMIC BEHAVIOR OF YOUNG 
)OL CHILDREN IN THREE SUBCULTURAL GROUPS’ 


JOHN R. AIELLO? anv STANLEY E. JONES 
Queens College oj the City University oj New York 


The proxemic relationships of interacting pairs of first- and second-grade 
children from three subcultural groups were observed in school playgrounds. 
Interaction distance and directness of shoulder orientation (axis) were re- 
corded. Middle-class white children stood farther apart than lower-class black 
and Puerto Rican children. Sex differences among white children in distance 
scores and culture and sex differences in axis scores were also found. The results 
suggest that proxemic patterns are acquired early in life and. support the 
contention that differences between the dominant culture and other groups in 
the use of space are basic, with the qualification that sex roles may also influ- 


cross-cultural exchanges? An examination of 
the literature shows that empirical evidence 
on cultural differences is still in question. On 
the one hand, Watson and Graves (1966) 
found differences in spatial orientation (dis- 
tance, shoulder orientation, and eye contact) 
between the Arab and American cultures and 
among geographical regions within these cul- 
tures, and Willis (1966) found some evidence 
of race differences in interaction distance 
within the American culture.* On the other 
hand, Forston and Larson (1968) found no 
significant differences between Latin Ameri- 
cans and Americans,‘ and Jones (1971) found 
striking cultural homogeneity of distance 
scores among four lower-class subcultures in 
New Vork City. 

In light of the inconclusive findings of 
previous research, the present study was de- 
signed not only to provide further data but 
also to incorporate certain features not com- 
bined in previous studies that seemed critical 
to testing Hall's ideas about cultural differ- 
entiation. First, it seemed necessary that the 
cultural groups selected for comparison be 
ones that are supposed to differ from one an- 
other in proxemic behavior, judging from the 


8 The differences that Willis (1966) found between 
blacks and whites (13 inches) were not very great 
in magnitude, however, and were significant at only 
the .09 level. 

4 This could have been, however, the result of the 
nature of the task, a political discussion, since Latin 
Americans have been reported to sit farther apart 
when discussing a serious topic than when engaging 
in social interaction. 
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observations of experts familiar with them. 
Second, it seemed appropriate that children 
rather than adults be studied. Hall maintained 
that the differences between minority groups 
and the dominant American culture are basic 
and are intrinsically related to the core values 
of a culture, one such value being the use and 
structuring of space (Hall, 1966, p. 165). If 
the proxemic behavior of a culture is basic, 
then it should be acquired at an early age. 
Finally, when the authors considered a point 
made by Birdwhistell (1959), that nonverbal 
behaviors are highly sensitive to situational 
factors, it seemed fitting that observations be 
made in an unobtrusive manner in field set- 
tings that were comparable across cultural 
groups. 

The main hypotheses in this study are 
based on the assertion advanced by Hall 
(1966), that lower-income blacks and Puerto 
Ricans are “more highly involved” than white 
middle-class Americans and therefore use a 
closer interaction distance (cf. Hall, 1966, pp. 
172-173). The authors found support for 
Hall’s observation in their conversations with 
teachers in black and Puerto Rican schools 
who pointed out that in lower-class black 
homes, touching was far more common than 
talking, a condition we would expect to cor- 
relate with close interaction distance." Draw- 
ing on this background, the authors made the 
following predictions: 

Hypothesis I: White children will stand 
farther apart than will black children. 

Hypothesis II: White children will stand 
farther apart than will Puerto Rican children. 

The cultural relevance of body orientation 
(axis) has received much less attention from 
Hall and other investigators of proxemic be- 
havior. Although it has been hypothesized 
that blacks are less direct than members of 
other cultural groups (Jones, 1971), Hall 


5'The predictions of the present study seem to 
contradict the finding of Willis (1966), that blacks 
stand farther apart than whites. However, his study 
was conducted with college students, and although 
it is not clear from his report, it would appear that 
the generalizability of his conclusions is limited to 
middle-class blacks. In addition, the validity of 
Willis’ study must be questioned since it was con- 
ducted in a variety of unspecified field settings in 
such a manner that situational factors were not 
controlled. 
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made no specific predictions about differences © 
in axis behavior among the groups observed 


in the present study, and there is no previous 
data confirming such differences. Also, while 
there is some evidence that women are more 


direct in shoulder orientation than men 
(Jones, 1971), it was not known whether such 
sex differences would appear among young 
school children. It seemed appropriate there- 
fore to limit predictions concerning axis data 
in the present study to the following non- 
directional, exploratory hypotheses: 

Hypothesis III: There will be a difference 
between black and white children in axis ori- 
entation. 

Hypothesis IV: There will be a difference 
between black and Puerto Rican children in 
axis orientation. F 

Hypothesis V: There will be a difference 
between males and females in axis orientation. 


METHOD 
Subjects 


A total of 210 dyads were employed in the study. 
Equal numbers of male-male and female-female 
dyads were selected from black, Puerto Rican, and 
white subcultures, such that there were 35 samples 
taken from each sex-culture group. The educational 
levels of the children were the first and second 
grades; the age range was 6-8 years old. The obser- 
vations of black and Puerto Rican children were 
made in a lower-socioeconomic-class elementary 
school in the South Bronx area of New York City. 
The observations of the white children were taken 
at a middle-class elementary school just outside of 
New York City in Nassau County. 


Procedure 


The scales employed were adapted from the 
proxemic notational system devised by Hall (1963). 
Distance was measured in inches. As Jones (1971) 
has suggested, observations were recorded as though 
the interactants were of equal height and their noses 
were pointed straight forward, parallel with their 
shoulders. Axis was defined in terms of the degree 
to which the interactants’ shoulders were turned 
toward or away from one another on an 8-point 
compass-face scale. That is, judgments were scored 
on a scale running from 0 to 8, corresponding to 
the points on a compass. For example, a "zero" 
position represented a direct face-to-face relation- 
ship. A “two” position represented a shoulder orien- 
tation forming a right angle, as though one person 
were parallel to a compass needle pointing due 
North and the person facing him were parallel to 
a compass needle pointing due East. A position in 
which the interactants stand side by side, facing out 
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irection, would be scored “four.” Ob- 
ere recorded to the nearest unit. 

Obse of dyadic interaction were gathered 
in early fall 1969 during recess periods and lunch 
hours playgrounds of each school by two 


: in the s 


servatior 


trained judges." Steps were taken to insure 
against ver bias: The judges were not informed 
of th ses, they observed children other than 
those in own classes, and they were trained in 
the u » measurement scale until their pretest 
scores wi und to be highly reliable. 

The e training sessions each consisted of a 
series observations by both judges on the 
same t ' children of the grade levels being 
studie success of the training sessions is evi- 
dence: he fact that in the final training session, 
out of observations, 18 were in perfect agree- 
ment n the two judges on both axis and 
distanc ss; the remainder were very close. Al- 
though he actual collection of data different 
judges employed in each setting, the high reli- 
ability ieved during the training period and the 
similari of the contexts in which observations 
were r would seem to rule out the possibility of 
consist distortion of judgments in either play- 
ground ion, 


To i ict the teacher judges in the utilization 


of crite for selection of dyads, the first author 
accompanied each of the judges into the field for the 
first two observation periods. The teacher judges 
were insiructed to select only same-sex, same-culture 
dyads engaged in verbal interaction without any 
physical objects or other persons obstructing the 
freedon movement of the subjects. Other teachers 
were consulted if the subcultural identification of 


subjects was in doubt. The teacher judges were also 
instructed to record their observations of dyads at 
the first instant in time when they could ascertain 
that at least one of the interactants was verbalizing 
to the other and both interactants were relatively 
stationary. 


RESULTS 
Cultural Differences in Distance 


An analysis of variance for distance scores 
(see Table 1) showed that there was a signifi- 
cant main effect for culture (p < .001). This 
finding is in line with Hypotheses I and II, 
which stated that white children would stand 


TABLE 1 
. ANALYSIS Or VARIANCE FOR DISTANCE SCORES 
Source "T MS F 
Sex (A) 1 80.476 6.00* 
Culture (B) 2 | 1021757 76.18** 
AXB 2 61.062 4.55* 
Error 204 13.413 
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TABLE 2 
Mean Distance Scores oF SUBCULTURAL 
Groups ny SEX 
| Subculture 
Sex of interactants pot zt - ega i, 
Puerto | Black | White 
Male 5.257 | 5.628 | 13.514 | 8.133 
Female 4.371 | 6.057 | 10.257 | 6.895 
Sexes combined 4.814 11.886 


5.843 


farther part than would black children and 
that white children would stand farther apart 
than would Puerto Rican children. In addi- 
tion to these predicted results, a significant 
main effect was found for sex (p < .05), and 
a significant interaction effect was found be- 
tween sex and culture (p < .05). 

As can be seen in Table 2, the mean ob- 
served distances of the sex-culture groups are 
consistent with the first two hypotheses. The 
mean distances for the white subculture ap- 
pear to be quite dissimilar from those of the 
black and Puerto Rican subcultures. 

The significant interaction effect between 
sex and culture indicates that sex influenced 
cultural differences. Therefore, the appropriate 
test for the first two hypotheses of the study 
is to see if both males and females within 
the black and Puerto Rican subcultures differ 
significantly from both sexes within the white 
subculture. Duncan's multiple-range test was 
used to test for differences between the mean 
distance scores of the sex-culture groups. Ta- 
ble 3 summarizes the results of this test. 

The application of Duncan's test shows 
that Hypotheses I and II concerning the na- 
ture of the cultural differences were confirmed. 
There were no significant differences between 
black and Puerto Rican subcultures among 


TABLE 3 


Douncan’s New MULTPLE-RANGE TEST APPLIED TO 
THE DIFFERENCES BETWEEN SEX-CULTURE 
Distance Means (k = 6) 


Culture group 


Sex pair 
Puerto Rican Black White 
Female-female 4.374 6.06, 10.26, 
Male-male 5.26, 5.63, 13.52, 


_Note.—Means that do not share the same subscript are sig- 
nificantly different (p < .05). 
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TABLE 4 
MEANS, MEDIANS, AND STANDARD DEVIATIONS OF AXIS SCORES OF 
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SUBCULTURAL GROUPS BY SEX 
Subculture 
Cultures combined 

Sex of interactants Puerto Rican Black White 

M Mdn SD M Mdn SD M Mdn SD M Mån | ) 
Male .629 | .295 | 1.087 | .629 | .421 | .873| .543 | .229 | .980| .610 | .308 | .976 
Female .657 | .261 | 1.162 | .914 | .375 | 1.223 | .657 | .295 | 1.027 | .743 | .308 | 1.135 
Sexes combined .643 | .278 | 1.117 | .786 | .397 | 1.062 | .600 | .261 | .998| — — 


males or females. On the other hand, males 
and females within the white subculture did 
differ significantly from each of the black and 
Puerto Rican sex-culture groups. It should 
also be noted that Duncan's test showed a 
significant difference between sexes within the 
white subculture, but no significant difference 
between the sexes within either the black or 
the Puerto Rican subcultures. The implica- 
tions of this latter set of relationships are 
examined below. 


Sex and Cultural Differences in Axis 


Hall's proxemic notational system for axis, 
employed in this study, was found to be a 
limited measurement instrument for the study 
of young children. The 8-point measurement 
scale did not allow for fine distinctions at 
the lower end of the scale in which most of the 
Observations took place, as can be seen in 
Table 4. This tendency of the distribution to 


TABLE 5 


Axis Scores: SUMMARY or KRUSKAL-WALLIS ONE- 
Wav ANALYSIS OF VARIANCE VALUES 
FOR RANKED DATA 


be skewed to the right suggested that the 
sumption of normality necessary for the use « 
the parametric F test could not be met in 
these data. Therefore, the nonparametric 
Kruskal-Wallis one-way analysis of varian 
which utilizes rank-order scores, was chose 
Table 5 presents a summary of the 11 con 
parisons necessary to test the hypotheses CO: 
cerning axis. 

Upon finding a significant overall nce 
between the six sex cultures (H= = 28 ge 
= 5, p < .001), comparisons were made B: 
tween males and females within each of the 
three subcultures: Puerto Rican, black, ani 
white. No significant difference was found 
within any culture, justifying the testing for 
a significant difference between the three sub- 
cultures combining sex groups within sub 
cultures. A significant difference was obtained 
among Puerto Rican, black, and white chil- 
dren (H = 28.22, df=2, p< 001), which 
then permitted individual tests to be made 
between each pairing of subcultures. 

The comparison between black and white 
subcultures yielded a significant difference 


Comparison H value | n 
Overall 28.57** | 210 
Black (male-female) 40 70 
White (male-female) 29 70 
Puerto Rican (male-female) 12 70 
Black, Puerto Rican, and White 28.22** | 2310 
(combining sex groups) 
Black-White 6.06* | 140 
Black-Puerto Rican 1.25 | 140 
White-Puerto Rican O01 | 140 
Males (across cultures) 1.00 | 105 
Females (across cultures) 97 |105 
Male-Female (combining culture 22.67** | 210 
groups) 
*p <.05 
** > <.001 


(H= 6.06, df — 1, p< 05), thereby con- 
firming Hypothesis III, which predicted that 
there would be a difference between black 
and white children on axis orientation. Table 
4 shows that the black children stood less 
directly than did the white children. How- 
ever, there was no significant difference be- 
tween the Puerto Rican and black subcultures 
in axis orientation, contrary to Hypothesis 
IV. As expected, there was no significant dif- 
ference between Puerto Rican and white chil- 
dren. 

Tests for differences were then performed 
among females across subcultures and among 


PaoxEMIC BEHAVIOR OF CHILDREN IN SUBCULTURAL GROUPS 


, males acr 


- subcultures, respectively. Find- 


ing no icant differences among females 

or amo: les, a comparison was made be- 
tween and females, ignoring culture. 

E This fi: nparison did produce a signifi- 
cant ce between males and females 
(7 = df=1, p< .001), confirming 

; Hypot! predicting sex differences. An 
inspec the means in Table 4 indicates 
that i: h culture the mean female axis 
score er than the mean male axis score. 

DISCUSSION 

The ta support the hypotheses that 
white en stand farther apart than do 
black Puerto Rican children. This con- 
@ firmation of the hypotheses supports Hall's 
contention that there are cross-subcultural 
proxen lifferences. By demonstrating that 
differen n subcultures were found to occur 

as early 2s ages 6-8, support is given to Hall's 
reason ig that these “differences in the struc- 
turing of space between minority groups and 
the dominant white culture are basic [1966, 

p. 165},” since they are acquired early in life. 
Two rival explanations for the differences 

i between the cultural groups should be consid- 
ered. list, since verbal behavior was not 
monitored in the study, the possibility is left 


open that the lower-class and middle-class 
children were involved in two separate kinds 
of interaction, which might be differentially 
related to proxemic behavior. However, the 
fact that all of the children were observed 
in settings that reinforce “play” activities 
: reduces the likelihood of this sort of variation. 
The authors are presently involved in re- 
search designed to control such variables as 
topic of conversation. A second alternative 
explanation would be that the density of 
children varied from one playground to the 
other. Greater restriction on the space avail- 
able for interaction in the black and Puerto 
Rican school presumably might account for 
closer interaction distances. Although there 
did appear to be somewhat less space per 
pupil in the ghetto playground, the difference 
was small. The children in both schools were 
able to move about in rather large areas. In 
addition, the judges were instructed to record 
only those interactions where freedom of 
movement was possible. 
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While axis scores between black and Puerto 
Rican groups were in the predicted direction 
(.2 < p < .3), no significant differences were 
found for either axis or distance scores be- 
tween these subcultures. One possible expla- 
nation for this finding is that since the black 
and Puerto Rican children observed in the 
study attended the same school, they may 
have had considerable interaction with one 
another and become more nearly alike in 
proxemic behavior than would ordinarily be 
the case. Further research employing schools 
that have more homogeneous black or Puerto 
Rican populations is desirable in order to 
better assess whether children in these sub- 
cultures do differ. The results of the present 
study seem to reinforce the conclusion of a 
previous study of adults, that members of 
poverty subcultures tend to be rather similar 
to one another in spatial orientation behavior 
(cf. Jones, 1971). 

A further implication of the study is that 
an “equilibrium level" of nonverbal behavior 
may exist, not just for individuals, as Argyle 
and Dean (1965) have postulated, but for 
cultures as well. While white children were 
found to stand at a greater distance during 
interaction in the present study, black and 
Puerto Rican children appeared to stand at a 
less direct angle. It could be that although 
some homogeneous degree of nonverbal ex- 
pression prevails across cultures, the particu- 
lar mode of expression varies from culture to 
culture. 

Analysis of the distance data revealed two 
unpredicted results: a significant main effect 
of sex and a significant interaction of sex and 
culture. As can be observed from the mean 
distance scores (see Table 2), these effects 
are attributable primarily to the rather large 
differences between sexes in the white subcul- 
ture (white males stood farther apart than 
white females) and the apparently negligible 
sex differences in the black and Puerto Rican 
subcultures. When viewed in light of the find- 
ings of Willis (1966), that adult males tend 
to stand farther apart than females, this out- 
come of the present study suggests the possi- 
bility that sex differences in proxemic orienta- 
tion may be acquired at an earlier age among 
middle-class white children, perhaps because 
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of greater emphasis placed on appropriate sex- 
role behavior in their homes. 

The finding that females were less direct 
than males appears to contradict the con- 
clusion reached by Jones (1971), that men 
maintain a wider axis than women, although 
it may be that 6-8-year-old children have a 
different code from that of adults. It will be 
recalled, however, that the great majority of 
the axis scores fell in two categories, “zero” 
and “one,” so that no fine discriminations 
could be made. As can be observed in Table 
4, the median axis scores for males and fe- 
males are equivalent. This suggests that con- 
clusions drawn from the axis data should be 
regarded as highly tentative. It also shows 
that a more refined instrument than Hall's 
8-point compass-face observational system is 
necessary for measurement of children's 
proxemic behavior. Jones has proposed the use 
of a 12-point scale, corresponding to the hours 
of a clock. This system could very well be 
further divided into hour and halí-hour inter- 
vals, resulting in a more useful 24-point scale. 


CONCLUSION 


Taken together, the results of the study 
provide support for the widely held notion 
that subcultures tend to differ in proxemic be- 
havior. Puerto Rican and black children were 
found to stand closer than middle-class white 
children, and there was some evidence that 
they also stood less directly. In addition, the 
data suggest that sex differences in interac- 
tion distance may vary from culture to cul- 
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ture, at least at the age level investigated. It 
should be noted that these results are subject 
to the interpretation that variation among the 
groups studied is rooted in social class rather 
than culture per se, since the lower-class black 
and Puerto Rican children were not signifi- 
cantly different from one another. The fact 
that the differences were found among young 
children, however, does suggest that distinct 
proxemic patterns may be basic to member- 
ship in minority groups or the dominant mid- 
dle class within the American culture. 
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) Naive subjects served as “students” in a study purporting to examine educa- 
tional practices. The “teacher” was an experimental accomplice who adminis- 
tered monetary rewards on either a 40% or a 100% reinforcement schedule. 

The magnitude of rewards was adjusted so that all subjects received the same 
amount of money during a work session. During a recess which occurred at 
the conclusion of the first of two work sessions, subjects who had received 
10% reinforcement (a) attributed greater freedom to the teacher, (b) were 
more attentive to cues concerning the teacher's dispositional qualities, and (c) 
manifested a greater tendency to ingratiate themselves to the teacher. (The 
last finding was only marginally significant.) Results are interpreted as indi- 
cating that reinforcements are communicative acts that inform the recipient of 
the probable intentions, attitudes, and freedom of the agent who administers 
them. The recipient’s reactions to reinforcements appear to depend in part 
upon cognitive and evaluative processes that are prompted by the particular 
reinforcement schedule to which he is subjected. 


In a vocent literature review, Steiner (1970) 
examined a wide array of research and the- 
ory suggesting that the reactions an indi- 
vidual evokes from his associates depend in 
part upon the amount of freedom they at- 
tribute to him. Thus, for example, evidence 

. indicates that associates are less likely to 
impute dispositional qualities to the individual 
when his observable behaviors are thought to 
be role directed than when they appear to be 
performed without external coercion or re- 
straint, Under the former condition, associates 
are also less inclined to be persuaded by the 
individual’s advocacy of an attitudinal posi- 
tion, and less prone to form strongly positive 
or negative appraisals of him. Furthermore, 
associates are more likely to react in an ingra- 
tiating manner if the individual seems to en- 
joy a margin of freedom to decide whether or 
not he will administer the payoffs he controls. 
The present study continues the investigation 
of attributed freedom. It tests the contention 
that a reinforcing agent’s manner of admin- 


1'This study was conducted as master’s research 
by the first author under the supervision of the sec- 
ond author, It was supported by Grant M-4460 from 
the United States Public Health Service, National 
Institutes of Health, 

2 Requests for reprints should be sent to Ivan D. 
Steiner, who is now at the Department of Psychol- 
ogy, University of Massachusetts, Amherst, Massa- 
chusetts 01002. 


istering rewards and punishments is construed 
by his associates as revealing his margin of 
freedom, and that associates are more atten- 
tive to cues concerning a reinforcing agent’s 
dispositional qualities, and more inclined to 
ingratiate themselves to him, when he employs 
reinforcement techniques that imply a high 
degree of freedom. 

In previous studies, researchers have in- 
duced the perception that an actor has little 
freedom by publicly assigning him to a role 
(Jones, Davis, & Gergen, 1961; Steiner & 
Field, 1960), by telling subjects that the actor 
has been instructed to produce a message fa- 
voring a specified point of view (Jones & 
Harris, 1967), or by creating conditions under 
which it appears unlikely that the actor pos- 
sesses the ability, knowledge, or environmental 
support that is needed in order to change his 
behavior (Berkowitz, 1962; Burnstein & 
Worchel, 1960; Cohen, 1955; Lanzetta & 
Hannah, 1969; Pastore, 1952; Weiner & 
Kukla, 1970; Wiggins, Dill, & Schwartz, 
1965). In only a few studies of the third type 
has the actor’s own behavior been manipu- 
lated to create the impression that he pos- 
sesses a wide or narrow margin of freedom. 
The authors have discovered no instance in 
which the rewarding behaviors of a reinforc- 
ing agent have been systematically manipu- 
lated for this purpose. 
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If a person is known to lack freedom, we 
are likely to expect him to behave in a rather 
standard and consistent manner. But if a per- 
son is known to behave in a highly consistent 
and predictable fashion, will we conclude that 
he therefore lacks freedom? If a reinforcing 
agent always rewards one type of response, 
and always punishes another, will he be seen 
as enjoying less freedom to determine his own 
reinforcement strategy than one who adheres 
to a schedule of variable reinforcement? Such 
may be the case for either of two reasons: 
(a) Routine behavior tends to acquire an aura 
of normativeness (Thibaut & Kelley, 1959; 
Waller & Hill, 1951); or (5) behaviors that 
seem typical or appropriate tend to be under- 
stood as consequences of situational con- 
straints or social rules, and are not thought 
to be prompted by the actor’s personal pref- 
erences (Jones & Davis, 1965). 


Hypothesis I 


An individual whose reinforcing behaviors 
are highly consistent across trials (e.g., 100% 
reinforcement) will be seen as possessing less 
freedom to select his own reinforcing strategy 
than will an individual who employs a less 
consistent schedule (e.g., 40% reinforcement). 

It has been reported (see Jones & Davis, 
1965; Steiner, 1970) that subjects are more 
confident of their evaluations of persons whose 
behaviors appear to be freely performed than 
of those who are thought to act under con- 
straints. Consequently, if Hypothesis I is con- 
firmed, a second hypothesis may be advanced. 


Hypothesis II 


Subjects receiving inconsistent reinforce- 
ment will manifest greater confidence in their 
ratings of the agent's dispositional qualities 
than will subjects receiving consistent rein- 
forcement. 

When a reinforcing agent seems free to do 
as he pleases, people on whom he may bestow 
his rewards are well advised to discover what 
pleases him. Thus, if Hypothesis I is sup- 
ported, we may anticipate that an inconsis- 
tent reinforcement schedule will prompt 
greater attentiveness to cues concerning the 
agent's beliefs and preferences than will a 
consistent reinforcement schedule. 
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Hypothesis III 


After exposure to cues concerning a rein- 
forcing agent's dispositional qualities, subjects 
who have received inconsistent reinforcement 
from the agent will report these cues more 
accurately than will subjects who have re- 
ceived consistent reinforcement. 

According to Jones (1964), Jones, Gergen, 
Gumpert, and Thibaut (1965), and Kauff- 


mann and Steiner (1968), subjects are unlikely ^ 


to ingratiate themselves to a power figure who 
has little freedom to decide when or how his 


power will be employed. However, ingratiation ' 


is a common tactic for gaining payoffs from 
power agents who are thought to enjoy 
margin of freedom concerning the use of thei 
power. Assuming support for Hypothesis Iw 
may predict the following relationship. 


Hypothesis IV 


Subjects exposed to inconsistent reinforce- 
ment will manifest a greater tendency to in- 
gratiate themselves to the reinforcing agent 
than will those who receive consistent rein- 
forcement. 

The consistency with which a reinforcing 
agent administers rewards and punishments is 
not the only factor that may influence the 
amount of freedom he is presumed to enjoy. 
Tf the persons over whom the agent exercises 
his control hear or see a higher authority in- 


struct the agent when to use his reinforcing - 


capacities, they are unlikely to attribute much 
freedom to him. In this research, both the 
reinforcement schedule of the agent and re- 
ceipt versus nonreceipt of instructions were 


manipulated. If the receipt of instructions - 


affects perceived freedom in the same way as 
does consistent reinforcement, Hypotheses II, 
III, and IV may be rephrased to express the 
impact of the former variable. Thus, subjects 
who receive reinforcements from an instructed 
reinforcing agent should (a) express little 
confidence in their evaluations of the disposi- 
tional characteristics of the agent; (b) be 
inattentive to cues concerning his dispositional 
qualities, and therefore inaccurate in recalling 
such cues; and (c) be uninclined to ingratiate 
themselves to him. 


REINFORCEMENT SCHEDULES AND ATTRIBUTED FREEDOM 


METHOD 

Subject 1 Design 

Ninety le students enrolled in the elementary 
psychology c at the University of Illinois served 
as subjects in the experiment. They received course 
credit cir participation, Two subjects were 
elimin rom the analysis; one expressed sus- 
picion ng the true identity of the accomplice, 
and on unable to complete the required ques- 
tionnaires and procedures within the 1-hour time 
allotm 2 factorial design was employed. 
Half of > subjects received continuous reinforce- 
ment an experimental accomplice, while the 
remain received variable reinforcement. Within 
each of | two categories of subjects, half saw the 
accomplice receive instructions concerning the man- 
ner in which reinforcements should be adminis- 
tered, and half did not. Twenty-two subjects were 
random! ssigned to each cell of the design. 
Procedure 

The ject and accomplice (a male student) 
waited in a corridor until the experimenter appeared 
and ushered them into the experimental room, They 


were seated at a table and asked to respond to 
eight items from the California F Scale by placing 
checkma:ks on 5-inch graphic scales. The end points 
of the s were anchored by "agree very strongly" 

sœ very strongly.” After this instrument 
participants responded to the Rotter 
<ternal (I-E) scale (Rotter, 1966). 
imenter then explained that the major 
the experiment was to examine the effects 
Ps behavior on a student's performance. 
In par r, the experimenter claimed to be in- 
terested in identifying the kind of instructional be- 
haviors that improve a student's performance. It 
was announced that one subject would be randomly 
assigned the role of teacher and ome subject that of 
a student. The “random” selection process was à 
rigged lottery, through which the accomplice was 
always assigned the role of teacher. 

Participants were then moved to a table on the 
opposite side of the room. On this table were two 
piles of twenty-five 3X 5 cards labeled “anagram 
task 1” and “anagram task 2” and a metal box. The 
experimenter informed the participants that during 
the course of the experiment they would complete the 
two anagram tasks and that an intermission would 
be interspersed between the two tasks. The second 
anagram task was an experimental ploy and was 
never actually utilized. 

The metal box was placed near the teacher, and 
the first deck of anagrams was placed behind it, 
blocking the student's view of the anagrams. After 
a brief explanatory introduction to the anagram task, 
the teacher was instructed to present the anagrams 
one at a time to the student. The experimenter dem- 
onstrated with the first four cards. 

The student was instructed to solve the anagrams 
as quickly as possible and to state his answer aloud. 
It was made clear that the formation of any English 
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word using each letter once, within the 20-second 
time limit, would be considered a correct response. 
The experimenter asserted that he would time the 
student on cach anagram and would state after each 
response whether it was correct or incorrect. Five of 
the 25 anagrams were insolvable. The other 20 had 
been selected aíter numerous pretests had indicated 
that they were easy enough to be readily solved by 
college students. 

The experimenter opened the metal box, which was 
so located that its contents could not be seen by the 
subject, and said: 


Teacher, another of your duties is to dispense 
reinforcements to the student in the form of coins 
from this box. Student, any money you have left 
over at the end of this task is yours to keep, and, 
similarly, any money you have at the end of the 
second anagram task will also be yours to keep. 
Teacher, you will also make some money; how- 
ever, you are being paid a set amount for partici- 
pation in the study. Your money has already been 
set aside for you. 


Receipt of reinforcement rides. In half of the ses- 
sions, the teacher (accomplice) was told to rein- 
force the student according to a schedule provided 
by the experimenter. He was given a card ostensibly 
containing reinforcement rules and told: “I want 
you to follow these instructions explicitly in your 
rewarding and punishing of the student with this 
money. Keep the instructions with you and refer to 
them at any time you need to. (During the ana- 
gram task, the accomplice frequently referred to the 
instructions.) Although the subjects saw the teacher 
receive the card of instructions, they had no way of 
knowing how specific or detailed the rules were. 

Nonreceipt of reinforcement rules. In half of the 
sessions, the teacher was not given a card bearing 
instructions, but was told: 


You are to use this money in any way you think 
will be most effective in improving the student’s 
performance. During the course of the anagram 
task, you can give the student money at any time 
or take money back at any time. I leave it com- 
pletely to your own discretion to create your own 
schedule of reward and punishment. 


Continuous reinforcement. In half of the sessions, 
the teacher reinforced the student according to a 
continuous schedule, rewarding the student with a 
dime for each anagram correctly solved and punish- 
ing him for each incorrect solution by taking away 
a dime. Eighty percent (20) of the reinforcements 
were positive, and 20% (5) of the reinforcements 
were negative, leaving the student with the total 
earnings of $1.50. 

Variable reinforcement. In half of the sessions, the 
teacher reinforced the student on à predetermined 
variable ratio (2.5) schedule, rewarding the student 
with a quarter eight times and twice punishing the 
student by taking a quarter away. Therefore, al- 


3' The variable ratio schedule was identical for all 
subjects. 


-— 
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though the reinforcement schedule and magnitude of 
reinforcement varied between conditions, within each 
condition 80% of the reinforcements were positive 
and 20% were negative, and all subjects earned 
$1.50.4 : 

Upon completing the anagram task, the subject 
was assured once again that he could keep his earn- 
ings and was encouraged to put the money in his 
pocket. The experimenter then administered the 
Effectance Arousal scale (Byrne & Clore, 1967) to 
both participants, under the guise of examining 
their reactions to the first anagram task. Following 
the Effectance Arousal scale, the accomplice was in- 
formed that since he was serving as teacher in the 
experiment, the experimenter would like to learn 
about his “personality” and “some of his ideas con- 
cerning teaching." The student was handed a Life 
magazine and told that he could relax during this 
brief intermission and look at the magazine if he 
wished. However, the experimenter reminded the 
subject that he would soon be working on the 
second “money-making anagram task” which would 
be identical in form to the first task. 

The teacher was ushered to a table at the opposite 
side of the room and seated alongside the experi- 
menter with his back to the student, The student, 
however, was close enough to hear the ensuing 
conversation and was thus free to ignore the maga- 
zine and eavesdrop if he chose to do so, The inter- 
view, which was conducted in subdued tones, began 
with an “association test.” The experimenter orally 
presented a series of stimulus words, and the ac- 
complice responded with a prearranged response to 
each stimulus. Half of the items resembled those of a 
typical “association” test (e.g., Love-girl), while the 
others dealt with the student-teacher situation (e.g., 
ERROR-mistake). All of the prearranged responses 
seemed to the authors to be consistent with the 
stimuli with which they were paired. 

At the completion of these items, the experimenter 
stated that he would like to obtain information con- 
cerning the teacher's teaching philosophy. To ac- 
complish this end, the experimenter asked the teacher 
six short questions concerning his teaching philosophy 
(eg. What is the most effective way of improving 
learning: reward or punishment?). To each question, 
the accomplice responded with a predetermined one- 
or two-sentence answer. These questions and the 
accomplice's answers have been reported elsewhere 
by Davidson (1970), as has the list of stimulus and 
response words mentioned above. 

Following the interview, the student's magazine 


*Four subjects were unable to solve one of the 
solvable anagrams: two in the variable ratio, no- 
instructions condition and one in each of the con- 
tinuous reinforcement conditions. Missing an ana- 
gram in a continuous reinforcement condition re- 
sulted in the subject earning only $1.30, whereas it 
had no effect on earnings in the variable ratio condi- 
tions. These four subjects were not dropped from 
the analysis; their scores on dependent variables 
were consistent with those of subjects responding 
correctly to all anagrams. 
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was collected, and participants were infor 
the experimenter was also interested in finding 
whether teacher-student pairs who know 
other perform better on an intellectual 
teacher-student pairs who do not. Participant: 
told that they would have a chance to bec 
acquainted through the use of a highly controlled 
conversation technique (Kauffmann & Steir 1968). 


If we just allow two people to talk to - fora 


few minutes, some groups of people w talk 
about one thing, and other groups would talk 
about something entirely different. To ensure that 
all our groups talk about the same thing, : will 
give you a questionnaire on which you are to 
indicate a few of your beliefs or attitudes, v. u are 
each receiving different questionnaires, but after 
you have filled them out you will exchange papers 
to enable you to see how the other person feels 


about the issues. Then you will each answer the 
same questions the other answered, and will 
exchange papers again. This way you w 
something of the other person’s attitudes just as 
you might in a normal conversation. 


The accomplice received a questionnaire containi 
the eight F-scale items that both participants ! 
completed at the outset of the experimental session, 
whereas the subject received a form containing eight 
different items from the F scale. The accomplice re- 
sponded to his scale by checking spaces that had 
already been lightly marked by the experimenter.’ 
When the student and teacher exchanged papers, the 
student learned that the teacher held views that were 
systematically different from his own. Because the 
student had been instructed to place checkmarks on 
the teacher’s paper in order to indicate how he per- 
sonally felt about each of the items, and because he 
realized the teacher would see the answers he sup- 
plied, an opportunity was created for the student to 
ingratiate himself to the teacher through the use of 
conformity, 

Following completion of the “controlled conversa- 
tion,” the experimenter said that he wanted the 
teacher to fill out a questionnaire that was different 
from the one the student would be receiving and 
involved very different instructions. Consequently, 
the experimenter informed the teacher that he would 
prefer to have him complete this questionnaire in 


*While the participants were completing the I-E 
scale, the experimenter had placed checkmarks on an 
unmarked copy of the eight-item form he had col- 
lected from the subject. Two of the marks were lo- 
cated at exactly the same positions on the 5-inch 
graphic scales as were the subject’s initial responses. 
Answers to three items were displaced 14 inches in 
the direction of greater authoritarianism, and the 
other three were displaced 1 inch in the same direc- 
tion, Direction of discrepancy was not varied be- 
cause previous research (Kauffmann & Steiner, 1968) 
had shown that although direction of discrepancy 
does indeed affect amount of ingratiation, there is no 
significant interaction of direction with a number of 
other variables. 
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TABLE 1 


> MEANS OF THE PERCEIVED 
FREEDOM SCORES 


AND ATTRIBUTED FREEDOM 361 


TABLE 2 


ANALYSIS OF VARIANCE OF THE PERCEIVED 
FREEDOM SCORES 


Reinforcement schedule 


Variable ratio | Continuous 

Receive ions 11.64 5.09 
Did not e instructions 13.27 4.86 
another He then ushered the teacher into an 
adjacen . returning immediately to the experi- 
mental ;. The subject was presented with an 
instrum: asking him to recall the teacher's re- 
sponse oth the questions concerning teaching 
philoso; nd to the association test. There were 
also it »cerning the subject's perception of the 
teacher 

Upon coopletion of this questionnaire, the student 
was to! t there did not seem to be enough time 
availabl: for the second anagram task, He was 
allowed keep his earnings and was informed that 
he had cove well on the anagram task. A letter was 
sent to ; subject at the completion of the re- 
search, vibing the experiment and its results, and 
explain: he necessity for the deception. 

RESULTS 

Percepiica of Freedom 

Perception of the other's freedom was mea- 
sured on two 9-point scales: “How free did 


the teacher feel to withhold reward when you 
gave a correct answer to an anagram?" and 
“How free did the teacher feel to withhold 
punishment when you made a mistake on an 
anagram?” The end points of both scales were 
anchored by (1) “not at all free” and (9) 
"very free," Because the responses to these 
items were highly correlated (r= .79 across 
cells; within cells, the lowest correlation was 
.52), the scores were summed to yield a 
single measure of perceived freedom. 

Table 1 shows the group means, and Table 
2 summarizes the results of a 2 X 2 analysis 


Source df as XE p< 
Between 
Instructions (A) 1 10.92 EY A | 
Schedule (B) 1 1,230.01 | 82.01 | .001 
AXB 1 19.10 | 1.27 | .27 
Within 84 14.99 


of variance performed on the perceived free- 
dom scores. There was a significant main effect 
of reinforcement schedule (5 < .001), indicat- 
ing that the mean perceived freedom score 
was significantly higher for the variable ratio 
conditions (1245) than for the continuous 
conditions (4.98). Neither the interaction 
effect nor the main effect of instructions was 
significant. 

A Newman-Keuls test performed on the 
cell means indicated that (a) the variable 
ratio, no-instructions group mean was signifi- 
cantly larger than the continuous reinforce- 
ment, no-instructions group mean (p < 01); 
and (b) the variable ratio, instructions group 
mean was significantly larger than the con- 
tinuous reinforcement, instructions group 
mean (p < .01). The reinforcement schedule 
manipulation accounted for 48% of the vari- 
ance in the perceived freedom ratings. Thus, 
there is support for the first hypothesis, 
which predicted that an agent whose rein- 
forcing behaviors are highly consistent will be 
seen as possessing less freedom than an agent 
who employs a variable schedule of reinforce- 
ment. 

The impact of reinforcement schedule is 
also revealed by the subjects' responses to a 
request that they “describe the teacher as he 
really is." Table 3 reports the anchor points 
and values of the ratings subjects gave on 


TABLE 3 


Mean RATINGS OF THE REI 


nrorcinc AGENT MADE BY Four TREATMENT GROUPS 


Variable ratio, 


Group no instructions 


- ———— 


Predictable-unpredictable 6 
Programmed-original 6 
Rigid-flexible 6: 
Consistent-changeable 6 


Continuous 
reinforcement, 
instructions 


Continuous 
reinforcement, 
no instructions 


Variable ratio, 
instructions 


54 1,7 34 
3.6 33 3.5 
44 2.7 3.9 
5.0 1.8 3.2 


Note,— The four scales ranged from 1 to 9. 
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TABLE 4 


Group MEANS FOR THE NUMBER OF 
Free Associates RECALLED 
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TABLE 5 


ANALYSIS OF VARIANCE OF THE NUMBER 
OF FREE ASSOCIATES RECALLED 


| Reinforcement schedule 
Condition | 


Variable ratio | Continuous 


4.36 3.14 
5.18 3.45 


Received instructions 
Did not receive instructions 


four 9-step scales. A separate analysis of vari- 
ance was performed for each item and revealed 
a significant main effect for reinforcement 
schedules (p < .01) and a Significant In- 
structions X Reinforcement interaction (p < 
.01) for each. It is clear that subjects who 
experienced the variable conditions described 
the teacher as more unpredictable, original, 
flexible, and changeable than did those who 
were exposed to continuous reinforcement. 
However, the effect of the teacher’s reinforce- 
ment schedule tended to be greater when he 
had not received instructions than when he 
had. Although these findings do not deal ex- 
plicitly with perceived freedom, they are con- 
sistent with the contention that variable ratio 
reinforcement favors the attribution of free- 
dom to the agent who dispenses it. 


Confidence in Ratings 


As predicted by attribution theory, and in 
partial support of the second hypothesis, the 
subjects indicated greater confidence in their 
evaluations of the teacher when he had re- 
ceived no instructions (X = 6.47) than when 
he had presumably been told how to admin- 
ister reinforcements (X = 5.5, p < .02). Hy- 
pothesis II also predicted that subjects would 
be more confident of their evaluations of an 
agent employing a variable schedule of rein- 
forcement than of one employing a continu- 
ous schedule. However, the means differed in 
the direction opposite to that specified by 
this prediction (continuous reinforcement, X 
= 6.30; variable ratio, X = 5.60; p < .10). 
Confidence was measured on a 9-step graphic 
scale. 


Attentiveness 


Attentiveness to cues concerning the teach- 
er's beliefs and preferences was assessed by 
two measures of incidental learning: (a) the 


Source df MS F $< 
Between 
Instructions (A) 1 7.10 1.92 Eyi 
Schedule (B) 1 | 48.10 2.99 .001 
AXB 1 1.38 37 55 
Within 84 3.70 


number of teacher’s associates correctly re- 
called and (5) the number of the teacher's 
responses to the teaching philosophy inter- 
view correctly recalled. 

The maximum score for the association test 
was 10. Table 4 shows the group means for 
the number of the associates recalled. Table 5 
reports a main effect for reinforcement 
schedule (p < .001), indicating that the mean 
number of recalled responses was significantly 
higher for the varie ralio conditions (4.77) 
than for the continuous conditions (3.29). 
Neither the interaction effect nor the main. 
effect of instructions was significa: 

A Newman-Keuls test revealed that (a) 
the variable ratio, no instructions group mean 
was significantly larger than the continuous 
reinforcement, no instructions group mean 
(p «€ .01); and (b) the variable ratio, instruc- 
tions group mean was significantly larger than 
the continuous reinforcement, instructions 
group mean (f «.05). The reinforcement 
schedule accounted for 12% of the variance 
in the number of associates recalled. 

Six questions concerning the teacher's re- 
sponses during the teaching philosophy inter- 
view were presented to the subject in the form 
of a multiple-choice test. For each item, there 
were three possible choices; thus, the chance 
level of correct responding was represented 
by a score of 2. Table 6 shows the group 
means for the number of teaching philosophy 
responses correctly identified. The results of 
the analysis of variance summarized in Table 
7 reveal (a) a significant main effect for rein- 
forcement schedule (5 —.001), indicating 
that the mean number of correct responses 
was significantly higher for the variable ratio 
conditions (3.71) than for the continuous con- 
ditions (2.53); and (b) a marginally signifi- 
cant main effect for instructions (p < .06), 
indicating that the number of correct re- 
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the without-instructions condi- 


sponses 
tions was higher than for the with- 
instru conditions (2.82). 

AN n-Keuls test revealed that (a) the 
variab! ), no instructions group mean was 
signifi larger than the continuous rein- 
forcem: ; instructions group mean (f 
< .01 ! (b) the variable ratio, instruc- 
tions nean was significantly larger than 
the uous reinforcement, instructions 
grouj (p < .05). The schedule manip- 
ulati the instructional manipulation ac- 
count 1496 and 2% of the variance of 
the d ent variable, respectively. 

Th indings support Hypothesis III and 
are c ent with the contention that sub- 
jects are more attentive to cues emitted by 
agent (forcing on a variable schedule than 
to ag reinforcing on a continuous sched- 
ule. e is also marginal support for the 
conte: ton that subjects are more attentive to 
agents who have not publicly received rein- 
forcent rules than to those who have been 
instructed to behave in a given manner. 
Ingra 10n 

Each subject responded twice to the same 


eight-item version of the F scale, once at the 
outset of the experimental session and again 
after seeing the responses of the accomplice. 
In the second instance, the subjects knew that 
their ratings would subsequently be viewed by 
the accomplice. To compute the conformity 
score, the 5-inch graphic scales were divided 
into twenty }-inch segments. The subject’s 
conformity score for each item was obtained 
by counting the number of segments he moved 
his second response in the direction of the 
accomplice’s response. 

A 2 x 2 analysis of variance performed on 
the conformity scores is summarized in Table 


TABLE 6 


Group MEANS FOR THE NUMBER OF TEACHING 
Puitosoemv RESPONSES CORRECTLY 
IDENTIFIED 
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TABLE 7 


ANALYSIS OF VARIANCE OF THE NUMBER OF TEACHING 
PHILOSOPHY Responses CORRECTLY IDENTIFIED 


Source a | 


MS F p< 
Between 
Instructions (A) 1 7.68 3.85 054 
Schedule (B) 1 | 30.73 15.39 001 
AXB 1 43 36 55 
Within 84 | 2.00 


9. It reveals marginally significant main ef- 
fects of the instructional manipulation (p < 
06) and reinforcement schedule ($ < .10). 
The differences between the means reported 
in Table 8 are in the predicted directions, 
with more conformity being manifested by 
the subjects receiving the variable ratio and 
no-instructions treatments. However, the 
variances within cells are sizable, and the 
obtained findings are interpreted as constitut- 
ing weak but consistent support for Hypothe- 
sis IV. The instructional manipulation and 
the schedule manipulation accounted for 3.1% 
and 2% of the variance, respectively. 


Attentiveness and Ingratiation as Correlates 
of Perceived Freedom 


It was suggested above that (a) the degree 
of freedom attributed to a reinforcing agent 
depends upon the schedule of reinforcement 
he follows and upon whether or not he is be- 
lieved to have been assigned a specific role 
and that (5) attentiveness and ingratiation on 
the part of the reinforced individual are di- 
rect functions of the amount of freedom at- 
tributed to the agent. Data presented in Ta- 
bles 1 and 2 strongly support the contention 
that perceived freedom is affected by the 
agent’s reinforcing behaviors, but do not re- 
veal a significant impact of reinforcement 
instructions. Table 10 reports the correlations 


TABLE 8 


MEAN CONFORMITY AND STANDARD 
DEVIATION PER CELL 


Reinforcement schedule Variable Variable Continuous Continuous 
m y reinforce- | reinforce- 
Condition — Statistic | ratio, no |. ratio, ub no ment, 
Variable ratio | Continuous instructions) instructions | instructions | instructions 
Received instructions 3.32 2.32 x oe ver 25 2 
Did not receive instructions 4.09 2.73 SD : É È á 


1 
1 
4 
A 
l 

1 
*] 
7 
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TABLE 9 


ANALYSIS OF VARIANCE OF THE CONFORMITY SCORES 


Source df | 


MS F p< 
Between 
Instructions (A) 1 180.41 3.88 .052 
Schedule (B) 1 127.68 | 2.76 .10 
AXB 1 11.64 .25 
Within 84 | 46.40 


between perceived freedom and measures of 
attentiveness and ingratiation. 

For the entire sample of subjects, each of 
the obtained correlations was positive and sig- 
nificant. For the subjects receiving specific 
reinforcement treatments, the correlations 
were, of course, attenuated by the narrow 
range of scores on perceived freedom. (Rein- 
forcement schedule accounted for 4846 of the 
variance of such scores.) However, even under 
these restrictive circumstances, perceived 
freedom tends to be positively associated with 
attentiveness and ingratiation. The correla- 
tions for subjects receiving specific informa- 
tion treatments were also positive. 


Discussion 


It was predicted that a variable reinforce- 
ment schedule would evoke greater attentive- 
ness to cues concerning the reinforcing agent’s 
dispositional qualities, and more ingratiating 
behaviors, than would a continuous reinforce- 
ment schedule. These predictions were based 
on the assumption that an agent who employs 
a variable ratio is believed to possess a wide 
margin of freedom to do as he pleases. When 
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an agent is perceived in this light, it be- 
hooves the recipient of his reinforcing act ons 
to discover what pleases him and to act > a 
manner that will win his approval. The i 


of this study are consistent with these p - 
tions and assumptions, although the rela 
ship between the agent’s reinforcement 
ule and the amount of ingratiation he el 
did not reach a conventional level of 
tical significance (p < .10). 

Parallel predictions concerning the im) ct 
of publicly assigning reinforcement rule: to 
the agent were not so uniformly suppo 
Subjects did not attribute significantly gre 
freedom to unassigned agents, and, altho: ch 
the subjects manifested greater attentiven “ss 
when there was no public assignment, this 
effect was significant for only one of two moi- 
sures. Unassigned agents elicited more in: :- 
tiation than did those who were publicly 2s 
signed, but this outcome was only margins y 
significant (5 < .06). 

In retrospect, it appears likely that : 
manipulation of role assignment was rater 
weak. The teacher was handed a small c.i 
and told that it carried instructions conce: i- 
ing proper reinforcing behavior. But the na. e 
subject had no way of knowing how detail 
these instructions might be, or whether they 
imposed severe or mild restrictions on t 
teacher's actions. Had this ambiguity not been 
present in the manipulation, findings concern- 
ing the effect of role assignment might have 
been more uniformly consistent with predic- 
tions. In any event, it should be noted that 
obtained results were in the anticipated di- 


ste - 


TABLE 10 
CORRELATIONS BETWEEN ATTRIBUTED FREEDOM AND OTHER VARIABLES 


Variable 
Category subjects Attentiveness n 
Conformity 
Associates Teaching philosophy 
All subjects X Pb iene gres EL 88 
Variable ratio subjects , .26** 09 EL 44 
Continuous reinforcement subjects Al .23* —.07 44 
Instructions subjects R NK od 14 44 
No-instructions subjects aon apt se 44 


* Significantly different from zero at the ,10 level. 
** Significantly different from zero at the .05 level. 
+++ Significantly different from zero at the .01 level. 


ahi (ett ni i it 
ee, 


2i J 


REINFORCEMENT SCHEDULES AND ATTRIBUTED FREEDOM 


rection: n when they were not statistically 
signific 

It micht be argued that the rather strong 
effect inforcement schedule reflected a 
difference between the treatments with respect 
to the ount of money that subjects might 
concei expect to win. Thus, although the 
subjec iving variable ratio and continu- 
ous r sment treatments actually won 
the same amounts, it is reasonable to suspect 
that ble ratio subjects envisioned the 
possib f winning more than continuous 
reinforcement subjects after the recess was 
termi: (In variable ratio treatments the 
teacher dispensed and withdrew quarters, 
wheres in continuous reinforcement treat- 
ments he dealt only with dimes.) If variable 
ratio subjects believed the teacher had a 


larger sum at his disposal than did continuous 
reinforcement subjects, the tendency of the 
former ʻo manifest greater attentiveness and 
ingratiation may have reflected their estimates 


of the teacher's “stake,” rather than his pre- 
sumed ireedom to dispense it. The data of 
this study do not permit an unequivocal test 


s hypothesis, since the treatments that 
kely to have encouraged the perception 
ich stake are the ones in which the sub- 


imagined magnitude of the 
was not totally responsible for the obtained 
results, Steiner (1970) has suggested that 
perceived freedom is a joint function of the 
valence of possible outcomes and the proba- 
bility that they will be obtained. If this is the 
case, reinforcement schedule and stake may 
be codeterminants of the freedom that is at- 
tributed to a reinforcing agent. 

It is possible that variable ratio subjects 
are especially attentive because the agent’s 
reinforcing behaviors seem unusual, illogical, 
or bizarre. Byrne and Clore (1967) have 
argued that inexplicable events threaten one’s 
self-confidence, and thus arouse an “effec- 
tance motive” (White, 1959). According to 
this view, one needs to understand the en- 
vironment in order to feel secure, whereas the 
theory advanced in this study maintains that 
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one seeks to understand the manipulable as- 
pects of the environment in order that they 
may be manipulated to one’s advantage. Per- 
haps both motives for attentiveness are op- 
erative in many situations, but there is reason 
for doubting that “effectance” is a very satis- 
factory explanation of the behaviors of our 
variable ratio subjects. Byrne and Clore’s 
scale for measuring effectance arousal was 
administered to all subjects immediately fol- 
lowing completion of the anagrams task. The 
means of the obtained scores for the two rein- 
forcement conditions were very similar (varia- 
ble ratio = 12.5, continuous reinforcement = 
11.9; F < 1). 

In order to vary the frequency of reinforce- 
ment while holding constant the total amount 
of money subjects won, it was necessary to 
vary the magnitude of reinforcement. Perhaps 
variable ratio subjects attributed more free- 
dom to the teacher because he administered 
larger reinforcements, rather than because he 
administered them less frequently. The greater 
magnitude of the variable ratio reinforce- 
ments may also have led subjects to see the 
teacher as a more “powerful” agent. Thibaut 
and Kelley’s (1959) matrix analysis of power 
suggests that the variable ratio teacher might 
have seemed more powerful than the continu- 
ous reinforcement teacher because the former 
moved the subject through a wider range of 
outcomes. Additional research is needed to 
identify the separate effects of frequency and 
magnitude of reinforcement on attributed 
freedom, and to determine whether an agent’s 
attributed power is discriminably different 
from his attributed freedom. 

The findings of this study suggest that re- 
inforcements have effects that are often ig- 
nored by researchers. When subjects are hu- 
man beings, rewards and punishments may þe 
more than mere instrumentalities for 
strengthening or weakening habits, or incen- 
tives by which desired behaviors are induced. 
They may also be communication devices 
which inform the recipient about the probable 
intentions, attitudes, and freedom of the 
agent who administers them. Variable ratio is 
likely to tell the recipient that he is dealing 
with a comparatively free agent whose actions 
reveal his personal preferences and idiosyn- 
cracies, and who may be susceptible to ingra- 
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tiation. On the other hand, continuous rein- 
forcement tends to mark the agent as one who 
lacks freedom, whose behaviors disclose little 
about his dispositional character, and who is 
relatively immune to ingratiation. To be sure, 
when variable ratio is administered by an 
experimenter in a laboratory setting, it may 
communicate approximately the same message 
as does continuous reinforcement: human sub- 
jects are likely to realize that even seemingly 
erratic experimenters are highly programmed 
and enjoy little discretionary freedom. How- 
ever, outside the laboratory, the pattern of re- 
inforcements received by an individual may 
have a strong influence on the manner in 
which he perceives the controlling agent, and 
on his attitudes toward that person. Tf this is 
the case, behavior therapy may be assumed to 
evoke complex cognitive and evaluative proc- 
esses which are not comfortably handled by 
the Skinnerian model. To a far greater degree 
than rats or even chimpanzees, humans ap- 
pear to impose meaning upon social situations, 
and an agent's reinforcement schedule may 
readily become a source of cues which guide 
this endeavor. 

It is easy to infer implications of this re- 
search for child training, teacher-pupil rela- 
tions, and a broad array of situations in which 
reinforcements are employed. Additional work 
is needed to determine how completely the 
findings of this study can be generalized to 
cover events that occur in such real-life set- 
tings. 
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'The problem was to test the applicability of Fiedler's contingency model on 56 


classroom groups. 


It investigated the effectiveness of high and low least 


preferred co-worker (LPC) leaders with strong or weak power on structured 
and unstructured group tasks when the leader-member relations were good. 
'The results revealed that low LPC leaders were more effective than high LPC 
leaders when the leader had strong power and the tasks were either structured 
or unstructured, Further, high LPC leaders were more effective than low LPC 
leaders when the leader had weak power and the task was unstructured. No 
difference was found between high and low LPC leaders when the task was 
structured and the power was weak. The findings offer partial support for the 


model. 

Much research has been done in the area of 
leader-hi» effectiveness, but little has been 
done in direct application of leadership theory 
to classroom situations. The following study 
was designed to test the application of one 
such theory, Fiedler’s (1964) contingency 
model. on small interacting classroom groups. 

Previous research by Fiedler (1964) indi- 


cated that leadership effectiveness is a func- 
tion of situation favorability, where favorabil- 
ity refers to the degree to which the dimen- 
sions of the group situation affect the ability 
of the leader to influence the group members. 
He proposed in the contingency model that an 
interaction effect existed between leadership 
style, as measured by the Least Preferred Co- 
worker (LPC) scale as cited in Fiedler 
(1967), and group effectiveness, and that di- 
rect, task-oriented leaders (low LPC) were 
most effective in highly favorable or highly 
unfavorable situations, while indirect, relation- 
oriented leaders (high LPC) were most effec- 
tive in moderately favorable situations. Situ- 
ation favorability was determined in order of 
importance by leader-member relations, task 


1 This study is based on the author's doctoral dis- 
sertation completed at Indiana University. Apprecia- 
tion is expressed to Laurence D. Brown, Jerome M. 
Chertkoff, Edward E. Gotts, and Richard L. Turner 
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? Requests for reprints should be sent to the au- 
thor, who is now at the Institute for Child Study, 
leo) ac of Maryland, College Park, Maryland 
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structure, and position power. The above can 
best be explained by examining Table 1. In 
this table, Cell 1 represents the most highly 
favorable situation, while Cell 8 represents the 
least favorable situation. Thus, in a highly 
favorable situation, the leader-member rela- 
tions are good, the task is structured, and the 
position power is strong, while in the least 
favorable situation, the leader-member rela- 
tions are bad, the task is unstructured, and 
the position power is weak. 

By combining the results of various corre- 
lational studies, Fiedler (1964) discovered 
that a U-shaped function existed between 
LPC and group effectiveness. In particular, 
low LPC leaders were most effective in Cells 
1, 2, 3, and 8, while high LPC leaders were 
most effective in Cells 4 and 5. Cell 6 had not 
been specifically investigated, while the re- 
sults concerning Cell 7 were inconsistent and 
did not strongly support either high or low 
LPC leaders. 

Tests of the contingency model were made 
by Anderson (1964), Shaw and Blum (1966), 
Hunt (1966), Fiedler (1966), and Reilly 
(1968). Of these studies, only Fiedler at- 
tempted to replicate the entire model, and his 
results strongly supported the model. Hunt, 
who tested Cells 1, 2, 5, and 7, also obtained 
results which strongly supported the model. 
Both Anderson’s and Shaw and Blum's results 
weakly supported the model, and Reilly's 


367 


368 


TABLE 1 
CONTINGENCY MODEL 


Cell | Leader-member 


Task structure | Position power 


relations | | 
1 | Good | Structured | Strong 
2| Good | Structured Weak 
3 Good Unstructured | Strong 
B Good Unstructured Weak 
5 Bad Structured Strong 
6 Bad Structured Weak 
7 Bad Unstructured Strong 
8 Bad Unstructured Weak 


results failed to replicate any parts of the 
model. 

Since few researchers attempted to apply 
the contingency model to an educational set- 
ting, this study was specifically designed to 
test the applicability of the model to small, 
intact, interacting, classroom groups. It inves- 
tigated the effectiveness of high and low LPC 
leaders with either strong or weak power on 
both structured and unstructured group tasks. 
This particular study investigated only the 
first four cells of the model, because socio- 
metric data collected prior to the experimental 
session in a classroom comparable to those 
used in the study indicated that too few stu- 
dents displayed a sufficient dislike for class- 
mates to create groups with poor leader- 
member relations. One of the weaknesses of 
past studies was the use of correlations be- 
tween LPC and achievement on a task as the 
dependent variable. Frequently, these corre- 
lations were not significant at acceptable lev- 
els, and thus the implications drawn from 
them were viewed with reservations. Butter- 
field (1967) noted that of the 72 correlations 
Fiedler (1967) used to support his theory, 
only 15 were significant at the .05 level or 
less. Consequently, the use of correlational 
data in this study was avoided. 

The specific hypotheses investigated were 
as follows: (1) Low LPC leaders are more 
effective than high LPC leaders when the task 
is structured and the leader-member rela- 
tions are good, regardless of position power; 
(1a) low LPC leaders are more effective than 
high LPC leaders when the task is structured, 
the position power is strong, and the leader— 
member relations are good; (1b) low LPC 
leaders are more effective than high LPC 
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leaders when the task is structured, the posi- 
tion power is weak, and the leader-nx r 
relations are good. (2) An interaction eect 
exists between LPC and position power when 


the task is unstructured and the leader—iem- 
ber relations are good; (2a) low LPC leaders 
are more effective than high LPC leaders 
when the task is unstructured, the position 
power is strong, and the leader-member : 

tions are good; (2b) high LPC leaders cre 
more effective than low LPC leaders when t? 
task is unstructured, the position power 
weak, and the leader-member relations 


good. 


j 


METHOD 
Subjects 


Subjects were 224 (97 males and 127 females) 
sophomores and juniors who were members of seven 
sections of educational psychology at a large mid 
western university. 


Procedure 


Five or 7 days prior to the experimental session 
during one of their regularly scheduled class per : 
all subjects were asked by their instructor to com 
plete an LPC scale. The four highest and the four 
lowest in each class were selected as the leaders for 
their particular section. These leaders were randomiv 
assigned to three-member groups, which were also 
randomly formed within their particular section; 
thus, there were a total of 28 high LPC-led groups 
and 28 low LPC-led groups, consisting of one leader 
and three group members each, 

Two high LPC leaders and two low LPC leaders 
from each section were randomly selected as strong 
power leaders, while the remaining leaders became 
weak power leaders. This manipulation was effected 
by means of the instructions given to the groups. 
Groups with strong power leaders were informed 
that their leader was specifically selected for this 
task and, further, that he had the responsibility to 
determine if all group members should share equally 
in the grade that the group received; that is, mem- 
bers contributing more work than others should 
receive a higher grade than those contributing less. 
Those groups with weak power leaders were in- 
formed that their leaders were selected at random, 
and there was no reason to suppose that they were 
more qualified than any other member of the group. 
Further, they could select another leader for future 
assignments, if they desired. To validate this manip- 
ulation, the position power scale was administered to 
all group members except the leader immediately 
after the completion of the last task. 

All groups were assigned to both a structured and 
an unstructured task concerned with a 12-minute, 
problem-centered, open-ended, stimulus film (What 
Do I Know about Benny? 1967). Task structure was 
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defined sion verifiability, goal clarity, goal 
path r and solution multiplicity, as cited 
in Sh A structured task was high in 
decisi bility and goal clarity and low in 
goal iitiplicity and solution multiplicity, 
while ructured task had the opposite char- 
acteris order of presentation of the tasks 
was janced to control for an order effect. 
Grou; randomly assigned to either the struc- 
tured instructured task first, and upon com- 
pletior ere given the opposite task. A time 
limit inutes was allowed for each task. The 
score < . tasks was the dependent variable used 
in th At all times, a genuine classroom at- 
mosph maintained in which subjects were to 
believ the assignments they were performing 
were | their normal curriculum and that these 
assign: represented a part of their course grade. 


The uctured task was an opinion-type ques- 


tion < ning recommendations the group, which 
was i: me it was a teacher's committee on stu- 
dent : , was to make to a school board concern- 
ing B . a student discussed in the stimulus film. 
The were evaluated by three judges on the 
basis hree criteria: style, adequacy of recom- 
mend and persuasibility. Style referred to the 
gram: |] correctness of the paper, the quality of 
the i: ve used, the correctness of the spelling, and 


the : opriateness of the statement as a recom- 


mend to the school board. Persuasibility re- 
ferred the degree to which a school board would 
be infuenced to accept the recommendation pre- 
sented. Adequacy of recommendations referred to 
the degree to which the recommendations effectively 
conside; he problem and all its aspects. 

The tured task was an objective test concern- 
ing details and problems presented in the stimulus 
film. The group was instructed to work together on 


only one question at a time. It was evaluated by 
summing the number of correct responses to the ob- 
jective test. 

Leader-member relations were not varied and were 
assumed to be good. To determine if this assumption 
remained valid throughout the study, the group at- 
mosphere scale was administered to the leader after 
each task. In addition, the group evaluation scale 
was administered after each task to each group mem- 
ber except the leader. This scale evaluates the ex- 
tent to which each group member feels that he is an 
integral part of the group. In addition, it measures 
his perception of his ability to work with the as- 
signed leader of the group, his desires to remain in 
the group for future work, and his perception of 
how the entire group perceives the effectiveness of 
the leader. 


Design 


In each section, the instructor introduced the stim- 
ulus film by stating: “Today’s film is entitled What 
Do I Know about Benny?, and it is about a pue 
teacher conference” After the film was completed, 
the instructor said: 
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You are now going to do some group work con- 
cerning the film. Each group will do two assign- 
ments which will be graded and count toward your 
final grade. For this particular assignment, I have 
prearranged the groups. Because there is not 
enough space, I’ve scheduled another room for 
half of you, and I've asked Mr. {an 
assistant] to help with the work. The following 
people should go with Mr. to Room 


To the students remaining in the classroom, the in- 
structor said: “Tve put the rest of you into the fol- 
lowing groups.” The instructor then indicated which 
students belonged into which groups, who was the 
group leader, and where each group was to work, 
Once the groups were formed and in their correct 
place, the instructor gave the strong power com- 
munication and informed the groups that they had 30 
minutes to complete each task. The first task for 
each group was then given to the leader of that 
group. When the first task was completed, the re- 
sults were collected, and the leader of the group was 
given the first group atmosphere scale, which he was 
to complete in isolation from the rest of the group. 
The instructor then gave each group member the 
first group evaluation scale. When all scales were 
completed, they were collected by the instructor, and 
the leader was given the second task. When this 
task was completed, the results were again collected, 
the leader was given the second group atmosphere 
scale in isolation from the group, and the group 
members were given the second group evaluation 
scale. When the group members completed the group 
evaluation scale, the instructor gave them the position 
power scale. When all scales were completed, they 
were collected, and the group was dismissed. 

Tn all cases, the instructor's groups were the four 
strong power groups. The assistant and the instructor 
followed the same procedure, except for the power 
communication, in which the assistant gave the weak 
power communication. Thus, in the instructor's class- 
room there were four groups with strong power, two 
who had high LPC leaders and two who had low 
LPC leaders. One of the high and one of the low 
LPC-led groups did the structured task first, while 
one of each did the unstructured task first. In the 
assistant’s classroom, there were four low power 
groups distributed in the same manner as the in- 
structor's classroom. In both classrooms, the groups 
were stationed at the corners of the room, and the 
two groups working on the same task at the same 
time were positioned diagonally from each other. Any 
additional students were placed into separate groups 
and given the same assignments; however, their 
results were not included in the analysis. 

The unstructured task was retyped exactly as the 
group had written it. Copies of the task were given 
to the three judges, who were advanced graduate 
students. The order of presentation of the task copies 
to the judges was randomized for each judge. The 
judges received a standard instruction sheet which 
defined the three criteria by which the task was to 
be scored and the procedures by which this scoring 
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was to be done. They were to evaluate the papers on 
each criterion using any value from 1 to 50, with 1 
representing a poor evaluation and 50 representing a 
good one. Further, all papers were to be graded on a 
particular criterion before any papers were to be 
graded on the next criterion. To evaluate the struc- 
tured task, the groups’ results were scored by an 
objective key. 


Analysis of the Data 


To analyze the data, a 2 X2 factorial design be- 
tween LPC (high and low) and position power 
(strong and weak) was used for both the structured 
and the unstructured task. Hypothesis 1, using the 
group's effectiveness on the structured task as the 
dependent variable, investigated Cells 1 and 2 of 
the contingency model. A main effect was predicted 
in which the low LPC leader would be more effec- 
tive than the high LPC leader. Hypothesis 2, using 
the group's effectiveness on the unstructured task as 
the dependent variable, investigated Cells 3 and 4 of 
the model. An interaction effect was predicted, in 
which the low LPC leader would be more effective 
than the high LPC leader when the position power 
was strong and the high LPC leader would be more 
effective than the low LPC leader when the position 
power was weak. To test the hypotheses (1a, 1b, 2a, 
and 2b) which investigated Cells 1, 2, 3, and 4, re- 
spectively, an individual comparison was run be- 
tween the appropriate cells using the F test cited in 
Winer (1962, p. 120). 


RESULTS 


To select the most task-oriented and rela- 
tion-oriented leaders within a particular sec- 
tion, the LPC scale was administered as a 
premeasure prior to the experimental session. 
The minimum value obtainable on the par- 
ticular LPC scale used in this study was 16, 
while the maximum was 128. For all students, 
the mean value was 66.0, the median was 
65.0, and the standard deviation was 17.3. For 
high and low LPC leaders, the mean values 
were 92.2 and 40.5, respectively. 

To validate this assumption of good leader— 
member relations, two scales were adminis- 
tered after each task was completed. The first 
of these scales, the group atmosphere scale, 
represented the leader’s perception of his re- 
lationship with the group. The possible range 
of scores on this scale was from 10 to 80, with 
the higher value representing more favorable 
relations, The mean value after both tasks was 
71.7, while the standard deviations after the 
first and second task were 4.8 and 8.6, respec- 
tively. The second of these scales, the group 
evaluation scale, represented the group's eval- 
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uation of the atmosphere within the group and 
of the relationship between the group members 
and the leader. The range of possible scores 
on this scale for each individual was from 4 
to 20, with the larger value representing the 
more favorable relationships within the group. 
To obtain a group score, the values obtained 
by each of the three group members (the 
leader did not complete this scale) were added 
together, thus giving a possible range of 
12-60. The mean group score after the first 
task was 53.1, with a standard deviation of 
3.61, while after the second task the mean 
group score was 54.0, and the siandard devi- 
ation was 3.33. Thus, both the group atmos- 
phere scale and the group evaluation scale in- 
dicated that after each task, the relationship 
within the group was good. 

To determine if the power manipulation 
was successful, the position power checklist 
was administered to the group members (ex- 
cluding the leader) at the end of the experi- 
mental session. The possible range of scores 


on this checklist was from 5 to 18, with the 
higher value indicating that the group mem- 
bers perceived the leader to have sironger po- 


sition power. For the strong position power 
groups, the mean was 9.3, and the standard 
deviation was 2.5, while for the weak position 
power groups the mean was 3.5, and the 
standard deviation was 2.6. A /* test on the 
difference between the two groups (df — 1/ 
86) yielded a value of 91.7, which was highly 
significant, thus indicating that the two 
groups were significantly different from each 
other in regard to the perception of power of 
the leader by the group members. The check- 
list scores indicate that the strong position 
power leaders had only moderately strong 
power, which left some doubt as to the suc- 
cessfulness of the power manipulation. How- 
ever, in responding to the position power 
checklist statements, “Leader can recommend 
punishments and rewards" and “Leader can 
effect (or can recommend) promotion or 
demotion," all group members in the strong 
position power groups responded “agree,” while 
only a few members of the low power groups 
responded in an affirmative manner. Since 
these two statements were the essence of the 
power manipulation, it was concluded that 
this manipulation was successful. 


yerSHIP STYLE AND PERFORMANCE OF SMALL CLASSROOM GROUPS 


Each completed both a structured 
and an ‘tured task. On the structured 
task, ier-Richardson reliability was 
61, an it-half reliability was .62, with 
a stand ror of measurement of 2.58. The 
reliabili the measurements of the three 
judges < unstructured task was com- 
puted | of analysis of variance for ad- 
justed as cited in Winer (1962). This 
particu ibility was selected because the 
varian to differences between the mean 
ratings by judges represented a possible 
system vurce of variation. By adjusting 
the dat differences in frame of reference 
by the ent judges, this source of varia- 
tion wa minated as part of the error of 
measur st. The reliability of measurements 
of the judges on the unstructured task 
for th ee criteria, style, persuasibility, 
and a cy of recommendations, respec- 
tively, 751, .710, and .715. In addition, 
by sun - the values of the three criteria, a 
total sc was obtained from which a total 
reliability for the three judges was computed. 
This value was .815. 

The following Pearson product-moment Co- 
efficients of correlation between criteria were 
computed: style and persuasibility, .711; style 
and adequacy of recommendations, .711; per- 


d adequacy of recommendations, 
> values were obtained by summing 
the three judges’ ratings on each criterion to 
arrive at a total score for each unstructured 
task graded for a particular criterion. These 
total scores were used to compute the corre- 
lations. By examining these correlations, it 
can be seen that there was considerable over- 
lap between the various criteria used to judge 
the unstructured task. 

To test Hypothesis 1, the scores from the 
structured task were used as the dependent 
variable in a 2 x 2 factorial design, with 14 
cases per cell, The means of these data are 
presented in Table 2, and the results of the 
analysis of variance are presented in Table 3. 
A main effect was predicted with the low 
LPC-led groups being more effective than the 
high LPC-led groups; however, an interaction 
effect was instead found between LPC and 
position power. Thus, Hypothesis 1 was not 
Supported by the data. 
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TABLE 2 


STRUCTURED AND UNsrRUCTURED Task MEANS OF 
Hic AND Low LPC Groups WITH 


Power 

LPC Task — 

Strong | Weak 

High | Structured | 29.07 | 32.07 

Unstructured style 45.50 | 53.86 

Persuasibility 46.64 | 53.07 
Adequacy of recommenda- 

tions | 45.71 | 52.93 

"Total | 45.29 | 53.21 

Low |Structured 32.92 | 29.71 

Unstructured style 54.50 | 46.07 

Persuasibility 53.71 | 46.29 
Adequacy of recommenda- 

tions 53.21 | 48.00 

"Total 54.14 | 46.71 


Hypothesis la tested Cell 1 of the contin- 
gency model and predicted that when the posi- 
tion power was strong, the low LPC leaders 
would be more effective than the high LPC 
leaders. To test this, an individual compari- 
son was made between the high and low LPC 
groups with strong position power when the 
task was structured, using the method cited 
by Winer (1962) and discussed previously 
under analysis of the data. The results of this 
analysis are presented in Table 4. The F test 
(df — 1/52) was significant at the .05 level. 
By examining the data in Table 2, it can be 
seen that low LPC leaders were more effec- 
tive than high LPC leaders when the position 
power was strong and the task was structured. 
Since this difference was significant, Hypothe- 
sis 1a was supported, and Cell 1 of the con- 
tingency model was replicated. 

Hypothesis 1b tested Cell 2 of the contin- 
gency model, and it predicted that when the 
position power was weak, the low LPC leaders 
would be more effective than the high LPC 
leaders. To test this, an individual comparison 
was run between the high and low LPC groups 
with weak power on the structured task. The 
results of this analysis are presented in Table 
4, The F test (df = 1/52) was not significant 
at the .05 level, and by examining the data in 
Table 2 it is seen that it was not in the pre- 
dicted direction. Consequently, Hypothesis 1b 
was not supported by the data, and Cell 2 of 
the contingency model was not replicated. 
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TABLE 3 


SUMMARY OF ANALYSIS OF VARIANCE FOR FACTORIAL 
Desicn FoR HiGu AND Low LPC Groups with 
STRONG OR WEAK POWER ON THE STRUCTURED 
AND UNSTRUCTURED TASK 


Source of variation SS df MS F 
Structured task 
LPC (A) 7.87500 1 7.87500 
Position power (B) «16071 1 -16071 
A XB 135.16071 1 | 135.16071 | 9.015* 
Within cell 779.64286 | 52 14.99313 
Total 922.83929 | 55 
Unstructured style task 
LPC (A) 5.16071 1 5.16071 
Position power (B) 3.50000 | 1 3.50000 
A XB 672.07143 1 | 672.07143 | 15.373** 
Within cell 3,335.64286 | 52 64.14608 
Total 4,326.98214 | 55 
Unstructured persuasibility task 
LPC (A) .28571 1 .28571 
Position power (B) 3.50000 | 1 3.50000 
AXB 672.07143 1 | 672.07143 | 9.623* 
Within cell 3,631.85714 | 52 | 69.84341 
Total 4,307.71429 5 


Unstructured adequacy of recommendations task 


LPC (A) 23.14286 | 1 23.14286 
Position power (B) 14.00000 1 14.00000 
XB 540.64286 | 1 | 540.64286 | 7.205* 
Within cell 3,902.14286 | 52 75.04121 
Total 4,479.92857 | 55 
Unstructured total task 
LPC (A) 19.44643 1 19.44643 
Position power (B) .87500 | 1 .87500 
AXB 825.44643 1 | 825.44643 | 11,990* 
Within cell 3,579.78571 | 52 68.84203 
Total 4,425.55357 | 55 
*p«.01 
** p <.001 


To test Hypothesis 2, each of the judges’ 
scores on the unstructured task for each cri- 
terion and for the total score was converted 
to a T score, and the median T score of the 
three judges was selected as the dependent 
variable for each of the criteria and for the 
total score. These values were placed into the 
appropriate cell of a 2 X 2 factorial design, 
with 14 cases per cell. By examining Table 2, 
it can be seen that the means in the high 
LPC, weak power cel and the low LPC, 
strong power cell were consistently higher 
than the means in the high LPC, strong power 
cell and the low LPC, weak power cell when 
the task was unstructured. Table 3 presents 
the summary of analysis of variance for the 
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unstructured task. By examining thes? analy- 
ses, it can be seen that the predicted ©‘ erac- 
tion effect was significant for all three c-iteria 
and for the total score, thus suppor! Hy- 
pothesis 2. 

Hypothesis 2a tested Cell 3 of the lel. 
To test this, an individual comparis.: was 
made between the high and low LPC aps, 
with strong position power on each the | 
criteria used to judge the unstructur: isk | 


and with the total score. The appropriate F 
tests can be found in Table 4, where it co» be 
seen that each of the F tests was sign’! cant 
at the .05 level. By referring to Tabi: it 
can be seen that the low LPC leader: 
significantly more effective than the hig 
leaders when the position power was n 
and the task was unstructured. Thus. the 
data supported Hypothesis 2a, and Celi 3 of 
the model was replicated. Mj 
Hypothesis 2b tested Cell 4 of the » ‘cl. 
To test this, the appropriate individua! .^m- 
parisons were made for each of the cri(cria 
being used to judge the unstructured tas» «nd 
for the total score. The results of these a^^. y- 
ses can be seen in Table 4. All of the F tests 
were significant at the .05 level except sde- 
quacy of recommendations, which was greater. 
By referring to Table 2, it can be seen that the - 
high LPC leaders were significantly more | 
effective than the low LPC leaders when ‘he 
position power was weak and the task was un- 


TABLE 4 | 


INDIVIDUAL COMPARISONS BETWEEN Hron AND LOW 
LPC LEADERS WITH STRONG OR WEAK POSITION 
POWER ON STRUCTURED AND 
UNSTRUCTURED TASKS 


Task Power F 
Structured Strong | 6.946* 
Weak 2.549 
Unstructured style Strong | 8.839** 
Weak 6.615* 
Unstructured persuasibility Strong | 5.012* 
Weak 4.615* 
Unstructured adequacy of Strong | 5.247* 
recommendations Weak 2.266 
Unstructured total Strong | 7.977** 
Weak 4.296" 
*p <05, ram 
> <01 


structured, except when the criterion used to 


judge t k was adequacy of recommenda- 
tions. the adequacy of recommendations 
criterion in the predicted direction and 
since the rest of the analyses were significant, 
it was luded that the data supported 
Hypothe b, and that Cell 4 of the con- 
tingen del was replicated. 
DISCUSSION 
The of this study partially support 
the application of Fiedler's contingency model 
to cla m group work. Four of the six 
stated ! theses and three of four cells of 
the contingency, model were supported by the 
> tesult: his study. Only Cell 2 was not sup- 
ported by the results of this study, and in 
| this insiance the direction of the data was in 


reverse to what was predicted, but not to a 
significant amount. The failure of the main 
effect develop in Hypothesis 1 resulted 
from « ‘ailure to replicate Cell 2 of the model 
(Hypothesis 1b). 

If the assumption of good leader-member 
relation: was not valid, this could possibly 
account for the failure of this study to repli- 
cate Cell 2 of the model. This explanation was 
rejected for two reasons. First, the data from 
the group atmosphere scales and the group 
evaluation scales clearly indicated that the 
relations were good, Second, if the relations 
were not good when the task was unstructured 
and the position power was weak, the results 
would have been that the low LPC leaders 
were more effective than the high LPC lead- 
ers, and this was not in accordance with the 
data. Consequently, this explanation was re- 
jected. 

If position power was improperly manipu- 
lated and the weak power leaders were ac- 
tually strong, the cell being tested would have 
been one instead of two. If both position 


power and task structure were improperly 
manipulated, Cell 3 would have been tested 


instead of Cell 2. In both of these cases, low 
LPC leaders would have been more effective 
than high LPC leaders. The data obtained 
were in the reverse direction. In addition, the 
data from the results of the position power 
checklist indicated that the power manipula- 
tion was successful. Thus, these explanations 
were rejected. 
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If task structure was improperly manipu- 
lated, and if the structured task was unstruc- 
tured, the cell tested would have been Cell 4 
instead of Cell 2. If this were the situation, 
high LPC leaders would have been favored 
over low LPC leaders, and the data were in 
that direction. Further, if structure was im- 
properly manipulated, Cell 3 would have been 
tested instead of Cell 1. The direction of the 
relationship was the same in both of these 
cells, and consequently this was in accordance 
with the data. Even though task structure 
was defined in accordance with Fiedler, an 
ineffective manipulation of it remained as a 
possible explanation of the failure of this 
study to replicate Cell 2 of the model. 

Another explanation that could account for 
the failure of this study to replicate Cell 2 
was the assumption that position power was 
the least decisive of the situation dimensions 
in determining the favorability of the situa- 
tion. In the classroom setting in this study, 
the ability of the leaders to influence a stu- 
dent's grades could have made the power di- 
mension more decisive than task structure in 
determining the favorability continuum. If 
this described the decisiveness of the situation 
variables in determining the favorableness of 
the situation, the most important variable 
would have been leader-member relations; the 
second, position power; and the third, task 
structure. It is important to note that in this 
arrangement Cells 2 and 3 interchange posi- 
tions, as well as Cells 6 and 7. When consid- 
ering this modification, it must be remem- 
bered that Fiedler's theory states that in the 
most favorable and least favorable situation, 
low LPC leaders will be more effective than 
high LPC leaders, and that in the moderately 
favorable conditions the reverse is true. When 
examining the data in respect to this modifica- 
tion, Cells 1 and 4 remained the same, and 
this was in accordance with the data. Cell 3, 
which became a more favorable condition, 
would be characterized by a negative correla- 
tion between LPC and leadership effective- 
ness, and this was in accordance with the data. 
To maintain the U-shaped function of the 
model, Cell 2, which was now less favorable, 
would be predicted to have a correlation be- 
tween LPC and leadership effectiveness that 
was between the values obtained in Cells 3 
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and 4. The fact that this study showed no 
significant difference between high and low 
LPC groups, even though the direction of the 
data favored high LPC groups, offered sup- 
port for this modification of the favorability 
continuum. 

It should be noted that several other stud- 
ies, including two by Fiedler, obtained a posi- 
tive correlation between LPC and leadership 
effectiveness when investigating Cell 2 of the 
model. A review of these studies and accom- 
panying methodological considerations can be 
found in Mitchell, Beglen, Onchen, and Fied- 
ler (1970). In this review, five out of seven 
laboratory experiments obtained a positive 
correlation in Cell 2, while three out of three 
field studies obtained a negative correlation 
in Cell 2. Consequently, generalizations con- 
cerning Cell 2 of the contingency model 
should be made with caution. 

Two possible explanations were proposed 
to account for the failure of this study to sup- 
port Cell 2 of the contingency model: first, 
that the degree of structure of the structured 
task was too small and that this task assumed 
the properties of the unstructured task; and 
second, that position power was a more de- 
cisive determinant of the favorableness of the 
situation than was task structure. The author 
acknowledges that these proposed explana- 
tions were only speculative, and that addi- 
tional explanations could exist to better ac- 
count for the data. 

The results of this study have implications 
for the use of group work in the classroom. 
Specifically, if Fiedler's contingency model 
remains valid in a classroom setting, the in- 
structor of the classroom can increase the 
effectiveness of a group by selecting a leader 
who is most appropriate for the particular 
conditions within that group, or the instructor 
can manipulate certain situation variables that 
place a particular leader in the most favorable 
situation for his leadership style. 

Even though the results of this study 
strongly supported the contingency model 
when it was applied to group work within the 
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classroom, the results only applied to particu- 
lar sections of undergraduate educational psy- 
chology. Future experimentation should be 
done in educational settings which can be gen- 
eralized to a larger population. Further, addi- 
tional experimentation is necessary to deter- 
mine the reason for the failure of this study 
and other experimental studies to support 
Cell 2 of the model. 
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Attitude topics were scaled for the 
affecting a person's sense of com 
personal attraction containing evalua’ 


from the Byrne and Clore compet 
Subjects were shown attitudes of ot 
exposed, potency of the topics, and 
called more disagreement t 
salience of disagreements in the situa! 
attraction (p < 05), supporting the 


implicit in the model. 


Stucies of the effects of interpersonal dis- 
agreement continue to occupy a major posi- 
tion i: sonality-social research. The com- 
petence-reinforcement model of interpersonal 
attraction (Byrne & Clore, 1967) holds that 
attitudinal agreement acts as positive indica- 
tion to a person that he is interpreting the 
world correctly. Attraction then is a condi- 
tioned response to the person being paired 
with the agreement. 

Intuitively, disagreements would appear to 


be more salient to a person than agreements 
within the competence-reinforcement model. 
In this vein, subjects remember a significantly 
higher proportion of the disagreement topics 
following exposure to attitudinal agreement 
and disagreement (Gormly & Clore, 1969). 
Byrne and his associates have emphasized 
that attraction is a linear function of the 
proportion of agreements. A likely conse- 
quence of a reinforcement model and the pro- 
posed salience of disagreement is that the 
number of statements should have an effect 
on attraction. 

Considering the reinforcement model of 
attraction, agreements are posited to act as 


1 The authors wish to acknowledge the members 
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potency of peer disagreement on the topics 


petence, and a measure to assess inter- 
tive polar adjectives and behavioral in- 
ms were used to test predictions 
inforcement model of attraction. 


ence-rei 
hers which varied on number of topics 
Subjects re- 


proportion of agreements. 
t topics ($ < 001), illustrating the 


reinforcement position in the attraction 
supported motivational assumptions 


positive reinforcements, and disagreements 
are posited to act as negative reinforcements; 
these properties of agreements and disagree- 
ments come from their effects on a person’s 
sense of competence (Byrne & Clore, 1967). 
At an intuitive level, the effect of the number 
of units of evidence for or against a person's 
sense of competence should be, in some sense, 
cumulative. That is, 1 disagreement on a 
total of 1 topic should furnish less negative 
reinforcement than 10 disagreements out of 
10 topics. 

Considering the salience issue, the present 
authors assume that the larger impact of dis- 
agreements, as indicated by recall data, will 
result in a greater reinforcement value for 
disagreements than agreements, which will, in 
turn, have an effect on attraction. A conse- 
quence of this reasoning is that as the number 
of topics increases within any proportion, then 
attraction will decrease. 

Ina study by Byrne and Nelson (1965a), 
four proportions of agreements, .33, .50, .67, 
and 1.00, and three quantities of attitude 
topics, 4, 8, and 16, were examined in a be- 
tween-groups design. Only the proportion fac- 
tor reached an acceptable level of significance. 
While Byrne and Nelson clearly interpreted 
the number-of-topics factor and the interac- 
tion as not significant, their data were in the 
direction of decreased attraction being related 
to an increased number of topics. 
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In reexamining the issue of the effect of 
number of topics on attraction, the present 
study employed a larger number of topics and 
more points on the continuum of proportion of 
agreements than the Byrne and Nelson study. 
It was hypothesized that (a) subjects will 
recall more topics on which the stranger dis- 
agreed than agreed; and (5) increasing the 
number of statements, while holding the pro- 
portion of agreement constant, will decrease 
attraction. 

A second implication of the competence- 
reinforcement model is that the nature of the 
topic of agreement-disagreement should influ- 
ence attraction. Disagreement on some topics 
reflects more on a person's sense of compe- 
tence than disagreement on others. It has 
been shown that statements that were highly 
verifiable and, therefore, depended less on 
interpersonal consensus for correctness, in- 
fluenced attraction less than statements of 
low verifiability (Byrne, Nelson, & Reeves, 
1966). Other research (Byrne & Nelson, 1964, 
1965b; Clore & Baldridge, 1968) have re- 
ported rather equivocal effects, attributable to 
the potency of the topic. 

In dealing with the problem of topic po- 
tency (e.g., relevance to a person's sense of 
competence) in the current study, it was de- 
termined that subjects could reliably scale 
items on the degree of incompetence implied 
by having a peer disagree with a person on the 
topics. It was hypothesized that the potency 
of the topics would be related to attraction. 

Typically, in Byrne-influenced attraction 
research, the Interpersonal Judgment Scale 
(IJS; Byrne, 1961) is used to measure attrac- 
tion. The range of attraction scores from this 
measure is quite limited, and the question 
regarding participation in an experiment is a 
rather impersonal indicant of attraction. In 
the current study, an evaluative-behavioral 
measure of attraction is presented which per- 
mits more variance in scores by requiring 
more information concerning behavioral in- 
tentions and more judgments that are directly 
evaluative in nature. 


METHOD 
Subjects 


All subjects were students enrolled in the intro- 
ductory psychology course at the University of Ili- 
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nois. They participated in this study to fulfill 
course requirement, Samples of 40, 121, and 172 stu- 
dents were used in the three aspects of the study, 
Each sample was approximately balanced for sex. 

Competency scaling. Forty subjects judged 42 
attitude statements according to the degree of compe- 
tence-incompetence implied by peer disagreement. 
The judges were trained for the scaling task using 
the following procedure: (a) Competence was defined 
according to ideas presented in the Byrne and Clore 
(1967) monograph. (b) Using the method oí paired 
comparisons (Edwards, 1957), the subjects practiced 
selecting the item for which peer agreement would 
imply more competence. After the judges saw that 
they could make this distinction, the sample items 
were rated along a 6-point equal-appearing interval 
scale (0-5) for the degree of competence implied by 
peer agreement. (c) The procedure in b was re- 
peated on another sample of topics scaling the degree 
of incompetence implied by peer disagreement. (d) 
After the practice session and when the experimenter 
was convinced that each judge understood the | 
the judges rated 42 attitude statements of interest 
college students. The statements were rated on an 
11-point equal-appearing interval scale, with comi 
tent and incompetent as end points. Each item was 
ranked twice, once for agreement by a peer and once: 
for disagreement. The order of the 84 judgments was 
randomly determined. 

The interquartile range of the scale value for each 
of the 42 items judged according to the degree of 
incompetence was less than 1.40. This high degree of 
homogeneity in judgments is an indication that the 
criterion is scaleable. All items scaled for competence 
had an interquartile range of less than 1.60, which is 
also indicative of the scaleability of the criterion. (See 
Edwards, 1957, Ch. 4, for a discussion of the method 
of equal-appearing intervals.) 

Evaluative-behavioral attraction measure. Eight 
scales specifying behavioral intentions and 11 pairs. 
of evaluative polar adjectives were selected on the 
basis of their conceptual relevance to interpersonal 
attraction. The behavioral intention items were: would - 
ask to meet, would select for a roommate, would - 
study with, would ask advice from, would work | 
with, would invite to my home, would admit to my 
social group, and would accept as an intimate friend. 
The polar adjectives included pleasant-unpleasant, 
wise-foolish, nice-awful, beautiful-ugly, familiar- 
unfamiliar, kind-cruel, stable-neurotic, sociable-un- 
sociable, good-bad, clean-dirty, friendly-uníriendly. 
After 121 subjects had rated the stranger on the 
Byrne scale, they were asked to also rate him on the 
19 scales given above. These scales were presented in 
a six-alternative forced-choice format. The subjects 
came from several experimental procedures with 
varying proportions of agreement ; the general pro- 
cedure, however, always involved a stranger who 
disagreed with the subject on a certain number of 
attitude statements. Analyses of variance of attrac- 
tion as a function of proportion of agreement were 
conducted for each of the scales. The evaluative-be- - 
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; measure was formulated from the 
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relation with the proportion of agreement. The eval- 
uative-behavioral attraction measure items and their 
formats are: 


‘(PLEASANT 


!IUNFAMILIAR 


LAU Scc 2L —IFRIENDEDE 


ACCEPT THIS PERSON AS A ROOMMATE 


1 l IWOULD NOT 


ACCEPT THIS PERSON AS A MEMBER OF MY SOCIAL GROUP 


I | !'WOULD 


ACCEPT THIS PERSON AS AN INTIMATE FRIEND 


havioral 
three pol ive pairs and the three behavioral 
intention at showed the strongest positive 
UNPLEASANT! ! | 
FAMILIAR! ! l 
UNFRIENDLY!___!___ 
WOULD! ! i 
WOULD NOT! | | 
WOULDI___!__—_!|__—_" 
Inter? ; attraction, The 42 attitude items 
which | ^n scaled were split at the median of 
the sca! for degree of incompetence implied 
by pec zreement. High-competence topics in- 
duded ics toward premarital sex relations, the 
necessit war to solve world problems, and the 
meanin <s of life without the goal of eternal 
life. Th: w-competence items included attitudes 
toward <> oyment of gardening, the desirability of 
allowin cshmen to register cars on campus, an 
preferen r a foreign sports car over a Cadillac or 
Lincoln 
One lii: cired and seventy-two subjects participated 
in this aspect of the study. Three factors were ma- 
nipulated in a between-groups design: proportion of 
agreement, number of attitude items, and potency of 
the items 
The cts were tested on two occasions. In the 
first session, the subjects completed a survey of 
attitudes which included the 42 scaled topics plus 


a variety of personality scales. 

Fictitious attitude responses were devised by the 
experimenter which systematically agreed and dis- 
agreed with the subject’s responses made on the first 
occasion. The attitudes were supposedly given by a 
stranger (same sex and same year in school as the 
subject), and the subject was instructed to carefully 
examine this person’s responses. The subjects were 
given 7, 14, or 21 attitude statements of either high- 
competence or low-competence potency. The state- 
ments agreed with the subjects’ attitudes in a pro- 
portion of either 2/7, 3/7, 4/7, 5/7, OF 6/7. 

Agreements were one position removed from the 
subject’s original record but on the same side of the 
issue, for example, pro or con. Disagreements Were 
three positions removed from the subject's original 
responses on the 6-point scale. The choice and order- 
ing of the statements were random within the high- 
and low-competence item pools. Each statement was 
on a separate sheet of paper and in a 6-point forced- 
choice format. ' 

After the subjects had examined the strangers 
record, they were asked to make judgments about 
the person on the evaluative-behavioral attraction 


na ————Á— á—Á—— —————— 
measure and the IJS. Following these ratings, the 
subject was instructed to recall the attitude topics 
contained in the stranger’s booklet. 


RESULTS 


In examining the proportion of items re- 
membered, the 6/7 agreement condition was 
excluded, and the data were collapsed over all 
factors for a single comparison of the propor- 
tion of agreement topics remembered versus 
the proportion of disagreement topics remem- 
bered. Disagreements were more often remem- 
bered (z = 4.27, p < .01), thus supporting 
the first hypothesis. 

Ratings on the evaluative-behavioral attrac- 
tion measure were highly correlated with rat- 
ings on the IJS (r = .72), with the effects of 
proportion of agreements partialed out. Both 
measures were predictably related to the over- 
all proportion of agreements (for the evalua- 
tive-behavioral attraction measure, F = 4.59, 
df = 4/142, p < 01; and for the IJS, F= 
4.16, p < 01). 

The interaction between number of state- 
ments and proportion of agreement ap- 
proached the .05 level for the evaluative-be- 
havioral attraction measure scales (F = 2.96, 
df = 2/142, 05 < p< 40). Figure 1 illus- 
trates that as the number of statements and 
the proportion of agreements both increase, 
the attraction ratings stabilize at a moderate 
level of attraction. This interaction did not 
approach significance for the IJS. 

The effect of the number of statements on 
attraction was significant for the evaluative- 
behavioral attraction measure (F — 3.06, df 
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Fic. 1. The interaction between the number of topics and proportion oí 
agreements, 


— 2/142, p < .05; for the IJS, F = 2.63, ns). 
Attraction scores were highest for 7 items and 
lowest for 21 items. 

The competence factor was significant for 
both measures (for the evaluative-behavioral 
attraction measure, F = 5.25, df — 1/142, p 
< .02; and for the IJS, F = 4.10, p < .05). 
The high-competence items resulted in more 
attraction than the low-competence items. 
This effect is best understood by considering 
the interaction between the potency of the 
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Fic. 2. The interaction between the potency of the 
items and the number of topics. 


item and the number of statements. This in- 
teraction was significant for the evaluative- 
behavioral attraction measure (F = 3.44, df 
= 2/142, p < .05), but not for the IJS, al- 
though the means from both measures follow 
the same pattern. The potency of the item had 
its greatest effect when a large number of atti- 
tude topics were used. The three-way interac- 
tion for both the evaluative-behavioral at- 
traction measure and the IJS scores did not 
approach significance. 


Discussion 


This study supports two aspects of the 
competence-reinforcement model and presents 
a new set of scales for the similarity-attrac- 
tion paradigm. 

The recall data suggest that disagreements 
are more salient than agreements when con- 
sidering the attitudes of another. If agree- 
ments and disagreements have different rein- 
forcing values in influencing attraction, then 
the Byrne-Clore reinforcement model suggests 
that a larger number of topics (and hence a 
greater number of disagreements within pro- 
portion of agreements) would result in a lower 
attraction. The main effect for the number of 
topics was significant on the evaluative-be- 


traction measure, and the means 
in the predicted direction. Hence, 
of this study add support to the 
nt model. 


havioral 
were « 
the fin 
reinfor 


Initi ;cedures of this research demon- 
strate he degree of competence-incompe- 
tence ociated with attitudinal agreement— 
disag it is a viable criterion for scaling 
techn: Scaling on this competence-in- 
comp: . dimension provided items which 
allow lirect test of the motivational dy- 
nami ihe Byrne-Clore model. The predic- 
tion ti he potency of the topic would affect 
attra was supported using both the tradi- 
tiona ‘sure of attraction and the behav- 
joral-cv aluative measure devised for the pres- 
ent si This finding is support for the use- 
fulness: of the competence explanation in un- 
derstanding the effects of interpersonal dis- 
agre: ıt compared to dissonance or balance 
theories. While each theoretical position could 
probably be used to justify predictions about 
the quality of the topics of interpersonal dis- 
agreement having an effect on rejection of the 
disagree, the most reasonable prediction from 
dissonance or balance theories would have to 
be a prediction of general topic importance, 
rather than importance on the competence- 
incompetence dimension. The two studies 
which examined the effects of general impor- 


tance failed to find a significant difference 
(Byrne & Nelson, 1964, 1965b). 

The results from this study demonstrate 
that the larger the number of items used, the 
more definite are the differences for potency 
of the topics. Increasing the number of low- 
competent items resulted in a lowering of 
attraction, while increasing the number of 
high-competent items raised attraction. It is 
not clear why the number of topics should 
interact with the potency of the topics in this 
way. Perhaps there is a reluctance on the 
part of the subject to admit disliking a person 
on such a limited amount of information. 
Whether the subject knows 7, 14, or 21 atti- 
tudes of the stranger, he has very little infor- 
mation about that person. The task of judg- 
ing a stranger on the basis of a few responses 
is a difficult one for the subject. The subject 
has little information about why the stranger 
responds as he does or whether he acts on his 
beliefs. As one might expect, groups of sub- 
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jects seldom get extreme in their ratings of 
attraction-rejection toward the stranger. As 
the number of attitude items increases, how- 
ever, the subject may feel obliged to make  —— 
increasingly definitive judgments about the 
person. Vet, the low-competence items give 
very little substantive information about the — 
stranger. It is possible that in this situation, — — 
the subject feels that he should know some- —— 
thing about the stranger on the basis of the 
sheer number of statements, but because of 
the low potency of the statements, he realizes 
that he knows very little about that person. 
Hence, the subject feels frustrated by being 
asked to perform a task that he feels he can- 
not, and this causes the subject to reject the — 
other person. This is a frustration-aggression "i 
hypothesis that could be tested by seus A 
ing the instructional set to the subject. A 

The interaction between potency and nu 
ber was significant only for the emis a 
behavioral attraction measure scales. Similarly, 
the main effect of number of topics and the 
interaction of the number with proportion of 
agreement were significant for the evaluative- 
behavioral attraction measure but not for the 
IJS. The evaluative-behavioral attraction mea- 
sure scales offer the advantage of a larger 
range of attraction scores. Another possible 
advantage is that the behavioral intention 
items are more strongly related, intuitively, to 
the concept of interpersonal attraction than 
the IJS items. In the present study, several 
relatively weak IJS relationships were signifi- 
cant on the evaluative-behavioral attraction 
measure. Further, most of these findings were 
predicted from the competence-reinforcement 
model. It appears that efforts to test the com- 
petence-reinforcement model might profitably 
put greater emphasis on more evaluative and 
behavioral indicants of attraction. 

The interaction between the number of top- 
ics and the proportion of agreement illus- 
trates that when the proportion of agreements 
is small, that is, when disagreement is per- 
vasive, then the number is not important, but 
when the proportion of agreements is high, 
then the number of disagreements has an 
effect. The absence of any report of this rein- 
forcement effect in earlier studies may be 
attributed to methodological differences: the 
smaller number of statements used, the smaller 


1 


380 


proportion of agreement used, and the inabil- 
ity of the IJS to assess such subtle differentia- 
tions, 

Two points might be raised in relation to 
future attraction research. The role of dis- 
agreement, both in terms of number and po- 
tency of the topic, has been neglected and is 
not well formulated; yet, it may be the most 
salient aspect of the situation for the subject. 
The practice of giving the subject a small 
number of attitude statements from an alleged 
peer seldom results in ratings that indicate 
outright rejection of that person. A value 
probably exists for college students that makes 
extreme ratings of either attraction or rejec- 
tion inappropriate on such limited informa- 
tion. This limited range of variation in the 
bogus-stranger paradigm makes tests of theo- 
retical statements difficult, particularly when 
these tests depend on interactions among fac- 
tors, A methodology in which the proportion 
of agreement could continue to be rigorously 
controlled, but where the potency and amount 
of information could have an expanded range, 
would be a major advancement, Recent de- 
velopments in videotape systems may prove 
useful here. 


J. Gormiy, A. GonMLY, AND C. JOHNSON 
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QUITY AND THE DISTRIBUTION OF REWARDS 


IRVING M. LANE ap L. 


AWRENCE A. MESSÉ! 


Michigan State University 


This study reports the results of two studies that examined propositions derived 
m equity theory concerning how persons distribute rewards to themselves and 
hers. Such behavior can be classified into three types of distribution responses. 
idy I investigated the distribution responses of persons who each had unilateral 


to determine their own rewards and those of one other person, 


d equal inputs. Results support 
ning the relative frequencies of 


when both 


predictions generated from equity theory con- 
distribution responses. Also, a number of situa- 


nal and personal variables were found to affect responses. In Study II, subjects 


listributed rewards in one of four groups, 
input levels of the chooser 
ly partially support equity 
hoices. Taken together, the results of both 


constituted such that differences in the 


and his co-worker were varied systemically. Results 
theory since only the inputs of the co-worker affected 


studies indicate that persons do conform 


a norm of equity which lessens the potential for conflict present in all social 


situations. 


lamental fact of social science that 
ion in groups in part because 
a system of norms that allows 
them both to generate expectations about, and 
to weigh the appropriateness of, their own 
behavior and the behavior of others. A subset 
of these norms serves to mitigate the inter- 
personal conflict that is potentially present in 
many social situations. This study reports the 
results of two investigations whose focus was 
one such norm: the sense of equity. 


It isa fi 
men can i 
they develop 


Equity Theory 


Adams (1965) has developed a theory that 
attempts to explain behavior influenced by the 
norm of equity. The theory is basically a 
modification and extension of the concept 

distributive justice” (Homans, 1961). Equity 
theory states that persons should be concerned 
about the way rewards are distributed in a 
group. An equitable distribution gives each 
person a reward (outcome) that is proportional 
to his inputs—those qualities that are perceived 
to be a person’s contribution to the group. 
According to equity theory, persons should be 
satisfied with a situation as a function of the 
degree to which they perceive that a distribu- 
tion of rewards is equitable. 

On the other hand, persons should be dis- 
Satisfied whenever they perceive that either 
(1) they are receiving less reward than they 
ju iuests for reprints should be sent to Lawrence A. 
Gane, Department of Psychology, Michigan State 

sity, East Lansing, Michigan 48823. 


should, given their level of inputs, or (2) they 
are receiving more than they should. Moreover, 
(3) a person should feel more dissatisfied with 
the first distribution than with the second. 
These three propositions are the basic premises 
of equity theory. 

Adams (1965), 
research, concluded that 


in his review of the relevant 
there is support, both 


experimental (Adams, 1963; Adams & Jacob- . 


sen, 1964; Adams & Rosenbaum, 1962; 
Arrowood, 1961; Thibaut, 1950) and natura- 
listic (Clark, 1958; Homans, 1953; Jacques, 
1961, 1962; Patchen, 1961; Sayles, 1958), for 
the first two propositions of equity theory. In 
addition, a number of studies since then 
have also supported these two propositions 
(Andrews, 1967; Goodman & Friedman, 1968; 
Lawler & O’Gara, 1967; Leventhal & M ichaels, 
1969; Weick & Nesset, 19068)? However, as 
Adams (1965) noted, there are few if any 
studies directly relevant to the third proposi- 
tion of equity theory. A major goal of the 
research presented here as Study I was to test 
directly this proposition, as well as the first two. 


2 There has been some research (Friedman & Good- 
man, 1967; Gordon & Lowin, 1965; Lawler, 19682; 
Lawler, Koplin, Young, & Fadem, 1968) which has been 
nonsupportive of equity theory to varying extents. In 
addition, equity theory has also been criticized on 
theoretical grounds (Lawler, 1968b; Weick, 1966). 
However, both the research producing negative results 
and the theoretical criticisms involve a number of 
separate issues that are beyond the scope of the present 
research. Therefore, a detailed discussion of them has 
been omitted. 


vet Unt >» 


The research cited above typically has 
examined responses of subjects to situations in 
which an inequity of some type already exists. 
Most experimental studies, for example, have 
presented subjects with a potentially inequit- 
able outcome in comparison to an input/out- 
come ratio of a referent, and examined how the 
subjects adjust their inputs. Thus, for such 
subjects, inequity is a fait accompli. 

The present research, on the other hand, 
presents subjects with a situation in which an 
inequity, if generated, is solely a product of 
their own responses. More specifically, a sub- 
ject is asked to distribute monetary rewards to 
himself and a co-worker for a task they both 
have performed. The co-worker has no power 
to determine, in any way, the rewards to 
himself or any other person. The subject is 
presented with two distributions of rewards, 
and is asked to choose one of them as the pay- 
ment to himself and his co-worker. One distri- 
bution is always more equitable than the other. 
Thus, the subject’s choice of distributions is an 
index of the degree to which he conforms to the 
norm of equity. 

It is believed that the situation used in the 
present research has two major advantages 
over the methodologies used in most past 
experimental studies of equity. First, the 
situation in which a condition of inequity 
already exists probably occurs most frequently 
in industrial settings. In fact, most, if not all, 
naturalistic studies of equity have been per- 
formed in work settings. Such situations, on 
the other hand, probably occur much less fre- 
quently outside of industry. In most social 
situations, the problem is not how to resolve 
inequity, but rather how to respond in such a 
way that inequity is not created. This latter 
situation is simulated more accurately by the 
methodology of the present research than by 
procedures used in previous studies. 

Second, the methodology of past studies 
makes it difficult to test the third proposition 
of equity theory. The situation used in the 
present research, on the other hand, readily 
permits such a test. This proposition is tested 
by comparing responses made when subjects 
are faced with a choice of alternatives, in which 


3 The one exception was a recent study by Leventhal 
and Michaels (1969) which also allowed subjects to 
allocate rewards. 
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one distribution is inequitable in his favor, to 


responses made when one of his two alterna- 
tives divides rewards inequitably against him, 
Distribution Responses 

A person, in a situation where he niust 
distribute rewards to himself and another 
individual, can respond in a number of E, 
For the present research, three genera es 
of "distribution responses"—based on ‘he 
interpersonal orientations developed by Sawyer 
(1966)—are of theoretical relevance: 

Self-interest. A self-interested response is one 
in which a person maximizes the amounl of 


reward he obtains for himself. , 
Aliruism. An altruism response is one i 

which a person maximizes the amount of r 

ward that the other person obtains. e 

Role symmetry. Role symmetry is a general 
name for a number of different responses that 
have one quality in common: they take into 
account the rewards of both parties. Two 
prominent role-symmetric responses are ¿9 ual- 
ity—the response that minimizes the differ=nce 
in rewards to the two persons—and reward 
maximization—the response that yields the 
greatest sum of rewards. 

It should be noted that the names of the 
three distribution responses imply no motiva- 
tion on the part of the subject; rather, they 
merely describe his overt behavior. Thus, a 
person may respond self-interestedly because 
he feels he deserves more pay than the other. 
This is the major distinction between the 
distribution responses and the interpersonal 
orientations of Sawyer, which are cognitive 
states. 

The two studies reported here test predic- 
tions generated by equity theory concerning 
the three distribution responses. Study I 
examines the nature of distribution responses 
when the inputs of the subject and his co- 
worker are equal. Study II examines dis- 
tribution responses under conditions where 
inputs of subject and co-worker are varied 
systematically. 


Stupy I* 


As noted above, distribution responses de- 
scribe the possible behavior of persons choosing 


‘Study I is an adaptation of the second author's 
doctoral dissertation presented to the faculty of the 


ative distributions of rewards. 
best to generate a standard set 
‘ions. Study I used the 18 
. distributions presented in 

c 1, and thereafter, the person 
ices is designated as the 
er," is co-worker is designated as 


- first alternative distribution 
3) in all 18 sets divides the 
, $4 for one person and $1 for 

his reason, these alternatives 

designated the standard distribution. Note 
ries X), the standard distribu- 
to the chooser and $1 to the 

c remaining 9 sets (Series Y), the 
‘bution gives $1 to the chooser 

^ceiver. 
ions of rewards, designated as 

c outcomes, are paired with the 
ributions and are presented in 

l 5 of Table 1. Thus, a subject 

«d with Set A of Series X has the 
selecting the standard distribu- 

cives him $4 and his receiver 

y the alternative outcome— 

$3.50 and his receiver $2.50. 

in two dimensions of reward 


The distri 
alternativ 
dard d 


Which gives 
Variatio: 


chooser in the alternative out- 
e greater than that to the receiver 
eries X, Sets A, B, and C; Series Y, Sets G, 
, and I), equal to that of the receiver (Series 
X and Y, Sets D, E, and F), or less than that 
Of the receiver (Series X, Sets G, H, and I; 
Series Y, Sets A, B, and C). 

- Joint welfare refers to whether the sum of 
ewards in the alternative outcome is greater 
an that in the standard distribution (Series 
and Y, Sets A, D, and G), equal to that in 
he standard distribution (Series X and Wy 
Sets B, E, and H), or less than that in the 
Standard distribution (Series X and Y, Sets 
C, F, and T). It should be noted that when 


artment of Psychology, Graduate Division of 
‘Social Sciences, University of Chicago. This research 
L WAS supported by United States Public Health Service 
Grant MH 05350, “Experiments in the Resolution of 
Interpersonal Conflict,” Jack Sawyer, principal investi- 
gator. The generosity and assistance of Jack Sawyer 
acknowledged gratefully. 
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TABLE 1 
Tue 18 SETS or ALTERNATIVE DISTRIBUTIONS 
or REWARDS 
meee St ioe Se 
epit cca orien anaa 
se |. ————— ama mm 
Choosers | Receiver's | Chooser's | Receivers 
reward reward reward reward 
Series X 
A $4 gi EE $2.50 
B $4 $1 | $3.00 $2.00 
e $4 $i | $250 $1.50 
D $4 $1 $3.00 $3.00 
E $4 $1 $2.50 $2.50 
F $4 $1 $2.00 $2.00 
G $4 $1 $2.50 $3.50 
H $4 $1 $2.00 $3.00 
I $4 $1 $1.50 $2.50 
nA ILL LR 
Series Y 
pate yee 2 a r 
A $1 $4 $2.50 $3.50 
B $1 $4 $2.00 $3.00 
[9 $1 $4 $1.50 $2.50 
D $1 $4 $3.00 $3.00 
E $1 $4 $2.50 $2.50 
E $1 $4 $2.00 $2.00 
G $1 $4 $3.50 $2.50 
H $1 $4 $3.00 $2.00 
T $1 $4 $2.50 $1.50 


CENE y S -———— ow 


rewards in an alternative outcome differed, 
they always did so by $1; likewise, when the 
sum of rewards in the standard distribution and 
alternative outcome differed, they always did 


so by $1. 


Measurement of Distribution Responses 


From each person’s choices in the 18 sets, it 
is possible to obtain measures of the degree to 
which he makes each of the three distribution 
responses. This is done by comparing the two 
sets of alternative distributions that are the 
mirror image of each other, that is, the two 
sets where the rewards to the chooser and 
receiver in both distributions are in terchanged. 
In Table 1, the members of a mirror-image pair 
are separated into the two different. series, X 
and Y, but are identified by the same letter. 
For example, Series X, Set B, and Series Y, 
Set B, are the mirror image of each other. For 
each of these pairs, (a) if a person chooses the 
standard distribution in the Series X set and 


4 Trvinc M. LANE AND LAWRENCE A. MrssÉ 


the alternative outcome in the Series Y set, he 
is responding self-interestedly. This is so be- 
cause, in each case, he responds to maximize 
his own reward. (b) If a person chooses the 
alternative outcome in the Series X set and the 
standard distribution in the Series Y set, he is 
responding altruistically. This is so because, in 
each case, he responds to maximize the reward 
of the other. (c) If a person either chooses the 
standard distribution in both the Series X and 
Series Y sets, or he chooses the alternative 
outcome in both series, he is responding role 
symmetrically. This is so because such choices 
take into account the reward of both persons. 


Variables and Hypotheses 
Frequency 


'Taken together, the three propositions of 
equity theory suggest an order in which the 
frequencies of the three distribution responses 
should occur under conditions of equal inputs. 
The first proposition predicts that a person 
should choose the alternative outcome in the 
Series Y sets, since choosing the standard 
distribution of this series would result in an 
inequity against himself. The second proposi- 
tion predicts that a person should also choose 
the alternative outcomes in Series X, since 
choosing the standard distribution of this 
series would result in an inequity in his favor. 
Therefore, since a person should choose the 
alternative outcomes in both Series X and 
Series Y, equity theory predicts that the most 
frequent distribution response should be role 
symmetry. 

Moreover, the third proposition suggests 
that a person should choose the alternative 
outcomes in Series Y more frequently than he 
should in Series X, since he should be more 
tolerant of an inequity in his favor than of an 
inequity against himself. Thus, self-interest 
should be the second most frequent distribution 
response and altruism the least frequent. 


Situational and Personal Variables Affecting 
Distribution Responses 


As noted above, the three propositions pre- 
dict that role-symmetry responses should be 
more frequent than self-interest responses but 
that there still should be a substantial amount 


of the latter. However, equity theory does not 
specify any variables that would affect the 
degree to which the frequencies are different, 
that is, the degree to which a person would 
tolerate an inequity in his favor over an 
inequity against himself. It seems reasonable 
that there should be variables, both situational 
and personal, which would affect such differ- 
ences in the two thresholds. Study I examined 
a number of these variables which, for the 
reasons discussed below, were thought to 
influence distribution responses. The first four 
variables presented below are systematic 
manipulations of different aspects of the situa- 
tion presented to the subjects. The remaining 
variables are a number of personal charac- 
teristics whose relationships to the distribution 
responses were also assessed. 


Situational Variables 


Sex of receiver. Based on role theory (Biddle 
& Thomas, 1966), it is reasonable to assume 
that people behave differently in social situa- 
tions when interacting with persons of the 
opposite sex than they do with persons of the 
same sex. Further, it was felt that the general 
pattern of past behaviors would carry over into 
the present situation, and, therefore, choices 
would differ depending on the sex of the 
receiver. 

Public versus private choice. There is some 
evidence (e.g., Mouton, Blake, & Olmstead, 
1956) that persons are more likely to conform 


to norms when their behavior is public, that is, | 


when their identity is known and their re- 
sponses can be observed by others. From this 
it was predicted that choosers would be more 
likely to conform to the norm of equity and, 
therefore, make fewer self-interested responses 
under conditions of public choice. 

Order of presentation. As noted below, every 
subject responded to all 18 sets of distributions. 
"Therefore, the sets had to be presented in some 
systematic order so as to achieve a high degree 
of experimental control. It was decided to 
randomize the sets in one series, use this same 
order for the other series, present each series 
as a unit, and systematically alternate which 
series was presented first. This was done 
because it was thought that an initial exposure 
to one series might influence the responses of a 
subject to the second series. 
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Real versus hypothetical choices. Since Study I 
examined distribution responses under condi- 
tions of equal inputs, it was necessary that all 
subjects perform exactly the same task. For 
this reason, it was important that the receivers 
as well as the choosers respond to the 18 sets 
of distributions. However, it was equally im- 
portant to have the subjects realize that the 
responses of the receivers would be hypo- 
thetical; that is, they were to act as if they 
were choosers while knowing that their re- 
sponses did not determine rewards. 

This procedure not only optimized experi- 
mental control but it also allowed for the 


examination of possible differences between 
such “real” and "hypothetical" choices. It was 
thought that there would be fewer self-interest 
responses when choices were hypothetical, 
since, in this condition, conforming to the norm 
of equity would cost the subject nothing. 
Personal Variables 

A number of personal characteristics of the 
subjects were examined, mostly by means of 
paper-and-pencil tests. The specific variables 


examined and the rationale for doing so are 
as follows. 

Sex of chooser. Both males and females served 
as choosers. Since a number of studies have 
found that females are both more generous 
(e.g., Sawyer, 1966; Ugurel-Semin, 1952) and 
more conforming to norms (e.g, Bronfen- 
brenner, 1961; Christensen, 1948; Sears, 
Maccoby, & Levin, 1957), it was predicted 
that females would be less self-interested than 
males, 

Differences in background. A number of 
studies (e.g., Sawyer, 1966; Ugurel-Semin, 
1952) indicate that persons with different back- 
grounds exhibit different degrees of concern for 
the welfare of others. For this reason, Study I 
examined 10 background variables: father’s 
occupation, father’s education, mother’s educa- 
tion, number of siblings, religion, pattern of 
attendance at religious services, perceived 
cultural background, perceived social class, 
age, and year in school. 

Personality variables, A number of per- 
sonality variables also were thought to relate 
to differences in conformity to the norm of 
equity, Three paper-and-pencil personality 


inventories, as noted below, were used as the 
nonevaluative task that the subjects performed 
for pay: Survey of Interpersonal Values 
(Gordon, 1960), a personality/attitude sched- 
ule for use in experimental bargaining studies 
(Shure & Meeker, 1965), and the Rand scale 
of philosophical orientations (developed by 
the second author). 

The Survey of Interpersonal Values (SIV) is 
composed of six scales: Support, Conformity, 
Recognition, Independence, Benevolence, and 
Leadership. The Personality/Attitude Sched- 
ule also is composed of six scales: Aggres- 
sive Militarism, Conciliation, Authoritarian 
Nationalism, Risk Avoidance, External Con- 
trol, and Suspiciousness. The Rand scale is 
composed of four subscales: Altruism, Personal 
Freedom, Individualism, and Self-Reliance. 

It was predicted that those variables which 
imply a concern for the welfare of the other or 
a need for normative restraints—that is, Be- 
nevolence, Conformity, Conciliation, and Al- 
truism—should correlate negatively with self- 
interest. Further, it was predicted that varia- 
bles which imply self-interest or a negative 
orientation toward others—that is, Support, 
Recognition, Aggressive Militarism, Authori- 
tarian Nationalism, Suspiciousness, Personal 
Freedom, Individualism, and Self-Reliance— 
should correlate positively with self-interest. 


Method 


Subjects and Recruitment 


Since it was essential that subjects be placed in a 
meaningful situation where their choices actually 
mattered to them, an attempt was made to recruit 
persons who would be interested in a monetary reward. 
Therefore, it was crucial that they not just volunteered, 
but that they volunteered to “make some money.” 
To this end, five undergraduate classes at the Chicago 
Circle campus of the University of Illinois were ap- 
proached for subjects." In each class, students were 
asked to take part in an opinion survey and told that 
participants would be paid up to $4 for their time. In 
this manner, 16 subjects were recruited for each of 
eight sessions. "Therefore, 64 males and 64 females 
served as subjects in Study I. 


5 The help of Roger L. Dominowski and I. E. Farber, 
Department of Psychology, and Robert L. Hall, John 
W. Martin, William S. Place, James Wilkens, and 
Joseph Zelan, Department of Sociology, University of 
Illinois, Chicago Circle campus, in recruiting subjects 
is acknowledged gratefully. 
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Reward Distribution Booklet 


The sets of distributions presented in Table 1, in 
terms of standard distributions and alternative out- 
Comes, were given more neutral labels—Alternative A 
and Alternative B, respectively—and put in the form 
of an 18-page booklet, with one set per page. As noted 
above, the nine sets in Series X were randomized, and 
this order determined the page on which a given choice 
appeared. This same order was used for the Series Y 
sets as well. Thus, a set and its mirror image appeared 
in the same position within each series. 


Other Materials and Instruments 


Personality inventories. Subjects were presented with 
four separate personality instruments. One instrument 
was a 134-item “opinion survey" which asked the 
subjects to respond to each statement on a 7-point 
scale ranging from strong agreement to strong dis- 
agreement. The first 30 items comprised the Rand 
scale.5 The remainder of the opinion survey was com- 
posed of items from the Aggressive Militarism, Concilia- 
tion, Authoritarian Nationalism, and Suspiciousness 
scales of the Personality /Attitude Schedule intermixed. 

A second instrument, the ‘‘temperament inventory,” 
was composed of 15 items from the Suspiciousness and 
Risk Avoidance scales of the Personality/Attitude 
Schedule, and asked the subjects to respond on a 
3-point scale. A third instrument, the "social reaction 
inventory," was the External Control scale of the 
Personality /Attitude Schedule presented as 10 paired 
comparisons. The fourth instrument was the 30-item 
SIV. 

Postexperimental background questionnaire. This in- 
strument asked the subjects to provide a number of 
facts about various aspects of their background. It 
yielded indexes of the background characteristics al- 
ready named. 


Experimental Design 


To insure that choosers fully realized that they were 
paired with a real person, and that the identities of 
those paired were kept from each other when the choices 
were being made, the subjects were run in groups. In a 
given group, half of the subjects were choosers and half 
were receivers; a given chooser, when responding, knew 
that he was paired with one of the receivers, but he 
could not tell exactly which one. 

For reasons of experimental control, every chooser 
within a given group had to be of the same sex. Like- 
wise, the sex of every receiver in a group had to be the 
same. However, the sex of the receivers differed from 
that of the choosers for half of the groups. Also, in a 
given group, all choosers responded under either the 
public- or private-choice conditions. On the other hand, 
the two conditions of order of series presentation, since 
they were less visible, were both presented within a 


6 The scale is a series of statements culled from the 
novels and essays of Ayn Rand. Those readers in- 
terested in obtaining more information about the Rand 
scale can contact the second author, 
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given group. Since each of these variables 
mous, the required a 2X 2X 2 
factorial design, yield 


research 
g 32 cells 


Procedure 


The procedure that follows was u or all eight 


groups of subjects. For each gro jects were 
assigned randomly to alternate si classroom; 
Each subject was given a , a folder that contained 


velope that] 
First, they 


the personality inventories, and a seal 
contained the reward distribution boo 


subjects were asked to open up their folders and coms 
plete the inventories. This task was finished in 35-40 
minutes. 

The experimenter then told the 'up that thg 
remaining time would be spent determining how muchi 
each person in the group was to be paid. Subjects 
opened up their envelopes, and the experimenter ins 
structed them in the use of the reward distributiom 


booklet. He explained that the group 1 been divided 
in half, and that each person in the front, a chooser, 
was to determine both his own payment and that of one 
person in the back, a receiver. 

The experimenter then instructed the subjects in the 
use of the booklet. He told the receivers that they too 
should make choices, but that their responses would be 
hypothetical. He then demonstrated how payments 
would actually be made. Subjects were shown how one 
of each chooser’s responses would be selected by chance, 
and this choice used to pay that chooser and his 
receiver. In the private-choice groups, the experimenter 
indicated that each person’s pay would be placed in @ 
small envelope so that no one would know either the 
chooser-receiver pairs or how much each person was 
paid. In the public-choice groups, the experimenter 
explained that when it was time to pay them, he would 
call up each chooser-receiver pair together, and tell the 
whole group the payment each member was receiving: 

Subjects then went through the booklet, answering 
the question on each page which indicated their choice 
of distributions. The experimenter collected the booklets 
and paid the subjects as demonstrated. The subjects 
completed the background questionnaire while the ex 
perimenter was determining and distributing payments. 
In all, a given session lasted about 1 hour and 30 
minutes. 


Results 


Following the procedure described above, the 
number of role-symmetric, self-interested, and 
altruistic responses was tabulated for each 
subject. Since there were nine sets of mirror- 
image pairs, a subject's score for each distribu- 
tion response could range from 0 to 9, and 
his total across the three types of responses 
always equaled 9. Scores were subjected to 
three types of analyses. Nonparametric tests 


were used to examine differences in frequencies 4 


between the distribution responses. An analysis 
of variance was performed to examine the 


hs 


effects situational variables and the 
persor , sex of chooser. Correlational 
analys verformed to examine the 
relatior veen the distribution re- 
sponse remaining personal variables, 
Frequei istribution Responses 

Since ir responses really determined 
rewards, data from the 32 males and 
32 fema | served as choosers are pre- 
sented. re more alternative outcome- 
alternati me role-symmetric responses 
(312) tl interested responses (240), and 
more s ested responses than altruism 
respons this overall difference in fre- 


quencies tistically significant X*, = 98.2, 


df = 2, 001; Friedman analysis of 
variance eover, the difference in fre- 
quency en role symmetry and self- 
Interest, ı compared alone, was also 
significar 2.02, p < .03, sign test), as 
was the ence between self-interest and 
altruism ses (z = 7.88, p < .00001, sign 
test). 

i In addi here were 18 standard distribu- 
tion-stand distribution role-symmetric re- 
sponses. is significantly less than the 
number of alternative outcome-alternative 
outcome ro metric responses (3 = 7.28, 
$ < .00001, sign test).? 

The reward distribution booklet allows for 


the comparison of the relative strength of the 
"It should be noted that an examination of the 
responses of all 128 subjects in Study I in terms of fre- 
quency produces the same results. Although the abso- 
ps frequenci¢ s change, the relative differences between 
em remain more or less constant. 
Since symmetry is multidimensional, the 
ilferences in frequencies might have been caused by 
ariables other than the concern for equity. Conditions 
Joint and differential welfare could have influenced 
€ relative frequency of the distribution responses. For 
E on the 64 responses to Set E (see Table 1) 
xamined separately, since choices in this pair of 
pou could not have been affected by concerns 
this e ese dimensions of reward structure. Results of 
aon were equivalent to those already 
ee » all 18 sets. There were more alternative 
39) ila Alternative outcome role-symmetry responses 
eee Tesponses (24) and more self- 
Was Pon MAE than altruism responses (0; there 
Tole- a standard distribution-standard distribution 
YR metry response. Morcover, results of statistical 
Performed on these data mirrored those for the 
imd Thus, the results presented above can be 
Preted in terms of the predictions of equity theory. 


role 
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concern for equity with concerns for joint and 
differential welfare through an examination of 
the type of role-symmetric response emitted in 
certain sets of alternatives. If a subject makes 
an alternative outcome-alternative outcome 
response in Sets C, F, and I (see Table 1), he is 
indicating a stronger concern for dividing 
rewards equally than for maximizing the sum 
of rewards. On the other hand, a standard 
distribution-standard distribution response 
would indicate a stronger concern for joint 
welfare than for equality. In these sets, the 
subjects made 84 role-symmetric responses; of 
these, 67 were alternative outcome-alternative 
outcome responses, and 17 were standard 
distribution-standard distribution responses, a 
significant difference (p < .01, sign test). 

Likewise, an alternative outcome-alternative 
outcome response in Sets G, H, and I would 
indicate a stronger concern for equal rewards 
than for not allowing the other to receive the 
greater reward. Of the 75 role-symmetric re- 
sponses in these sets, 62 were alternative 
outcome-alternative outcome choices, and 13 
were standard distribution-standard distribu- 
tion choices, a significant difference (p < .01, 
sign test). 


Situational Variables and Sex of Chooser 


Since there were so few altruism responses, 
role symmetry and self-interest were almost 
perfectly inversely correlated (r =— .988). 
Because of this, only self-interest responses are 
examined in terms of situational and personal 
variables. However, it should be kept in mind 
that the opposite of any statement made about 
self-interest is true of role symmetry. 

As previously noted, the receivers were asked 
to pretend that they were choosers and to 
respond to the distributions. However, the 
conditions under which they responded could 
not be completely comparable to those for real 
choosers since the hypothetical responses of the 
receivers were always made as if in the private- 
choice condition. Therefore, an analysis of 
variance was performed on the self-interest 
scores of the 64 choosers, classified by sex of 
receiver, public versus private choice, order of 
series presentation, and sex of chooser. These 
results are discussed below in terms of each 
of the independent variables of the analysis. 


TABLE 2 


MEAN NUMBER OF SELF-INTEREST RESPONSES 
CLASSIFIED BY SEX OF RECEIVER AND 
PUBLIC VERSUS PRIVATE CHOICE 


Public versus private choice 


Sex of receiver 


Public Private 
ae eee ae 
Same as chooser 4.50 3.44 
Different from chooser 2.56 4.50 


Note,—N = 64. 


Sex of receiver. The Sex of Receiver X Public 
versus Private Choice interaction was signifi- 
cant (F = 5.02, df = 1/48, p < .05). Table 2 
presents the mean number of self-interest 
responses classified by these variables. An 
analysis of the simple effects (Winer, 1962) 
revealed that in the public-choice condition, 
choosers made fewer self-interest responses 
when paired with receivers of the opposite sex 
(p « .05). 

Public versus private choice. Two first-order 
interactions—with sex of receiver (see above) 
and order of series presentation (F = 4.51, 
df = 1/48, p < .05)—involving public versus 
private choice were significant. Tables 2 and 3 
present the mean number of self-interest re- 
sponses classified by the appropriate variables. 
Analyses of the simple effects revealed that (a) 
when the sex of the chooser was different from 
that of the receiver, self-interest scores were 
lower in public than in private (p < 025); and 
(b) when choosers started with Series X, self- 
interest scores were also lower in public 
(p < .05)? 

Order of series presentation. As already noted, 
the Public versus Private Choice X Order of 
Series Presentation interaction was significant 
(see Table 3). Analysis of the simple effects 
revealed that in the public-choice condition, 
choosers who started with Series X were less 
self-interested than were those who started 
with Series Y (p < .01). 

Sex of chooser. The main effect for sex of 
chooser was significant (F = 5.33, df = 1/48, 
p < 05). The mean number of self-interest 
responses was less for female choosers (2.97) 
than for male choosers (4.53). 


?The probability values are one-tailed, since the 
hypothesis about public versus private choice was 
directional. 
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TABLE 3 


Mean NUMBER or SELF-INTEREST RESPONSES 
CLASSIFIED BY PUBLIC VERSUS PRIVATE 
CHOICE AND ORDER OF SERIES 
PRESENTATION 


Public versus private choice 
Series presented 
rst 


Public Private 
x 2.19 4.00 
X 4.94 3.88 


Note,—N = 64, 


Real versus hypothetical choices. As noted 
above, real versus hypothetical choices was 
confounded with public versus private choice. 
However, it seemed of interest to examine the 
overall effect of this variable, at least for the 
private-choice-condition groups. Subjects who 
made real choices had higher mean self-interest 
responses than did subjects making hypotheti- 
cal choices (£ = 2.43, df = 126, p < .02); the 


mean values were 3.90 and 2.40, respectively." 


Background and Personality C haracteristics 


Nine background characteristics and the 16 
personality variables were correlated with the 
number of self-interest responses made by each 
subject. None of the nine correlations of 
background characteristics with the number of 
self-interest responses was significant. Three 
of the six with the SIV scales, Support 
(r = .166), Recognition (.182), and Benevo- 
lence (—.204), were significant. Two of the six 
with the Personality/Attitude Schedule scales, 
Aggressive Militarism (.188) and Conciliation 


w As with frequency, it was possible that the situa- 
tional variables were affected by the expression of con- 
cern with conditions of reward structure rather than 
concern with equity. For this reason, the responses of 
the 64 choosers to Set E were examined separately with 
respect to the four situational variables through an 
analysis of variance. The pattern of results was equiva- 
lent to that of the analysis of variance of scores derived 
from all 18 sets. The only differences were that in this 
analysis, the main effect for order of presentation was 
significant (p < .05), and the Public versus Private 
Choice X Order of Series Presentation interaction was 
marginally significant (p < .08). However, since these 
tests were performed on dichotomous data, and prob- 
ably violate the assumptions of analysis of variance, it 
was thought best to present the results for all 18 sets 

11 Both real and hypothetical choices were included in 
the analysis to maximize the sample size. 
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(—.271), were significant. Three of the four 
with the Rand scale variables, Altruism 
(—.169), Individualism (.160), and Self-Re- 
liance (.200), were significant.” 

Discussion 
Frequencies of Distribution Responses 


The hypothesis generated from the three 
propositions of equity theory stated that the 
number of role-symmetry responses should be 


greater than the number of self-interested re- 
sponses, which in turn should be greater than 
the number of altruism responses. Results of 


Study I provide strong support for this hypo- 
thesis and, thus, direct support of the three 
propositions. The hypothesis concerning fre- 


quency of distribution responses was based on 
the assumption that role symmetry, as defined, 
is a response that takes into account the re- 
wards of both parties. It could be, however, 
that role-symmetry responses are motivated by 
a need to be consistent rather than a concern 


about rewards. A high need for consistency 
might motivate subjects to choose the same 


alternative in both series, Results of Study I 
Indicate that such an alternative interpretation 
1s implausible. First, if consistency were a 


primary motivational determinant of choices, 
subjects should have made as many standard 
distribution- standard distribution responses as 
they did alternative outcome-alternative out- 
come responses; results indicate that this was 
hot so. Second, such an interpretation would 
make it difficult to explain why subjects who 
Started with Series Y tended to choose the 
alternative outcome in that series and then 
Switched to standard distribution choices in 
Series X, while subjects who started with 
Series X tended to choose the alternative 
Outcome in both series, Results also indicated 
that the concern with equity is more motivat- 
Ing than concerns with maximization of joint 
and differential welfare, unlikely findings if 
Consistency were a salient concern. 
to ser the reasons already stated, responses n pee 
background a Mns separately A terms of possible 
and personality correlates, Again, the same 
m emerged. Seven of the eight personality charac- 
Cant correlate significantly with total. «eu 
Significant], eao correlate "urs x — 
correlation i bhy ; self-interest "s 5 y Aat re 
tion, failed 1 i : support, althoug in the 
; o reach significance. 


As noted, past studies of equity have 
examined situations where inequities already 
existed. For the most part, these studies have 
supported at least the first two propositions of 
equity theory. Study I extended such research 
to that situation where the generation of 
inequity was in the hands of the participants, 
and it provided strong support for equity 
theory. 


Situational Variables 


As already mentioned, Proposition 3 of 
equity theory implies that there should be 
variables which affect differences in the thresh- 
olds of the two types of inequities; that is, in 
terms of distribution responses, there should be 
variables which affect the frequency of 
self-interest. 

The examination of the four situational 
variables yielded results of relevance to this 
hypothesis. The observed effects of both sex of 
receiver and public versus private choice 
indicate that one determinant of the difference 
in thresholds is the role relationship of the 
participants. Choosers were less self-interested 
when receivers were of the opposite sex, but 
only in the public-choice condition. This im- 
plies that college students are highly concerned 
with cross-sex relations, and try to “put their 
best foot forward” when their behavior is open 
to public scrutiny. These results are also con- 
gruent with the idea that surveillance is an 
important dimension of conformity behavior. 

The results relevant to real versus hypo- 
thetical choices are also of theoretical interest. 
They imply that the degree of difference 
between the two inequity thresholds is related 
positively to the cost of an equitable response. 
Moreover, the results for order of series pre- 
sentation imply that response set might affect 
differences in thresholds. A self-interested re- 
sponse set may have been induced when the 
subjects first were presented with Series Y 
alternatives, since, for these choices, equity 
and self-interest were congruent. 

Thus, the results of Study I imply that there 
are a number of situational dimensions which 
affect conformity to the norm of equity. They 
provide direct evidence that role relations, 
surveillance, cost, and response set are among 
these dimensions. 
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Personal Variables 


Sex of chooser. As predicted, females were 
less self-interested than males. This finding is 
congruent both with past research (e.g., 
Ugurel-Semin, 1952) and the scientific and 
popular conception of the female sex role in 
Western culture; that is, females are both more 
generous and more conforming than males. It 
is further evidence that role relations are à 
determinant of differences in the inequity 
thresholds. 

Background characteristics. As noted, none of 
the correlations of self-interest responses with 
indexes of background characteristics was 
significant. As with all negative results, such 
findings are difficult to interpret. A reasonable 
explanation is that some of the variables 
examined are irrelevant to the norm of equity. 
Others might have been relevant, but no 
differences were found because the sample was 
too small and/or homogeneous. 

Personality variables. Eight of the 16 per- 
sonality variables correlated significantly with 
number of self-interest responses. All of these 
were variables that were predicted to be 
related to distribution responses. Moreover, 
these results were congruent with the hypoth- 
esis that personality characteristics implying 
a concern for others (e.g., Benevolence, Con- 
ciliation, and Altruism) should correlate nega- 
tively, and characteristics that imply a self- 
interested orientation (e.g., Support, Recogni- 
tion, Individualism, and Seli-Reliance) or a 
negative attitude toward others (e.g., Aggres- 
sive Militarism) should correlate positively 
with self-interest responses. 

An unexpected finding was that there was 
not a negative correlation between Conformity 
(as measured by the SIV) and self-interest 
responses. It could be that the norm of equity 
is almost universal for middle-class persons. 
Thus, persons who tended not to conform to 
other norms (eg, those concerning sexual 
behavior) might still conform to equity. On the 
other hand, it could be that conformity, as 
operationalized in the SIV, is somewhat 
different from the meaning of the term as used 
in Study I." 

1 There is evidence that this latter explanation has 
some validity. Lichtman and Lane have unpublished 
data that indicate a significant negative correlation 
between self-interest and importance of obedience (as 
measured by the Rokeach, 1968, Value Survey). 
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To summarize briefly, the results of Study I 
indicate that when inputs are equal, persons. 
distribute rewards as predicted by equity 
theory. Moreover, there seem to be a number 
of situational and personal variables that are 
related to differences in the thresholds of 
response to the two types of inequity. 


Srupy H 


In Study I, all subjects had identical inputs. 
However, equity theory places a great deal of 
importance on the comparison of differential 
inputs. Therefore, the design of Study IL 
basically using the method of Study I, 
tematically varied the work quantity 
quality of each chooser and receiver to investi- 
gate their effects on the choice of rewards. 

For each type of participant, choose and 
receiver, two levels of inputs, low h 
were used. Thus, four experimental 
were generated: Condition 1: the cho 
high inputs, and the receiver has low 
Condition 2: both the chooser and the receiver 


have high inputs; Condition 3: both the 
chooser and the receiver have low inputs; 


Condition 4: the chooser has low inputs, 
the receiver has high inputs. 


Hypotheses 


Equity theory states that persons will feel 
dissatisfied with a distribution of outcomes 
that is not commensurate with comparative: 
differences in inputs. Therefore, it was pre- 
dicted that a subject will choose that distribu: 
tion of rewards which is most congruent with | 
the differences in his inputs and those of h 
receiver. More specifically, three hypothes 
follow from this principle. 


1. When the inputs of the chooser a 
receiver are equal, as in Conditions 2 and 


u Study II is an adaptation of the first author 
doctoral dissertation presented to the faculty of 
Department of Psychology, Michigan State Univers! 
This research was funded by grants from the Uni! 
States Air Force Office of Scientific Research (F446 
69-C-0114) and the Department of Psycholo! 
Michigan State University. The authors wish to th 
Ronald Cohen and Melvin Goldstein for their help Y 
running subjects. 

15 Note that Condition 2 was more or less a replicatio 
of the general situation used in Study I. Subjects} 
Study I worked for somewhat less time than those 
Condition 2 of Study LL, but they were given no re 
to believe that their inputs were not high. 


uld make alternative outcome- 
come responses, since such 
ovide the most equal distribu- 


the subject 
alternative 
choices wot 


tion of rew 

2, Wher nputs of the chooser are greater 
than those > receiver, as in Condition 1, 
the subje uld choose self-interestedly, 
since such 'es would give greater reward 
to the pers 10 did more and better work, in 
this case t! ooser. 

3. Wher inputs of the chooser are less 
than those :e receiver, as in Condition 4, 
the subjec uld choose altruistically, since 
such choices would also give greater reward to 
the person who did more and better work, but, 
in this case, that person is the receiver. 

£ Method 
Recruitmen! and Subjects 

The subjects were 58 male undergraduates at 

Michigan 


announceme 
stated that ; 
industrial oj 
be paid for t: 


Instruments 


Four instruments were used: a reward distribution 
booklet, a istrial opinion questionnaire, an attitude 
nd a student background questionnaire. 
bution booklet, The reward distribution 
2 educed from 18 to 6 pages by elimination 
of joint welfare as a reward structure variable. Thus, 
Sets B, E, and H of Series X and Y (see Table 1) were 
used. In all other respects, the instrument was the same 
as that used in Study I. 
Industrial opinion questionnaire. There were two 
forms of the industrial opinion questionnaire. Form 1 
Contained four short essay questions that asked the 
subjects their opinions concerning a number of problems 
in the business world. Form 2 contained 12 such ques- 
tions. The number of questions in each form was 
Selected so that subjects would work different lengths 
of time, 30 minutes or 90 minutes, and, in this way, 
inputs were varied systematically. 
Altitude questionnaire. The main purpose of the 
Seven-item attitude questionnaire was to determine 
the degree to which inputs had been manipulated 
Successfully, 
Student background questionnaire. The student back- 
ground questionnaire contained items that pertained to 
c subject’s educational background and work history. 
€ subject’s responses to this instrument were u 
un to manipulate work quality and as the basis for 
€ assignment of persons to tasks of different lengths. 
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Variables and Experimental Design 


Four variables were manipulated in Study IL. Two 
of these, (a) hypothetical versus real choices and (b) 
order of presentation, were also manipulated in Study 
I and were included here mainly for reasons of replica- 
tion and control. The other two variables, inputs of the 
chooser and inputs of the receiver, were the main focus 
of Study II. 

Inputs of the chooser. There were two conditions of 
inputs of the chooser, Choosers with high inputs were 
told that, based on their responses to the student back- 
ground questionnaire, they were expected to produce 
high-quality answers to the industrial opinion question~ 
naire. Also, they were given Form 2 of the industrial 
opinion questionnaire. The experimenter told choosers 
in the low-input condition that low-quality answers 
were expected of them, and they were given Form 1 of 
the industrial opinion questionnaire to complete, 

Inputs of the receiver. Inputs of the receiver were 
manipulated in the same manner as were inputs of the 
chooser. 

Experimental design. Thus, four situational conditions 
were varied systematically. This produced a factorial 
design whose dimensions were 2 (inputs of the chooser— 
high versus low) X 2 (inputs of the receiver—high 
versus low) X 2 (choices—hypothetical versus real) X 2 
(order of presentation Series X presented first versus 
Series Y presented first). 


Procedure 


All of the students in the industrial psychology class 
where subjects for Study II were recruited. compl 


the student background questionnaire at least 2 weeks | 


before they were asked to participate in the 

All subjects, who were assigned randomly to con- 
ditions, participated in two sessions. In the first session, 
the subjects were given the appropriate form of the 
industrial opinion questionnaire and told whether they 
were expected to produce high- or low-quality answers. 
All subjects were required to stay the time allotted to 
complete their form of the questionnaire. Thus, the 
subjects in the low-input condition left after 30 minutes, 
and those in the high-input condition left after 90 
minutes. 

That afternoon, the subjects returned, were given 
the reward distribution booklet, and, depending on 
their input condition and the requirements of the experi- 
mental design, were assigned the role of chooser or 
receiver. With two exceptions, a procedure equivalent 
to that used in the private-choice condition of Study I 
was followed. First, before the subjects responded to 
ihe reward distribution booklet, the experimenter 
announced the amount of time the choosers and re- 
ceivers had worked on the industrial opinion question- 
naire. Second, the subjects in Study IL completed the 
attitude questionnaire instead of the background 
questionnaire used in Study I. 


Results 


The number of role-symmetric, self-inter- 
ested, and altruistic responses for each person 
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TABLE 4 
CELL MEANS FOR SELF-INTEREST SCORES 
Real choices Hypothetical choice: 
Chooser has Receiver has | 
Series X Series Y Series X Series Y 
presented first. presented first. presented first | presented first 
High inputs High inputs 156) 3.00) 800 254 
Low inputs 3.00% 2,006 2.00 | 2.33) 
Low inputs High inputs 2254) 2.00% 50%) 
Low inputs 2.33(m) 2.330) 2.006 | 
Note.— Since the self-interest scores for each subject could vary only from 0 to 3, the means presented in this table likewise 
could vary only from 0 to 3. For convenience and ease of identification in later discussion, each cell mean has a letter next to it, 


was computed following the procedure de- 
scribed in Study I. Since there were no 
altruistic responses, role symmetry and self- 
interest were perfectly inversely related. For 
this reason, for the most part, only self-interest 
is referred to in the following sections. There- 
fore, as in Study I, the opposite of any state- 
ment made concerning self-interest is true of 
role symmetry. 


Effects of Situational Variables 


"The effects of the four situational variables, 
inputs of the chooser, inputs of the receiver, 
real versus hypothetical choices, and order of 
presentation, on the manifestation of self- 
interest were determined by a four-way un- 
weighted-means analysis of variance." The 
main effect for inputs of the receiver was 
significant (F = 17.58, df = 1/42, p < .01), 
as was that for order of presentation (F — 4.85, 
df — 1/42, p « .05). In addition, the Inputs of 
the Receiver X Order of Presentation inter- 
action was significant (F — 6.35, df — 1/42, 
p < 05), as was the Inputs of the Receiver 
X Real versus Hypothetical Choices X Order 
of Presentation interaction (F = 5.04, df 
= 1/42, p< .05). Finally, the third-order 


16 It should be noted that, in Study II, all role- 
symmetric responses were alternative outcome-alterna- 
tive outcome choices; that is, there were no stan- 
dard distribution-standard distribution role-symmetric 
responses. 

17 Both real and hypothetical choices were included in 
the analysis since, unlike Study I, there was no con- 
founding of this variable with other manipulations. 
Therefore, the responses of the receivers to the reward 
distribution booklet were classified as responses of 
choosers making hypothetical responses in the ap- 
propriate input conditions. 


interaction was significant (F = 10.14, df 
= 1/42, p < .01). Therefore, tests of simple 
effects were performed to determine under 
which conditions each of the four situational 
variables had a significant effect on the mani- 
festation of self-interest. Table 4 presents th 

cell means for self-interest scores; for con 
venience and ease of identification in latei 

discussion, each cell mean has a letter next to it 

Inputs of the chooser. Analysis of the simple 
effects revealed one significant difference for 
inputs of the chooser (between Cells d and | in 
Table 4). When receivers had high inputs and 
choosers made hypothetical responses starting 
with Series Y, the subjects were less self- 
interested when they had high inputs than low 
inputs (p < .05). 

Inputs of the receiver. Four simple main 
effects for inputs of the receiver were significant 
(p < .05). These tested differences between 
Cells a and e, d and h, i and m, and k and o in 
Table 4. All differences were in the same direc- 
tion: the subjects made less self-interested 
responses when their receivers had high inputs 
than when they had low inputs. 

Real versus hypothetical choices. One simple 
main effects for real versus hypothetical choices 
was significant (p< .01). This difference, 
between Cells b and d in Table 4, indicates 
that when both choosers and receivers have 
high inputs and Series Y is presented first, the 
subjects were less self-interested when their 
choices were hypothetical than when they 
were real. 

Order of presentation. Analysis of the simple 
effects for order of presentation revealed three 
significant differences (p < .05). These differ- 
ences (between Cells a and b, i and j, and k and 
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}) were all in the same direction: the subjects 
who were presented with Series X first were 
less self-interested than those presented with 
Series Y first.’ 


Validity of Manipulations 


On the attitude questionnaire, the subjects 
were asked to indicate, on a 5-point scale, the 
degree to which they thought their work 
quantity difiered from that of their receiver. 
Their responses, Classified by experimental 
condition, are summarized in Table 5, in terms 
of percentages. Table 6 presents the same 
information with respect to work quality. Both 
tables indicate that subjects in three of the four 
conditions (1, 2, and 3) reported very little, if 
any, distortion of inputs. On the other hand, 
the subjects in Condition 4 reported percep- 
tions of inputs which differed from the reality 
imposed on the situation by the experimenter. 
n this condition, almost all of the choosers felt 
at their answers were of the same quality as 
the receivers. In addition, over one-third of 
these choosers reported working at least as 
long as their receivers. 


Discussion 
Effects of Inputs 


Taken together, the results for inputs of 
chooser and inputs of receiver provide insight 
into those conditions under which the norm of 


equity is a mediator of behavior. There is 
strong evidence that a chooser in distributing 
rewards to himself and another individual, is 


guided only by the other's inputs. Choosers 
lend to give themselves as much reward as 
possible unless they perceive that their receiver 
has worked hard enough to deserve more than 
minimal pay. It is as if the inputs of the 
receiver must exceed a threshold value before 
4 chooser feels that his receiver deserves more 
than this minimum. If this threshold value is 
exceeded, then equity motives tend to domi- 
nate. On the other hand, if the inputs of the 


"To insure that the results presented above were 
not influenced by concerns other than equity, a similar 
analysis of variance was performed only on the re- 
sponses to Set E. Results of this analysis mirrored those 
eg on all the sets. The only difference was that 

€ third-order interaction was now marginally signifi- 
cant (p < .06). 


TABLE 5 
PERCEIVED DIFFERENCES IN WORK QUANTITY 


As compared to his receiver, 
a chooser reported working 
Experi- T 
mental 4 times ce 2) About | About + 
condition |asmuch| times the }to} |asmuch 
or more | acmuch| me |as much| or less 
% of responses 
1. High-low* 14 86 0 0 0 
2. High-high 0 6 94 0 0 
3. Low-low 0 0 100 0 0 
4. Low-high 0 7 33 60 a 


a This indicates that the high-inputs group determined the 
rewards for themselves and their receivers in the low-inputs 
group. 


receiver are less than this value, subjects tend 
to maximize their own rewards. The relation- 
ship between inputs and choice behavior is seen 
clearly in Table 7 where the percentage of role- 
symmetry and self-interest responses is classi- 
fied by experimental condition. Inspection of 
this table reinforces the idea that differences 
between conditions with respect to distribution 
responses occur only as à function of inputs of 
receiver and not at all as a function of inputs 
of chooser. 

The original hypotheses generated from 
equity theory assumed an interaction between 
the inputs of both participants. The results of 
Study II suggest that such an assumption is 
untenable. Two of the conditions (1 and 2) 
produced results that apparently were con- 
gruent with the interactive model of equity 
theory. However, à simpler model, which 
considers only the inputs of the receiver, would 
make the same predictions for these two con- 
ditions and also would predict correctly the 


TABLE 6 
PERCEIVED DIFFERENCES IN Work QUALITY 


As compared to his receiver, a chooser 
felt the quality of his work was 
Experi- C E - 
"onsid- | Some- | About | Some- Consid- 
don erably what the what erably 
= better | better | same worse | worse 
% of responses 
1, High-low 14 50 36 0 0 
2. High-high 0 17 71 12 0 
3. Low-low 8 0 92 0 0 
4. Low-high o 7 86 7 0 


ul ol ditti ne a 


bon ds 
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TABLE 7 


PERCENTAGE OF ROLE SYMMETRY AND 
SELF-INTEREST RESPONSES 


GA D 


] 
| 76 d 
Cusdiion| e Ei | M eo I LR 
| chooser receiver | 7 esponses, | responses 
1 High | Low 21 79 
2 High High | 61 39 
3 Low Low 25 75 
4 Low High | 62 38 


ote.—The category of altruism has been omitted from this 
thie because no altruism responses occurred in any condition. 


remaining results. Thus, a simplified equity 
theory seems more valid, at least for situations 
such as those studied in the present research. 
There are, however, alternative explanations 
that can account for the findings of the present 
research within the framework of equity theory 
as originally formulated. One of the hypotheses 
of equity theory is that subjects can distort 
reality to alleviate feelings of dissatisfaction 
produced by perceptions of inequity. This 
mechanism seemed to operate in Condition 4 
where choosers worked one-third as long as 
their receivers, but indicated on the attitude 
questionnaire that they did as much work of 
the same quality as did their receivers. Given 
that equity theory makes predictions on the 
basis of perceived inputs, the results for Condi- 
tion 4 could be interpreted as supportive of it. 
Distortion of reality, at most, can be only a 
partial explanation of why the results of Study 
II are in part contrary to the predictions of 
equity theory. The theory also did not predict 
correctly the behavior of choosers in Condition 
3, where the perception of inputs was not 
distorted. In this condition, both chooser and 
receiver had low inputs, and, therefore, neither 
deserved the $2.50 average payment. Choosers 
in this condition might have felt that $1 was 
equitable compensation, and, therefore, they 
chose to give their receivers that amount, while 
keeping $4 for themselves. Equity theory might 
predict, however, that subjects in this condition 
should also have chosen role symmetrically in 
order to have alleviated feelings of dissatis- 
faction produced by perception of overpay- 
ment. In any event, it is clear that a simplified 
equity theory must remain a plausible alterna- 
tive explanation of the behavior of persons in 
situations where rewards are distributed. 


Leventhal and Michaels (1969) also manipu 
lated inputs and examined differences in the 
allocation of rewards. However, their research 
had a different specific focus from the present 
research. They examined differences in rewa 
allocation as a function of the degres 
actual inputs confirmed expectatio: 
Study II exdmined differences pe: 
actual inputs of the participants. Because of 
this difference in foci, the results of the two 
studies are not entirely comparable. However, 
a comparison of results, where appropriat 
raises three points of interest. 


dyad of Leventhal and Michaels' 
were perfectly inversely correlated; that is, 
the subject did more than was expected, his 
partner had to do less. Such a Broce D 


complex model of equity theory, m 
viewed as somewhat tentative; that. 


both their own and the other's inputs, but 
their data do not permit a direct test of this 
proposition. 

Moreover, the results of Study II indicate 
that only inputs of the co-worker affected the 
distribution of rewards. This simpler model can 
also explain Leventhal and Michaels’ results; 
subjects gave the other the greater reward (and 
therefore gave themselves the least reward) in 
the condition where the other had the highest: 
inputs, and they gave the other the least 
reward in the condition where he had the 
lowest inputs. Therefore, given their findings, 
it is difficult to determine which version of 
equity theory best accounts for the behavior of 
the subjects in their specific situation. 

Second, the results of Leventhal and Mi- 
chaels’ study do not support Proposition 3 
of equity theory, while the results of Study TE 
(and Study I) of the present research do. 
Leventhal and Michaels indicate that when 
subjects performed better than their co-worker, 
they allocated 60% of the reward to them- 
selves; when the co-worker performed better, 
they allocated 60% of the reward to him. 
Equity theory predicts, however, that there 
should have been a difference in these values, 
such that subjects should have allocated à 


n to themselves when they per- 


higher propo 
han to the co-worker when he 


formed bet: 


performed 

The results -ê Study II, on the other hand, 
indicated subjects were more self-in- 
terested w! e inputs of the other were low 
(Condition ind 3) than they were role 
symmetric the inputs of the other were 
high (Cond 2 and 4). Thus, there were 
more equit esponses when the alternative, 
inequitable :e favored the co-worker than 
when it fa | the subject.” These findings 
support Pr ition 3, 


lanation of why the results of 
ies differed in their support of 
Proposition > lies in the difference in procedure 
used to induce subjects to participate. Leven- 
thal and Michaels (1969) used extra course 
* credit as an inducement; the present research 
recruited ects by the offer of money. 
Subjects recruited for extra credit may not 
have been very interested in the slight differ- 
ences in reward among which Leventhal and 
Michaels’ procedure permitted them to choose. 
In fact, all of their subjects agreed to forego 
payment since, as the authors explain, “they 
had not come to the experiment with the 
expectation of being paid [p. 306].” On the 
other hand, differences in payment should 
have been more salient to the subjects in the 
present research since money was their only 
reward. It seems reasonable that subjects in 
the present research should have indicated a 
greater concern for their own welfare (vis-à-vis 
that of the other) than should have subjects in 
Leventhal and Michaels’ study. 
Third, the results of the two studies, taken 
together, raise an interesting empirical ques- 
tion: Are the effects of inputs due to absolute 
or relative differences? The explanation ad- 
vanced in Study II to account for the effects of 
inputs of the receiver (and the lack of effect of 
Inputs of the chooser) implies that absolute 
level is the important dimension; that is, a 
given receiver's performance must exceed a 
threshold value before the chooser perceives 
that the other deserves more than minimal pay. 
meine Comparison was not reported in the Raus 
fnis y Say S since it was just a replication es 
of any xen y I; it did not involve the manipu » 
E itional independent variables and is some 
hat irrelevant to the focus of Study II. However, the 
erence was significant (t = 1.90, p < .05). 


A possib! 
the two st 
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On the other hand, subjects in Leventhal 
and Michaels’ research were led to believe that 
the other worked at most for 19 minutes. In 
terms of absolute time, these referents had 
lower inputs than did the receivers in the low- 
input conditions of Study II. Even so, relative 
differences in inputs of these referents did 
affect behavior. These results imply that the 
threshold criterion used by the chooser, if it 
exists, changes from situation to situation. 

A complex version of equity theory implies 
that this threshold varies as a function of the 
chooser’s own inputs. It could be, however, 
that other variables are relevant. For example, 
the structure of the situation as defined by the 
experimenter might indicate to the subjects 
what this threshold should be. In Study I, 
subjects were told that working for 30 minutes 
was less than optimal performance, while in 
Leventhal and Michaels’ study, working for 
19 minutes was sometimes more than adequate. 
Thus, the performance of the other could be 
judged in terms of what the experimenter said 
was good or bad without reference to one's 
own performance. This explanation is con- 
gruent with the simpler version of equity 
theory advanced above. 


Replication and Control Variables 


As noted, two of the variables which were 
examined in Study I were included in Study IT 
mainly for reasons of replication and control. 
The results of the two studies were very 
similar with respect to the effect of order of 
presentation. To a lesser extent, the findings 
with respect to real versus hypothetical choices 
were also similar. Moreover, the results of the 
two studies, where comparable, are strikingly 
similar with respect to the frequency of the 
distribution responses. Table 8 presents the 
percentage of each type of distribution re- 
sponse made by subjects in Condition 2 of 
Study II and males in Study I» 

In summary, the findings of Study 2 support 
equity theory only in part. Moreover, where 


2% Of the conditions in the two studies, these were 
selected because they seemed most comparable. Con- 
dition 2 had both choosers and receivers work about the 
same amount of time as did the subjects in Study qe 
Moreover, since Study II used only male subjects, only 
the responses of males in Study I were included in the 
present comparison, 
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TABLE 8 
PERCENTAGE OF DISTRIBUTION RESPONSES MADE BY 
MALE Susyecrs IN Stupy | AND SUBJECTS IN 
Convition 2 or Srupx Il 


Distribution response 


Sample 
Rol Self- 4 
symmetry | interest | Altruism 
Males in Study I 60 39 1 
Subjects in Con- 
dition 2 of Study 
Il 61 39 0 


comparable, they replicate well the results of 
Study I. 


CONCLUSIONS 


It was noted in the introduction to this 
paper that men can live together because they 
develop norms, or rules of conduct, that guide 
their interpersonal relations. Such rules can 
only serve their purposes to the extent that 
members of the group are willing to conform 
to them. The present research indicates that 
persons do conform to the norm of equity, and, 
thus, it does have the power to affect behavior. 

It might be argued that the situation used to 
study distribution responses presented subjects 
with trivial decisions; that is, their behavior 
might have been very different if they had to 
choose among distributions of rewards that 
involved thousands of dollars, or even just 
hundreds of dollars. This is certainly a possi- 
bility, but it is somewhat beside the point. 
Persons in real life make relatively few decisions 
that would be equivalent to choosing among 
distributions that involve hundreds of dollars. 
Instead, they tend to make many rather 
pedestrian decisions which certainly affect 
welfare but, when taken alone, do so to a small 
extent. It is precisely these sorts of everyday 
decisions—hopefully modeled by the sets of 
alternative distributions—which are mediated 
by the norm of equity. Certainly, the results 
of the present research imply that, among 
other reasons, men can live together and func- 
tion in groups because they conform to a norm 
which says that they should consider the 
welfare of others. 
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EFFECTS OF FURNITURE ARRANGEMENT, PROPS 


AND PERSONALITY ON SOCIAL INTERACTION ' 


ALBERT MEHRABIAN ? Axp SHIRLEY G. DIAMOND 


University of California, Los Angeles 


Three experiments explored the facilitating and inhibiting effects of various 
furniture arrangements and two types of objects on social interaction. The 
first two experiments included all possible combinations of four distances and 
three orientations for the seating of dyads who were left alone to wait in a 
room. The results showed no main effects of distance on amount of affiliative 
behavior. However, both experiments showed that the less direct orientations 
(such as when sitting side by side on a couch) were less conducive to con- 
versation, and particularly inhibiting for the otherwise more sociable pairs— 
that is, those in which at least one member received a high score on a 
questionnaire measure of affiliative tendency. In the first experiment, which 
included a measure of relaxation, the subjects relaxed more as distance be- 
tween them increased. The third experiment explored the combined effects of 
a sculpture and a puzzle poster on affiliative behavior. Preoccupation with the 
puzzle was found to be an inverse correlate of affiliative behavior. Pre- 
occupation with the sculpture was not a significant correlate of affiliative 
behavior, but it did facilitate interaction between subjects who were sensitive 
to rejection. These same subjects spent significantly less time examining the 


puzzle than did subjects who were insensitive to rejection. 


Three experiments explored the facilitating 
or inhibiting effects of furniture arrangement 
and “conversation pieces” on the affiliative 
behavior of pairs of strangers. Affiliative be- 
havior is comprised of (a) various indicators 
of amount of conversation, such as number 
of statements per minute, and (5) cues com- 
municating positive affect, such as verbal re- 
inforcers, eye contact, head nodding, and 
gesturing (Mehrabian, 1971; Mehrabian & 
Ksionzky, 1970). 

Most studies in microecology have focused 
on furniture arrangement, warmth of a 
setting, its homelike versus institutional at- 
mosphere, or its sociofugal-sociopetal quality, 
that is, the degree to which it discourages or 
encourages social interaction (e.g, Craik, 
1970; Maslow & Mintz, 1956; Sommer, 1962, 
1967, 1969). Sommer (1969, p. 68) showed 
that dyads tend to place themselves at about 
a 90-degree orientation when they wish to 


1This study was supported by National Science 
Foundation Grant GS-2482, and in part by a grant 
from the National Science Foundation for under- 
graduate research participation to the Department 
of Psychology, University of California, Los Angeles. 

2 Requests for reprints should be sent to Albert 
Mehrabian, Department of Psychology, University 
of California, Los Angeles, California 90024. 
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interact cooperatively. The typical dis! 
which social interaction occurred w | 
range of 4-12 feet (Hall, 1966). Sommer 
(1969, p. 66) found that the upper limit for | 
comfortable conversation between two people 

was 54 feet. Rosenfeld's (1965) subjects as- | 
sumed a 41-foot distance from another person 

when they were instructed to be friendly and 
a distance of about 8 feet when instructed to ' 


seem unfriendly. The implications of these 
findings are that the optimal furniture ar- 
rangement for affiliative behavior is one that 
places the members of a pair at a 90-degree 
orientation relative to one another, approxi- 
mately 44 feet apart. Accordingly, this was 
one hypothesis for the present study. 

Earlier studies of the effects of seating ar- 
rangement on social interaction were largely 
correlational and observed the relation be- 
tween type of social activity (e.g., cooperation 
or competition) and spontaneously chosen 
seating positions. However, the relation of 
seating arrangement and social behavior can 
be affected by a variety of attitudinal and 
situational factors. When a subject is allowed 
to select his own seating position, he tends to 
sit closer to those he likes, sit closer to those 
who are more similar to himself, or sit closer 
and at right angles to others in cooperative 


g., Mehrabian, 1969a; Sommer, 
vhen subjects are allowed to 
select their seating, it is not 
nees can be made from the 
between seating arrange- 
interaction. 
ind II below controlled some 
to permit a less ambiguous 
the effects of seating arrange- 
e of seating was experimen- 
and unacquainted subjects 
minimize the effects of 
iking. In addition, two ques- 
ality measures that relate to 
vior (affiliative tendency and 
cjection; Mehrabian, 1970) 
l to explore possible differences 
of seating arrangement for 


of the personality variables on 
or were expected, based on a 
iven bv Mehrabian and Ksionzky 
l that model, affiliative behavior is 
hypothesized io be a direct correlate of affili- 

ve tendency and an inverse correlate of 
ction. The hypothesis for the 
ure arrangement has already 
no hypotheses were proposed 
ive effects of personality and 


noted, and 


II explored limited aspects of 
nrichment, Sommer (1969, pp. 
86) described a study in which a rather 
en institutional lounge was made more 
elike. The changes included (a) objects, 
uch as flowers and vases that could have 
the topic of comment and conversation; 
b) distractors or inhibitors of affiliative be- 
-havior such as magazines and books; and 
€) a furniture arrangement that was de- 
Signed to enhance affiliative behavior. He 
pound a substantial increase in affiliative be- 
Javior following the rearrangement; how- 
T, since a large number of variables were 
ged, it was not clear which of these 
instrumental in producing the increase. 
Tefore, Experiment IIT studied the con- 
ution of only a small subset of these vari- 
S—conversation pieces. There are two 
Bories of so-called conversation pieces, 
that does indeed facilitate conversation 
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fore, the experimental manipulation included 
two kinds of objects that were designed to 
function separately as a conversation facili- 
tator (an abstract sculpture) and an inhibitor 
(a difficult puzzle poster). 

It was hypothesized that a conversation 
facilitator would have a greater effect on the 
affiliative behavior of those who were sensitive 
to rejection. The rationale for this hypothesis 
was that if a partner did not respond to one's 
comment about the prop (sculpture), it would 
be less painful than if he similarly ignored a 
more personal remark directed to himseli— 
the former being a more ambiguous rejection. 
Accordingly, persons who were sensitive to 
rejection could use the prop as a testing 
ground (e.g, "That's sure an odd looking 
thing") to lead into other topics (e.g., “What 
are you majoring in?"), provided that they 
were first reassured by a positive reaction. To 
enhance the effects of the personality factors, 
the positive contribution of the sculpture was 
counteracted by introducing a distractor 
puzzle, meant to serve as an inhibitor of 
affiliative behavior. Thus, in a setting that 
provided both objects, personality attributes 
could maximally contribute to what use they 
might be put. 

Postexperimental questionnaires in Experi- 
ments II and III also explored accuracy of 
the memory of the experimental room and 
judgment of the pleasantness of that room. 
The memory measure, obtained following the 
waiting period, examined differential pre- 
occupation with people versus things as a 
function of the personality variables. When 
others are present, do persons scoring high 
on a measure of affiliative tendency attend 
less to the physical aspects of their environ- 
ment, and therefore remember less of it? Or, 
conversely, do those who score low on both 
affiliative tendency and sensitivity to rejection 
attend more to the physical environment? 
The second measure, judged pleasantness of 
the room, was used to assess whether positive 
or negative feelings aroused by interaction 
with a stranger generalized to the room in 
which that interaction occurred. It was 
hypothesized that subjects who are more 
sensitive to rejection find interaction with a 
stranger more unpleasant, and that this feel- 
ing is generalized to the place of interaction. 


ay, 
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EXPERIMENT I: EFFECTS or DISTANCE AND 
ORIENTATION ON AFFILIATIVE BEHAVIOR 


This experiment explored the effects on af- 
filiative behavior of four distances and three 
orientations in seating arrangement. The 
dyads were strangers and thought that they 
were waiting for the experiment to start. At 
the conclusion of this waiting period, person- 
ality measures of affiliative tendency and 
sensitivity to rejection were administered to 
the subjects. 


Method 


Subjects. The majority of 302 subjects were paid 
$1 each and were recruited by advertising in the 
campus newspaper for an experiment in “Human 
Relations.” Half of the subjects represented a wide 
variety of academic majors, and the remainder were 
nonstudents. Data from 144 male and 144 female sub- 
jects were analyzed, and the data from 7 pairs were 
discarded. One or both members of the latter pairs 
had remarked during their interaction that they were 
being observed. 

Procedure. Upon their arrival, the subjects reported 
to different locations, and pairs were brought together 
for the first time as they entered the experimental 
room. The experimenter instructed them as follows: 
“(Subjects name), would you sit in this chair 
please? (Other subject's name), would you sit here? 
Td like you to remain seated—I’ll be back in a few 
minutes," 

The experiment was conducted in a 24 X 27 foot 
room, window shades drawn, under constant moder- 
ate illumination. Writing tables were employed to fill 
all the floor space except for a 12 X 20 foot area in 
which the chairs were placed. Two walnut pull-up 
armchairs of Scandinavian design were used as seat- 
ing in all conditions. The chairs had open wood 
arms and blue upholstered backs and seats. 

The experimental conditions were all possible com- 
binations of four distances and three orientations of 
the two chairs. The four distance conditions, as 
measured from the center of the seat of one chair 
to the same point on the other, were 3, 4.5, 6, and 
9 feet. The three orientation conditions were O, 90, 
and 180 degrees, representing the sum of the angles 
that both subjects would have to turn in order to 
assume a face-to-face position. For instance, the 
O-degree orientation was a face-to-face position, and 
the 180-degree orientation was parallel seating, as 
on a couch. 

A one-way mirror, together with a concealed 
microphone and tape recorder, permitted observation 
and recording of the interaction between the pair. 
An observer scored relaxation cues (ie. arm- and 
leg-position asymmetry and reclining angle) at the 
end of the first, third, and fifth minutes of the 
interaction period. Scoring criteria and rationale for 
computing total relaxation from these cues were 
provided by Mehrabian (1969b). 


After 5 minutes had elapsed, the experimenter re- 
entered and led the pair of subjects to another room 
where they responded to questionnaire measures of 
affiliative tendency and sensitivity jection 
(Mehrabian, 1970). When they had finished this task, 
the subjects were debriefed and paid. 

The audio recordings of each pairs interaction 
were subsequently scored for the following: (a) total 
statement rate—the number of simple sentences or 
independent clauses of each subject ; minute; 


(b) percentage of speech duration—t rcentage 


of the 5-minute interaction period duri rich the 
subject spoke; (c) percentage of speech duration for 
the pair; (d) verbal reinforcer rate—the number of 
“uh-huh,” “yes,” or other agreements g to the 
partner; (e) latency of the initial statement—the 
duration of silence of each subject befi s first 
statement in the interaction; (f) positive verbal 
content—rated on a 5-point scale; and (g) negative 
verbal content—also rated on a $-point scale. 


Results and Discussion 


'The intercorrelations among the seven de- 
pendent measures taken from the audio re- 
cordings are given in Table 1. Total statement 
rate (ie. simple sentences and independent 
clauses per minute), speech duration of a 
subject, speech duration of the pair, frequency 
of verbal reinforcers, and a separate index 
of positive verbal content were positively 
intercorrelated, and all of these were nega- 
tively correlated with latency of a subject’s 
initial statement. 

These results support the hypothesis of 
Mehrabian and Ksionzky (1970), that com- 
munications of positive feelings and amount 
of conversation are correlated and together 
define an important factor of social behavior 
(referred to as “affiliative behavior"). These 
data also indicate that negative contents in 
communication are not part of this factor. 

There are three redundant measures of 
amount of conversation in Table 1: total 
statement rate, percentage of speech duration, 
and percentage of speech duration of the pair. 
To avoid disproportionate contribution from 
this source to a composite index of affiliative 
behavior, the best single measure of amount 
of conversation, total statement rate, was used 
in computing the following index where all 
variables were first normalized: Affiliativé 
behavior = total statement rate + positive 
verbal content + verbal reinforcer rate — 
latency. 


Some EFFECTS ON SOCIAL INTERACTION 21 


=, 


6. Latency 
7. Negative verbal content 


TABLE 1 
INTERCORRELATIONS AMONG SOME DEPENDENT MEASURES OF EXPERIMENT I 
Measure | 2 | "e 3^ 5 V we 6 7 icy 
a | | 
T. Total statement rate .86 46 56 = 
2. Percentage of speech duration of the subject 82 45 * - t E" 
3. Percentage of speech duration of the pair 58 50 — 46 20 
5. Positive verbal content A2 —.39 03 
5, Verbal reinforcer rate —26 —.01 
—.19 


A second composite index was the sum of 
the normalized values of arm-position asym- 
metry, leg-position asymmetry, and reclining 
angle, and represented a total postural relaxa- 
tion score for each subject. Negative verbal 
content scores were analyzed separately as 
the third dependent measure. 

Stepwise regression analyses explored the 
main and interactive effects of the following 
set of furniture arrangement and personality 
factors in determining each of the three de- 
pendent measures: distance (D), orientation 
(0), affiliative tendency (Ri), sensitivity to 
rejection. (Rs), partner's affiliative tendency 
(R;), and partner's sensitivity to rejection 
(Ry’). The details of such a multiple-regres- 
sion procedure, which tests main as well as 
Interaction effects, were described by Cohen 
(1968). The advantage of such an analysis 
for the present data was that it eliminated 
the necessity of partitioning the personality 
Measures as would have been the case if 
analysis of variance had been used. Equations 
1, 2, and 3 below summarize the .05 level sig- 
nificant effects from the regression analy- 
Ses for each of the three dependent measures. 
The coefficients in each equation (for Equa- 
tions 1, 2, and 3, R — 41, 28, and .16, re- 
Spectively) indicate the relative contributions 

Of the various terms, since all the variables 

are normalized. 


Affiliative behavior = 
2IR; — 14R; + 21R/ — .16R/ — 23(0) 
[1] 


Postural relaxation = 


.20(0) + .15D + .18DR:R> [2] 


[3] 


Negative verbal content = .16D 


(C 


Note.—Correlations in excess of ,16 are significant at the .01 level (df = 286). 


The first four effects of Equation 1 provide 
confirmation for the model of affiliative behav- 
ior proposed by Mehrabian and Ksionzky 
(1970). According to that model, a person's 
affiliative behavior is an increasing function 
of his affiliative tendency and a decreasing 
function of his sensitivity to rejection, the 
former being the stronger effect. Experiment I 
contained sufficient information to test an 
additional prediction from the model, that 
more reinforcing targets elicit more affiliative 
behavior. The reinforcing quality of each sub- 
ject’s partner was assessed indirectly from the 
partner’s scores on the two measures of affili- 
ative tendency and sensitivity to rejection 
(R;' and Ro’). Such an inference was justified 
on the basis of data provided by Mehrabian 
(1971). The results in Equation 1 support the 
additional prediction from the model by show- 
ing that subjects did indeed exhibit more af- 
filiative behavior with their more reinforcing 
(high R,’) partners and less with their less 
reinforcing (high Ro’) partners. 

For furniture arrangement, Equation 1 
shows that orientation alone is the most im- 
portant single determiner of affiliative behav- 
ior. Seating arrangements which faced the 
subjects away rather than toward one another 
had an inhibitory effect on conversation. In 
particular, the cell means of Table 2 for the 
three degrees of orientation show that al- 
though there was no significant difference in 
the effect of a face-to-face relative to a 90- 
degree orientation, there was a significant 
drop in affiliative behavior at the 180-degree 
orientation. 

Cell means for the effects of Equation 2 
are also given in Table 2. Relaxation was 
found to increase linearly with increasing in- 
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TABLE 2 
CELL MEANS FOR SOME EFFECTS OF EXPERIMENT I 
No. obser- 
Depend 
pene mgamre | vations ja x 
Affiliative behavior Dive 
Orientation 96 0° = zd 90° = .26; 180° = p 
Postural relaxation 
Orientation 96 = ET 90° = —.01; 180° = 23 
Distance 72 3' = —.22; 4.5’ = .09; 6' = —.06; 9 = .2 
tt i í 
DR:Ry’ 18 High Re Low R 
D High R” | LowR/ | HighRy’ | Low 
9 — 20e —.20 —.08 
4.5' .16 —.2 .38 
6 —.06 03 —.04 1 
9! .60— —.27 10 
Negative verbal content 
Distance 72 3! = =—.27;4.5' = —.07; 6 = .16; 9% = .19 


Note.—Arrows connect cell means that, based on the Newman-Keuls test, differ significantly at the .05 level. 


directness of orientation and increasing dis- 
tance. Immediacy in social interaction is de- 
fined as the degree of physical proximity 
and/or perceptual availability of two persons, 
and therefore subsumes close positions and 
more direct orientations (Mehrabian, 1969b; 
Wiener & Mehrabian, 1968). Thus, the main 
effects of Equation 2 show that the postural 
relaxation of participants increased with non- 
immediacy. However, the DRoRo’ interaction 
in Equation 2 shows that only the pairs who 
were matched on sensitivity to rejection were 
more relaxed in the nonimmediate 9-foot dis- 
tance than in the immediate 3-foot position. 
Members of unmatched pairs showed no in- 
crease in relaxation with increasing distance 
from their partners; increasing indirectness 
of orientation was the sole determiner of 
relaxation for these pairs. 

Equation 3 summarizes the results obtained 
for negative content. In scoring negative 
verbal content, low scores were assigned for 
indirect and vague communications of affect, 
and high scores were assigned to statements 
directed to the addressee’s person. Thus, com- 
plaints about school were assigned low 
negative content scores, complaints and nega- 
tive remarks directed at the immediate situa- 


tion received intermediate scores, si 
cisms or insults directed to the 
(which occurred rarely) received the hig 
scores. The scoring of these more tangential 
communications of affect was necessary be 
cause more subtle and indirect indicator 
(e.g, complaints about the situation or ever 
more remote problems) provide one of the fe 
sources for assessing negative reinforcers. 

The first effect of Equation 3 and the con 
responding means in Table 2 show a linea 
trend of increasing negative verbal contents 
with increasing distance. Lesser immediati 
afforded by greater distance seems to hav! 
facilitated the expression of negative feeling 
and resentment. This finding is consisten 
with the immediacy hypothesis of Wiene 
and Mehrabian (1968): Negative affect E 
more readily communicated when there is les 
immediacy between the communicator 4 
addressee. 


EXPERIMENT IT: Errrcrs or DISTANCE A 
ORIENTATION ON SOCIAL INTERACTION— 
A REPLICATION 


This experiment was largely motivated b 
the failure to find a main effect of distam 
on affiliative behavior in Experiment I. 
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this replication, the room where the subjects 


interacted was smaller and seemed more 
appropriate : less obviously a laboratory 
situation. T ne-way mirrors were camou- 
flaged.? Also > instructions were modified 
to provide t bjects with a reason for wait- 
ing together —they were waiting for the ex- 
perimenter epare materials for a music 
listening stu Thus, the interaction period 
was more 7 ıl and less stressful for the 
subjects, since it eliminated much of the 
ambiguity uncertainty inherent in the 
first procedure, which gave no reason for the 
waiting px and which was labeled 
“Human Relations.” 
Method 

Subjects. I were 144 male and 144 female 
University of California undergraduates who served 


in partial ‘ent. of introductory psychology 
Course requir 

Procedure ich subject arrived, he was guided 
to a booth in assistant of the experimenter, 
where he waited alone. When there were two sub- 


jects of the 
menter led bx 
anteroom of t 


sex waiting separately, the experi- 
of them from their booths to the 
experimental room and said: "You 
are going to listen to some music and give us your 
opinion of it, Would you leave your books on the 
table here and come along with me?” Passing a 
tape recorder on a cart, she opened the door to 
the experimental room. There were two chairs in the 
toom which were arranged in any of the 12 possible 
Combinations of four distances and three orientations, 
as in Experiment I. 

The chairs were centered on the carpeted 9.5 X 20 
foot room. Two one-way mirrors, concealed by 
framed abstract transparencies, permitted observa- 
tion and audio recording of the interactions from 
an adjacent room. As they entered the room, the 
subjects were told: “Please have a seat. Tl be in 
as soon as I fix the tape.” The experimenter shut 
the door to the room as she left and allowed 5 
minutes to elapse. She then returned wheeling the 
tape recorder, played 30 seconds of jazz music, and 
asked the subjects to rate it for pleasantness. Sub- 
jects were next led to another room where they re- 
Sponded to several questionnaires. These included rat- 
ings of the pleasantness of the room in which they 
interacted, a multiple-choice questionnaire of mem- 
ory for various aspects of that room, and the scales 
Of affiliative tendency and sensitivity to rejection 
(Mehrabian, 1970). Affliative behavior was scored 
ftom audio recordings of the interaction, as in 
Experiment T, 

— 


8 

i Unfortunately, this laboratory space and the fur- 
hishings were not available at the time when Experi- 
ment I was run. 


Results and Discussion 


For each dependent measure, a regression 
analysis was used to explore the main and 
interactive effects of all the following vari- 
ables: orientation (O), distance (D), affili- 
ative tendency (R;), sensitivity to rejection 
(R3), partner's affiliative tendency (R;'), and 
partner's sensitivity to rejection (R2'). The 
results of these regression analyses are re- 
ported in the following equations which con- 
tain the .01 level significant effects and 
where all the variables are normalized. There 
were no significant .01 level effects for the 
judged pleasantness of the room (R = .46 and 
.18 for Equations 4 and 5, respectively). 


Affiliative behavior = .22 Ry + .18R;' 

— .27(O) + .22(0)RiRy’ — .25DR2R2’ [4] 

Accurate memory of the room 
=-—.18R. [5] 

The cell means for some effects of Equa- 
tion 4 are given in Table 3. Consistent with 
the results of Experiment I, affiliative behav- 
ior decreased with indirectness of orientation, 
and there was no main effect for distance. 
Once again, distance did not interact with 
orientation to determine affiliative behavior. 
Since the size of the room in this case was 
considerably smaller than in Experiment I, 
the absence of findings for distance ruled out 
the possibility that the results of Experiment T 
were invalid or valid only for very Jarge 
spaces. Contrary to the hypothesis, then, it 
is concluded that for a single dyad in a room, 
variations in distance ranging from 3 to 9 
feet are of no consequence for affiliative be- 
havior, The hypothesized effect of distance 
may be more appropriate when there are 
more than two persons in the same room. 

As in Experiment I, subjects and partners 
who scored higher on a test of affiliative tend- 
ency were found to produce, and elicit, more 
affiliative behavior. The effects of speaker and 
listener sensitivity to rejection that were ob- 
tained in Experiment T were not corroborated 
in Experiment IT. This difference in the re- 
sults for sensitivity to rejection is interpreted 
in terms of the greater stress that subjects 
may have felt in Experiment I, which was 
labeled Human Relations. Whereas in Experi- 
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TABLE 3 
CELL MEANS FOR AFFILIATIVE BEHAVIOR AS A FUNCTION OF THE EXPERIMENTAL CONDITIONS 
IN EXPERIMENT II 
Dependent. No. obser- 
measure vations in M 
(Source) each cell 
i ew 
Orientation 96 0° = .31 90° = .00 180° = —.30 
ORiRY 24 High Ri Low Ry 
o High Ri | Low Ri HighR/ |  LowRr 
0* .64— 22—, 34—, .04 
$21 
90* 71 —.14 —.32 | —.28 
180° wate = 45 —.65— .00 
DRR 18 High Rə Low Ra 
D High R4 | Low Ry’ High R/ | Low Ri! 
3 67 a =—.25— 12 
3 
4.5' —.06 21 Al 02— 
$ 
6' .24 —.46 04 .07 
1 1 1 
9' =—3 dic Ac —.63 


Note.—Arrows connect cell means that, based on the Newman-Keuls test, differ significantly at the .05 level. 


ment I the subjects did not know why they 
were waiting, in Experiment II they were 
told that the experimenter would soon return 
with the music. The greater uncertainty and 
laboratory-like atmosphere of Experiment I 
may well have accentuated the concerns of 
those who were sensitive to rejection, and 
thus may have led to their greater hesitation 
to engage in affiliative behavior. If measures 
of relaxation had been taken in Experiment 
TI, it would have been possible to check this 
interpretation by comparing the relaxation— 
tension of those subjects who were sensitive to 
rejection in Experiments I and II. 

The remaining results for affiliative behav- 
ior are interactions among the personality 
measures, distance, and orientation. The 
OR,R,’ effect shows that when at least one 
member of a pair had scored high on a mea- 
sure of affiliative tendency, affiliative behavior 
decreased with indirectness of orientation. 
Variations in seating arrangement had no 
significant effect on the affiliative behavior of 
pairs who both had scored low on the measure 
of affiliative tendency. Thus, the affiliative 
behavior of those who had scored higher on 


the measure was more susceptible to 
facilitating and inhibiting effects of furniture 
arrangement. 

The DR:R»’ interaction for affiliative b 
havior in Equation 4 is reminiscent of a simi 
lar interaction obtained for postural relaxa 
tion in Experiment I. Although these eff 
were not fully replicated in both experiment 
they do complement one another, and they al 
low an interpretation of the results. Affilia! 
behavior is particularly uncomfortable 
dyads consisting of strangers who are bo 
sensitive to rejection. Thus, interaction 1 
avoided at larger distances and accompa 
an increase in postural relaxation (comfo UU 
When both members of a pair are insensitiV 
to rejection, they are apparently less res| 
sive to the implicit social pressures to in 
act, regardless of distance and particularly à 
larger distances where they can be indifferen 
to the other person and relax. * 

Finally, Equation 5 indicates that tho 
who were more sensitive to rejection remer 
bered less about the room in which they int 
acted. They seem to have been more 
occupied with interpersonal cues, such as tl 
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TABLE 4 


)RRELATIONS AMONG MEASURES OF AFFILIATIVE BEHAVIOR IN EXPERIMENT III 


Measure | 2 | 3 | 4 | 5 6 
1. Total sta t rate 86 57 45 —.23 —.28 
2. Percenta f speech duration 53 .29 -21 —.27 
3. Positive verbal content AT -31 —.28 
4. Verbal reinforcer rate -.19 -.32 
5. Latency -20 
6. Percentage of duration of the observation of the puzzle 
Note.—Wit f = 174, correlations in excess of .19 are significant at the .01 level. 
behaviors of the partner that could indicate val. As in Experiment II, the remaining 
acceptance or rejection. dependent measures were scored from the 


ExPerIMeNnt III: EFFECTS oF Props AND 
PERSONALITY ON SOCIAL INTERACTION 


This experiment explored the simultaneous 
impact of two conversation pieces on the so- 
cial interaction of persons differing in affili- 
ative tendency and sensitivity to rejection. It 
was expected that one of these objects (the 
sculpture) would facilitate conversation, 
whereas the other (the puzzle) would inhibit 
it The simultaneous introduction of both 
objects was designed to maximize individual 
differences. The manipulation was based on 
the assumption that different persons would 
use these two objects to either actively avoid 
or overcome the initial awkwardness of inter- 
action with a stranger. 


Method 


Subjects. The subjects were 176 University of Cali- 
fornia undergraduates, 88 males and 88 females, who 
Participated in the experiment in partial fulfillment 
of introductory psychology course requirements. 

Procedure. The procedure and the experimental 
room were the same as in Experiment II, except 
that the subjects always sat 4.5 feet apart and at 
an angle of 90 degrees. Further, in the prop condi- 
tion, an abstract sculpture and a puzzle poster were 
hung from the walls in front and to the right and 
left of the two chairs, The sculpture and poster 
Were absent in the no-prop condition, The dependent 
measures were the same as in Experiment II, with 
the addition of those noted below. 


Results and Discussion 


_For data analysis, the scores for the dura- 
tion of observation of the sculpture and the 
Puzzle were transformed into “percentage of 
duration” scores based on the $-minute inter- 


audio recordings of the interaction. In addi- 
tion to the counts of the total number of 
statements for each subject, separate counts 
were made of each subject’s statements 
about the puzzle and his statements about 
the sculpture. 

A factor analysis of the dependent mea- 
sures yielded the variables in Table 4 as the 
primary factor. The correlations of Table 4 
corroborate those given in Table 1 and show 
that the total statement rate is the best single 
index of affiliative behavior. Further, each of 
the measures of affiliative behavior was in- 
versely correlated with the duration of obser- 
vation of the puzzle. The total score for 
affiliative behavior was computed as in Experi- 
ments I and II. Although the percentage of 
duration of the observation of the puzzle 
could have been included in this “affiliative 
behavior” index, it was analyzed separately 
because it was more desirable to (@) have 
the same composite affiliative behavior mea- 
sure for all three experiments and (5) sepa- 
rately examine the role of the puzzle poster 
in the situation. 

Regression analyses were used to explore 
each dependent measure as a function of 
the following factors and their interactions: 
the experimental manipulation, which was the 
presence (P — 1) or absence (P = —1) of 
the sculpture and puzzle; the subject's aífili- 
ative tendency (Ri), his sensitivity to rejec- 
tion (Rs), his partner's affiliative tendency 
(R,’), and his partner’s sensitivity to rejec- 
tion (Rs). (For Equations 6, 7, 8, and 9, 
R = 25, .26, .54, and .72, respectively.) 


Affiliative behavior = .21R, + .14R4 [6] 
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Total statements per minute 


= 16R; + .13R;’+ 17PR;Rs! [7] 


Percentage of duration of observation of 
sculpture 
= 36P + .27PRaRe’ + .27PR2Re’ 


= 36P + 27R,R; (1+P) [8] 


Percentage of duration of observation of 
puzzle 
= .70P — .14Rs' (1 + P) — 2IR3R/ 


(+P) [9] 


The results for affiliative behavior corrobo- 
rate those given in Equation 4 for Experiment 
II. The role of the sculpture and the puzzle, 
however, is further clarified through a sepa- 
rate examination of the total statements per 
minute, the number of statements about the 
sculpture, and the number of statements about 
the puzzle. The results for total statements 
per minute are given in Equation 7, and 
those for the latter two showed only main 
effects due to prop (ie. the subjects con- 
versed more about the props when these were 
present than absent, and there were no other 
main or interaction effects for these mea- 
sures). 

Equation 7 shows that the total statement 
rate was an increasing function of the sub- 
ject’s affiliative tendency and his partner's 
affiliative tendency, with a significant third- 
order interaction involving both their sensitiv- 
ity-to-rejection scores. Thus, as in Experi- 
ments I and II, the subjects scoring high on 
affiliative tendency both produced and elicited 
more conversation. The cell means for the 
triple interaction are given in Table 5, in 
which high and low Rə designate subjects 
having above and below average sensitivity- 
to-rejection scores, respectively. 

These results showed that although the 
props did not serve as conversation pieces in 
general, they did enhance conversation when 
both subjects were sensitive to rejection. Tt is 
also seen that when props were absent, various 
pairings of subjects had about the same 
amount of conversation, except for those pairs 
in which both members were insensitive to re- 
jection. These pairs evidently felt no pressure 
to talk even in an environment devoid of a 
legitimate escape from interaction—the puz- 


zle. 
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The results given in Equations $ and 9 
show that, as expected, the sculpture and the 
puzzle did serve distinctly different func 
in the waiting situation. Although personali 
variables did not significantly affect the com 


puzzle, they did significantly affect the obser 
vation of both objects. Equations 8 and 9 
the related cell means of Table 5 show oppo 
site results for the sculpture and the puzzle 
When both members of the pair were 
to rejection, there was a marked increase it 
observation of the sculpture and a decrease 
the observation of the puzzle. The sculptum 
attracted attention and somehow stimulati 
interaction, at the same time minimizing pii 
occupation with the puzzle. Such preoccupi 
tion was a nonaffiliative behavior, as evidence 
by the intercorrelations of Table 4. Perh 
the subjects who were especially sensitive f 
rejection also avoided the puzzle because thi 
were reluctant to chance failing in the pres 
ence of another. 

The role of the sculpture is clarified fu 
by the results for verbal reinforcers (for 
tion 10, R = .37): 


Verbal reinforcers per minute 
= 22 Ro + .24PRe + 25PR:R/ 


[10 


Subjects who were sensitive to rejectios 
were able to use verbal reinforcers more ofte 
when props were present, and they did so eve 
more when their partners were also sensitiv 
to rejection. One interpretation of these effe 
is that verbal reinforcers were used to conv 
the impression of amiability and to avoid col 
flict and possible rejection. Since the pa 
who spent more time observing the sculptu 
were also those who conversed more, it fol 
lows that even though they did not talk mol 
about the sculpture, it somehow facilita el 
their conversation. For example, a subject 
who was sensitive to rejection may have i 
tially preoccupied himself with the sculptu 
and ventured a remark about it, at whid 
point he was promptly reinforced if 
partner was also sensitive to rejection. Ha 
ing been reassured that his partner would n 
be rejecting, he could have proceeded to di 
cuss other topics. In comparison, other pa 
were just as likely to comment on the scul 
ture as those who were sensitive to rejecti¢ 
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TABLE 5 
CELL MEANS FOR SOME [INTERACTIONS or ExrrxDukxT LII 
adi No. obser- 
5 dent measure vations in M 
TT IRI ET E: 
PRRY 22 High Ry ! Low Ry 
High, | 2 e —22 
1 
Low Rs -2 -41 
High R: 05 A8 
No props 
Low R; | 28 es —,22 
Percentage ution of observation of sculpture* 
RRs 22 High Ry’ | Low Ry 
High Rs 43 e A3 
Props 1 1 
Low R: I € 62 
Percentage of duration of observation of puzzle* 
PRR: 2 High Ry | Low Ry’ 
l EP AE ei aed edo 
| High R: | -H «e BA 
| Low Rs .99 75 
Verbal reinforcer rate ^ 
PR:R,’ 2 High Ry’ | Low Ré | 


Accurate memory of room 
iR: 


Note.—Arrows connect cell means which, based on £ tests, differ significantl 
In the absence of the sculpture and the puzzle, the four cell means are equat. 


but they were not necessarily reinforced for 
any such comment. 

Equations 11 and 12 below (R= .26 and 
25, respectively) contain the results for ac- 
curate memory and judged pleasantness of 
the room, 


Accurate memory of the room 
= —.14P — 16R; + .18R; Ra 


Judged pleasantness of the room 
= —.19R» T A7PRi 


[11] 


[12] 


.. The results for the memory measure sum- 
marized in Equation 11 show a main effect 


Poco PPS 


A haiz * vr 


High Ry 
Low Ry 


at the .05 level. 


for the prop. Preoccupation with either the 
sculpture or the puzzle understandably de- 
tracted from accurate recollection of the con- 
tents of the room. The results of greater in- 
terest relate to the personality measures and 
show that the effects of Equation 11 do not 
corroborate those from Experiment II given 
in Equation 5. However, the main effects of 
both equations show that subjects who were 
more preoccupied with interpersonal cues (i.e; 
those who scored higher on affiliative tendency 
or sensitivity to rejection) had a less accurate 
recollection of the room. Also, the cell means 
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for the R,R» interaction of Equation 11 given 
in Table 5 show that those who scored lower 
on both sensitivity to rejection and affiliative 
tendency had the most accurate memory. 
Judged pleasantness of the room was an 
indirect measure of the subjects’ stress or dis- 
comfort in the experimental situation. The 
assumption was that, through generalization, 
an unpleasant interaction would contribute to 
a negative judgment of the room. Experi- 
ment II did not support that hypothesis, but 
Experiment III showed that those who were 
more sensitive to rejection did indeed judge 
the room as less pleasant. This is consistent 
with the expectation that persons who are sen- 
sitive to rejection are more anxious in the 
presence of strangers, In contrast, subjects 
with high affiliative tendency judged the room 
as more pleasant, but only when the props 
were present. This indicates that for them, it 
was the objective aspects of the environment 
and not internal cues of anxiety or discomfort 
that determined their reaction to the room. 


GENERAL DISCUSSION 


The results of these experiments were con- 
sistent with the general expectation that the 
physical environment can affect patterns of 
social interaction. Affiliative behavior among 
strangers was characterized in terms of 
amount of conversation and cues communicat- 
ing positive attitude (e.g., positive verbal con- 
tent or verbal reinforcers). The data provided 
support for the model of affiliation proposed 
by Mehrabian and Ksionzky (1970) and indi- 
cated that affiliative behavior was a correlate 
of generalized expectations of positive rein- 
forcement in interpersonal relationships (af- 
filiative tendency), and was even greater when 
the targets were actually more reinforcing. 
Further, in Experiment I, affiliative behavior 
was an inverse correlate of generalized nega- 
tive expectations (sensitivity to rejection) 
and was inhibited even more by less reinforc- 
ing targets. 

In one important respect, the effects of 
sensitivity to rejection on affiliative behavior 
were not corroborated across experiments. The 
main effects of sensitivity to rejection on af- 
filiative behavior that were obtained in Ex- 
periment I were not replicated in Experiments 


II and III. This discrepancy was interpreted 
in terms of the more stressful quality of Ex- 
periment I, which accentuated the effects of 
sensitivity to rejection. 

For the effects of furniture arrangement, 
the findings show that the distance between 
two persons was less important than their 
relative orientation in determining the extent 
of their affiliative behavior. In Experiment II 
affiliative behavior decreased linearly with 
less direct orientation. In the corresponding 
effect from Experiment I (Equation i, Table 
2), the cell means for the 0- and 90-degree 
orientations did not differ significantly. The. 
combined results of both experiments, how- 
ever, did indicate a consistent trend: There 
was less affiliative behavior as a function of 
less direct orientation, such that the 180- 
degree, side-by-side, position was quite detri- 
mental to conversation. This finding clia)lenges 
the widespread use of sofas or coucnes in 
rooms intended for social use. 

There was an added refinement in the ef- 
fects of orientation. It was necessary that at 
least one subject in a dyad score high on af- 
filiative tendency for the orientation manipu- 
lations to have an effect on affiliative behavior. 
This result suggests that although furniture 
arrangement can encourage or discourage | 
affiliative behavior between persons who are 
generally sociable, it has neither effect on the 
less sociable pairs. 

The finding that distance played no signifi- 
cant part in determining the amount of inter- 
action between pairs of subjects was contrary 
to expectations. Possibly the fact that these 
dyads were “alone together" in the room over- 
rode the distance variations; that is, irrespec 
tive of distance, there was still one other per- 
son in the same room to contend with. Con- 
ceivably, the results could be different if more 
than two persons were in the same room, in 
which case distance might have the hypothe- 
sized effect. 

The concept of immediacy was used to con- 
ceptualize joint effects of distance and ori- 
entation. Mehrabian (1969b) defined immedi- 4 
acy as the degree of physical proximity and/ 
or conceptual availability of two persons 10 
one another. In the only experiment where 
relaxation was measured (Experiment I), re 
laxation decreased with greater immediacy i? 


^ 
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the positioning of the dyad. This effect was 
explained in terms of the increasing social 
pressure to interact with the stranger, due to 
the increased confrontation associated with 
greater immediacy. The DR3R' effect on re- 
laxation in Experiment I and on affiliative 
behavior in Experiment II helped to further 
clarify this interpretation. Persons who were 
generally more uncomfortable about associ- 
ating with strangers (e.g, pairs who were 
both high on sensitivity to rejection) avoided 
affiliative behavior and were more relaxed at 
greater dist from each other. Apparently, 
the 9-foot distance afforded a greater oppor- 
tunity to avoid social interaction, and this 
opportunity seized by pairs whose mem- 
bers were both sensitive to rejection. Pairs 
who were both insensitive to rejection also 


showed less affiliative behavior as distance 
increased, not so much because they sought to 
avoid interaction, but because the increased 
distance diminished the partner's salience 
relative to other objects in the situation. 

The combined findings from Experiments 
I and II highlighted the importance of affilia- 
tive dispositions in the study of the effects of 
furniture arrangement, It was also noted that 
Uncertainty or stress could be of significance 
in determining the effects of furniture arrange- 
ment for different types of persons. 

, Experiment I included a measure of nega- 
tive verbal contents. Consistent with the im- 


mediacy hypothesis (Wiener & Mehrabian, 
. 1968), it was easier for subjects to complain 
to one another in the less immediate posi- 
tions, Thus, implicit social constraints pro- 
hibited expression of negative feelings in the 
| Closer positions and only allowed weak ex- 
- Pressions of such affect when the pair were at 
- à greater distance from one another. 
Extrapolating from the findings of Experi- 
Ment ITI to more general social settings, there 
Appears to be some value in categorizing Ob- 
Jects as conversation facilitators (e.g, un- 
Usual or interesting views or art work that 
Can be the subject of comment) and distrac- 
| tors (eg. books, magazines, or puzzles that 
ate usually the subject of individual and non- 
Social activity). 
Because the facilitator and the distractor 
Vere used simultaneously in Experiment IIT, 


the results of the experiment revealed no sig- 
nificant main effect of props on affiliative be- 
havior. However, the relation of these two ob- 
jects to affiliative behavior was evident from 
the correlations between affiliative cues and 
the extent of preoccupation with either the 
sculpture or the puzzle. Table 4 shows con- 
sistent inverse relationships between observa- 
tion of the distractor puzzle and various in- 
dexes of affiliative behavior, whereas the dura- 
tion of observation of the sculpture was not 
significantly correlated with affiliative cues. 

The results of Experiment III also empha- 
sized the role of personality in mediating the 
effects of props on social interaction. Pairs of 
subjects who were both sensitive to rejection 
were found to make the most use of the 
sculpture-conversation piece, which somehow 
helped them through the initial awkwardness 
of conversing with a stranger. These same 
pairs avoided the poster puzzle that inhibited 
affiliative behavior. The findings also suggest 
that in an experiment designed to maximize 
individual differences, it is helpful to use both 
kinds of objects, those that inhibit as well as 
those that facilitate interaction. However, to 
enhance social interaction between persons 
who have difficulty in such a setting (i.e., 
those sensitive to rejection), the introduction 
of an unusual and striking conversation piece 
can be helpful. 

Further exploration of objects that may 
stimulate or inhibit conversation could in- 
volve experimental designs that allow the 
assessment of the main and interactive effects 
of conversation facilitators and inhibitors. The 
study of facilitators would seem to be of 
greater social value, and attempts to identify 
the attributes of such objects would probably 
require the concepts of pleasantness, com- 
plexity, and novelty. 
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INTROVERSION, EXTRAVERSION, AND 
INTERPERSONAL ATTRACTION 
CLYDE HENDRICK ! anp STEVEN R. BROWN 
Kent State University 
eriments investigating the relation between similarity and attraction have been 
Jetely consistent in finding a positive, linear relation between the two variables. 
ict, there are no studies reported showing an inverse relation. The present 
nent tested the hypothesis that similarity-dissimilarity on the trait of 
ion would yield an inverse relation between similarity and attraction. 
typical positive relation was predicted to hold for extraverts. Subjects com- 
i a questionnaire that measured their level of introversion-extraversion. They 
» evaluated two bogus strangers (one introvert and one extravert) on several 
ng scales. The hypothesis was confirmed on four of six measures. Both extravert 
| introvert subjects preferred an extraverted stranger on these four measures. 
wever, on the remaining two measures, the introverted stranger tended to be 
ferred. The results are discussed in terms of Byrne's reinforcement theory of 
traction. 

The purpose of the present research was to is of some interest. Second, there is the theo- 
explore the possible interactive effects of a retical issue of the importance of similarity as 
personality iable, extraversion-introversion a reinforcer. Byrne’s (1969) reinforcement 
(Eysenck, 1952), on the relationship between theory of attraction assumes that similarity 
perceived si:iilarity and attraction. One of the mediates attraction because it possesses reward. 
more reliable findings in social research is that qualities, not because of any inherent charac- 
attraction is 2 positive, linear function of per- teristics per se. The interesting aspect of 
ceived similarity on a variety of attributes (see similarity is that it has consistently been such 
Byrne, 1969, ior a review). The search for a powerful source of reward. The question 
variables that strongly interact with similarity arises as to whether other variables might be 
in determining attraction has not been particu- more rewarding than similarity, and whether 


larly successful. Weak interactions have been 
reported occasionally (e.g, Byrne & Clore, 
1967 ; Hendrick & Page, 1970), Byrne (in press) 
reviewed 20 studies that used individual 
difference variables as possible interacting 
variables, In the few studies where significant 
interactions were obtained, they were always 
due to differences in the slopes of positive 
similarity attraction functions for different 
personality groups. 

The net conclusion from a review of this 
research is that no study has found a significant 
reversal of the similarity-attraction relation- 
Pip. The search for strongly interactive 
Variables is of considerable interest. First, it is 
Most unusual in social research to find a rela- 
tionship that is so impervious to the impact of 
me variables, "Therefore, delermining the 
rad conditions under which the empirical 
Aw of attraction (Byrne & Nelson, 1965) holds 

"Requests for reprints should be sent to Clyde 


pendrick, Department of Psychology, Kent State 
niversity, Kent, Ohio 44242. 
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in appropriate circumstances such other vari- 
ables might reverse the reward qualities of 
similarity. Empirically, the question becomes 
one of whether there are other variables that 
will interact powerfully with similarity in 
determining attraction. 

There are two pieces of evidence suggesting 
that the personality dimension of extraversion— 
introversion might be such a strongly inter- 
acting variable. Star (1962) found that when 
subjects completed the Maudsley Personality 
Inventory (MPI) as if they were the kind of 
person they would actually like to be, higher 
extraversion scores were obtained than for a 
control group. This result suggests the norma- 
tive desirability of extraversion as a social 
trait. More recently, Brown and Hendrick (in 
press) had both introverts and extraverts make 
several Q sorts, using modified M PI statements 
as the pool of descriptive items. Subjects made 
Q sorts for actual private self, actual social 
self, desired private self, desired social self, and 
the ideal leader. The results indicated that 
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extraverts viewed themselves as actually extra- 
verted, and their ideal selves were also extra- 
verted. However, the ideal selves for introverts 
were also extraverted. These ideals were at 
variance with the introverts' perception of 
actual self as introverted. 

"These results suggest that introverts might 
be more attracted to an extraverted other than 
to an introverted other. A more general state- 
ment is that with respect to the extraversion— 
introversion dimension, introverts are more 
attracted to dissimilar than to similar persons. 
For extraverts, the usual positive relation 
would be expected to hold. Therefore, intro- 
version-extraversion should strongly interact 
with perceived similarity on the extraversion- 
introversion dimension in determining attrac- 
tion. This hypothesis was tested in the pres- 
ent study. 


METHOD 


Subjects participated in an experiment which was 
ostensibly a study in person perception. The first part 
of the experiment was a “study of self-perception,” 
during which subjects completed a modified version of 
the MPI. This scale provided a measure of the subject's 
actual position on the extraversion-introversion dimen- 
sion. The second part of the experiment was presented 
as a “study of the perception of other people." Subjects 
examined the response patterns of two (bogus) strangers 
on the extraversion-introversion scale. The response 
pattern for one stranger was highly introverted, and for 
the second stranger, highly extraverted. After examin- 
ing each response pattern, the subjects rated the 
stranger on several evaluative scales. 


Materials 


Extraversion—introversion scale. The 24 questions on 
the MPI (Eysenck, 1962) pertaining to extraversion— 
introversion were modified and used to construct a 
“behavioral preferences inventory.” In the modified 
form, only “yes” and “no” response alternatives were 
used, and some of the questions were revised slightly so 
that extraversion responding required a yes on half of 
the items and a no on the other half. Introversion 
responding required the opposite response pattern. The 
responses were scored by assigning a weight of 1 point 
to each extravert response and summing across the 24 
questions. The extraversion-introversion scores could 
thus range from 0 to 24, with 0 indicating complete 
introversion and 24, complete extraversion. 

Stimulus booklets. The response patterns for the two 
bogus strangers were prepared in booklet form. To 
avoid suspicion, each stranger was given a response 
pattern that was stereotypical for an extravert or 
introvert, but not completely so. The extravert stranger 
scored 20 (out of 24 possible), and the introvert scored 

4 (out of 0 possible). In addition, two of the questions 


were answered in the same way by both strangers. A 
seven-question “interpersonal prediction form'' followed 
each stranger's behavioral preferences inventory. Each 
question was rated on a 9-point scale, h had 
appropriately labeled end points. The ratings of each 


stranger included liking, perceived simila pref- 
erence as a leader, interesting to talk with at a party, 
ideal personality, reliability as a long-term fricnd, and 
honest and ethical in his relations with other people. 

As a control for rating bias, half of the booklets were 
prepared in an extravert-introvert order for the two 
strangers, and the remaining half of the booklets were 
prepared in the reverse order. 
Procedure 

At the beginning of the experiment, the experimenter 
explained in general terms the nature of person percep- 
tion. He then explained that there were two parts to the 
experiment. The first part was a study of sel!-percep- 
tion. The behavioral preferences inventory (the measure 
of actual extraversion-introversion) was then described, 
and copies were distributed. Subjects indicated their 
name, sex, and professor on the cover sheet and com- 


pleted the questionnaire. The questionnaires were then 
collected, and the experimenter explained t! 
part of the experiment. The subjects were to 
main reason for completing the first questio a 
to provide them with a background of expe;.cnce m 
projecting themselves into the questionnair* items. 
They were to use that experience to help--órm an 
impression of two persons who had previou-:y com- 
pleted the questionnaires, and evaluate t two 
persons on several rating scales. The format of the | 
booklets and rating scales was explained, and the 
booklets were distributed. After the ratings were com- 
pleted, the experimenter asked the subjects to write a 
brief paragraph describing their reactions to the 
experiment. Afterward, the booklets were collected, and 
the true nature of the experiment was fully explained. 


Subjects 


Students from classes in introductory, adjustment, 
and educational psychology participated in the study. 
Class size ranged between 25 and 75, and the data were 
collected during a regular class session, Altogether, data 
were collected from 205 students. Subjects were retained 
for data analysis if their extraversion-introversion score 
was 18 or higher (extraverts) or 9 or lower (introverts)- 
With these cutting points, the mean extraversion^ 
introversion score for each group was very similar to 
the extraversion-introversion score assigned to one of 
the bogus strangers. The cutting points yielded 34 
extraverts and 32 introverts. Two extraverts were 
randomly discarded to equalize the N. There were 13 
male and 19 female extraverts and 15 male and ! 
femaleintroverts. 


RESULTS 
Extraversion—I ntroversion 


The mean extraversion score for the €* 
travert group was 20.8, which was slightly 
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TABLE 1 
SUMMARY ANALYSIS OF VARIANCE FOR THE SEVEN MEASURES 
——— — — 
| F ratio 
Sou df 
Similari iki Interesting Ideal Pref. i 
1 es Liking at party | personality Kader. mg | Hos 
Extraversion-in tco ie 
version trait of | 
) subjects (^ 1 3.53 4.67** 7.09*** |  4,09** 10.91*** 3.70 1.82 
Error 62 : | 
Similarity to ilus 
persons (B | 1 | 244.09" | 28,05*** 2f TEM hs KEV A bane 8.70*** | 19.37*** bear 
AXB ies! 11.15*** | 39.30*** | 153.75*** | 78.25*** | 107.09*** iw aie 
Error | 62 
* 10. 
TET 
stb c 01 


higher than the score of 20 assigned to the 
bogus extravert stranger. The mean introver- 
the introvert group was 6.4, 


Sion score f. 

slightly above the score of 4 assigned to the 
bogus introvert stranger. The difference be- 
tween the two means was highly significant 
(E = 840.8, jf = 1/62, p < .0001). Thus, the 
Wo groups were very different in their re- 


Sponses to the extraversion-introversion scale, 
and the mean score for each group was extreme 
enough so that the groups could be appro- 
priately labeled “introvert” and “extravert.” 

An analysis of the subjects’ comments de- 
Scribing their reactions to the experiment 
indicated that extraverts were more “wordy” 
than the introverts. The extraverts averaged 
49.6 words per comment, and the introverts 
averaged 36.3 words. The difference was signifi- 
Cant (F= 458, df= 1/62, p< 04). Al- 
though no formal analyses were made, the 


es also seemed more emotive in the 
anguage they used to describe their reactions. 
Manipulation of Similarity 
i The rating question on perception of simi- 
p was included as a check on the manipula- 
lon of similarity. Extravert subjects rated the 
Introvert stranger was rated only 2.0 in 


extravert stranger 7.0 in similarity, but the 
larity, Introvert subjects rated the intro- 
E. stranger 6.5 in similarity, but the extravert 

Tanger was rated only 3.3 in similarity. Thus, 


appropriate perceptions of similarity-dissimi- 
: fo for the two bogus strangers were created 
t both introvert and extravert subjects. This 


difference in perceived similarity was highly 
significant, as shown in the first column of 
Table 1 (F = 244.09, df = 1/62, p < .01). 
The manipulation of similarity between the 
two strangers was slightly stronger for extra- 
verts than for introverts, and this difference is 
reflected as a significant interaction (F = 11.15, 
df = 1/62, p < .01). The mean extraversion— 
introversion score of 20.8 for extravert sub- 
jects was, in effect, 16.8 extraversion-introver- 
sion units from the bogus introvert stranger. 
However, the mean extraversion-introversion 
score of 6.4 for introvert subjects was only 13.6 
extraversion-introversion units from the bogus 
extravert stranger. This small difference in the 
actual manipulation for the two groups of 
subjects was rather accurately reflected in their 
mean ratings of perceived similarity. 


Dependent Variable Measures 


Each of the remaining six rating questions 
was considered as a dependent variable. The 
analysis of variance summary for each of these 
measures is given in Table 1. The mean ratings 
for each question are shown in Figure 1. The 
results for four of the six measures provide 
strong support for the predictions. The most 
direct measure of attraction was the ratings of 
liking. Extraverts liked a similar extravert more 
(7.3) and a dissimilar introvert less (3.9). 
However, introverts liked a similar introvert 
less (5.9) than a dissimilar extravert (6.2). The 
interaction was highly significant (F = 39.30, 
p < 01) The same pattern of results was 
obtained even more strongly for ratings of 
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Fic. 1. Each dependent variable plotted 


introversion and personality similarity—dissimilarity to the stimulus strangers. 


interesting al party, ideal personality, and pre- 
ferred leader. On all four measures, the extra- 
verts’ ratings of the similar extravert was 
significantly higher at the .01 level by the 
Newman-Keuls test than the rating of the 
dissimilar introvert. However, introverts rated 
the similar introvert significantly less favorably 
(01 level) than the dissimilar extravert on 
interesting at party, ideal personality, and 
preferred leader. The reversal for liking did 
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as a function of the subjects' extraversion 


not reach significance. These interactions wer 
all highly significant, as shown in Table : 
A very different pattern of results was 0 
tained for ratings of reliable friend and ho 
and ethical, Both extraverts and introverts p! 
ferred similarity to dissimilarity on these t 
items, but the introverts preference Wi 
stronger than the preference of the extravere 
This differential preference is reflected in th 
small interaction effects shown in the last twé 
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1. In fact, for the extravert 
n ratings of the similar ex- 
differ significantly from the 
f the dissimilar introvert on 
However, for introvert sub- 
ratings of the similar introvert 
s were significantly greater at 
han the mean ratings of the 
ert, 


columns of 
subjects, t! 
travert did 
mean rati 
either que 
jects, the 
on both qì 
the .01 lev 
dissimilar « 


DISCUSSION 


The int ion hypothesis was strongly 
confirmed or: four of the six dependent mea- 
sures. The sent results clearly indicate a 
condition u - which the similarity-attrac- 
tion relation does not hold. On measures of 
liking, int ing at party, ideal personality, 
and prefe: leader, introverts preferred 
dissimilar extraverts to similar introverts. 
Byrne, Clore, and Worchel’s (1966) generaliza- 
tion that “in any type of social comparison 
with other human beings, , .similarity is pre- 
ferred to dissimilarity [p. 223]? must be 
qualified to exclude similarity on the trait of 
introversion for the measures noted above. 

The interesting question remains as to why a 
positive relation was obtained for introverts on 


ratings of reliable friend and honest and 
ethical. Indeed, the positive relation obtained 
for these two measures was considerably 
stronger for introverts than for extraverts. The 
results for these two measures tend to rule out 
à general social desirability bias favoring extra- 
version as an explanation for the results of the 
study. If introverts aspired to be like extraverts 
generally, it seems unlikely that the introverts 
would have preferred similarity on reliable 
friend and honest and ethical. 

However, it may well be that both introverts 
and extraverts were responding to two different 
Patterns of social reinforcement that they 
Perceived the two stimulus strangers could 
Mediate. One pattern involves the perception 
of pleasant relations in an ongoing interaction 
situation, and the other pattern involves the 
Perception of certainty and stability in a long- 
term relationship. There seems to be à general 
cultural stereotype of the extravert as the life- 
Of-the-party, charming-personality, dynamic- 
leader type of individual who is the center of 
attention in interaction situations. Both intro- 
Vert and extravert subjects may have been 
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responding to this stereotype in their ratings of 
liking, interesting at party, ideal personality, 
and prefer as a leader. Thus, the extravert 
stranger was rated more favorably on these 
measures, resulting in a similarity preference 
by extravert subjects and a dissimilarity prefer- 
ence by introverts. 

There also seems to be a stereotype of the 
introvert as consistently stable and con- 
scientious in the long run, someone who can 
be counted on when one is in trouble and in 
need of a “real” friend. The measure of reliable 
friend and honest and ethical may have tapped 
this stereotype of long-term stability. In this 
case, introvert subjects clearly preferred a 
similar introvert. Extravert subjects appeared 
ambivalent. Their ratings on these two mea- 
sures did not differ for the two stimulus 
strangers, indicating a possible conflict be- 
tween similarity preference and preference for 
the perceived reinforcing properties of the 
stable introvert. 

This analysis is consistent with Byrne’s 
(1969) reinforcement theory of attraction. 
People are attracted to other people who can 
mediate rewards for them. For some kinds of 
behaviors, the personality of the extravert is 
perceived as more rewarding, and for other 
behaviors the introvert personality is more 
rewarding. The ratings of both introvert and 
extravert subjects indicated that they differ- 
entiated the two sources of reward in the two 
personality types and preferred them ac- 
cordingly. At the same time, the results of the 
present study contradict any empirical “law 
of attraction" that considers attraction as a 
positive function of similarity. In the present 
study, attraction Was sometimes positively 
related and sometimes negatively related to 
perceived. similarity. The positive relations 
obtained in previous studies may be due to à 
general egocentrism of individuals. For most 
of us, most of the time, our own behaviors and 
attributes are the best, most proper, and most 
rewarding. In our own perceptions, we are 
indeed the greatest. It follows that to the extent 
that others are like us, we are attracted to 
them. To the extent that our egocentrism is 
not completely general, exceptions to the 
similarity-attraction relation should occur. 
'The present study demonstrates one such set 
of limiting conditions. 


, 


36 CLYDE HENDRICK 4 


REFERENCES 


Brows, S. R., & Hexpericx, C. Introversion, extraver- 
sion, and social perception. British Journal of Social 
and Clinical Psychology, in press. 

Byexe, D. Attitudes and attraction. In L. Berkowitz 
(Ed.), Advances in experimental social psychology. Vol. 
4. New York: Academic Press, 1969. 

Byrxe, D. The attraction paradigm. New York: 
Academic Press, in press. 

Byane, D., & Crore, G. L., Js. Effectance arousal and 
attraction. Journal of Personality and Social Psychol- 
ogy Monograph, 1967, 6(4, Whole No. 638). 

Byexe, D., Crore, G. L., Jr., & WoxcHEL, P. The 
effect of economic similarity-dissimilarity on inter- 


EAS r 


re 


M ur AL 


STEVEN R. Brown 


personal attraction. Journal of Personality and 
Social Psychology, 1966, 4, 220-224. 

Byrne, D., & Netson, D. Attraction as a linear fune- 
tion of proportion of positive reinforcements. Journal 
of Personality and Social Psychology, 1965, 1, 659-663. 

Eysenck, H. J. Maudsley Personality In:ciitory. San 
Diego, Calif.: Educational and Industria! Testing 
Service, 1962. 

Henprick, C., & Pace, H. A. Self-esteem, attitude 
similarity, and attraction. Journal of Personality, 
1970, 38, 588-601. 

Star, K. H. Ideal-self response set and Maudsley 
Personality Inventory scores. Psychologica! Reports, 
1962, 11, 708. 


(Received May 6, 1970) 


| 


Journal of Persone? ind Social Psychology 
1971, Vol. 20, ^ 43 


ICED AFFECTIVE STATES AND INTERPERSONAL 
ATTRACTION * 


INI 


CHARLES GOUAUX * 


University of Missouri at St. Louis 


he Byrne and Clore classically conditioned model of evaluative responses, 
affective responses mediate evaluative or interpersonal attraction 
onses. It was hypothesized that as the internal affective state of sub- 

ts is made more positive or more negative, the attraction responses toward 
ranger would vary accordingly. The effect of induced affect on inter- 
ynal attraction was investigated in an experiment having a 3 X 2 factorial 
izn, with 3 levels of attitude similarity and 2 induced affective states, elation 

nd depression. As hypothesized, in an analysis of variance, both main effects 
. found to have a significant (p< .001) influence on attraction scores. 
With attraction as the dependent variable and attitudinal similarity and 
epression scores as independent variables, a multiple correlation coefficient of 


O was obtained (p < .0001). 


Byrne ai 
reinforceme: 
in terms oí 
posed that 


Clore (1970) presented “a 
t model of evaluative responses” 
classical conditioning. They pro- 
the effect of, for example, atti- 
tudes upon interpersonal attraction responses 
is a special case of the relationship be- 
tween reinforcing stimuli and evaluative re- 
sponses (ic., evaluative responses such as 
choices, preferences, verbal assessments, and 
approach-avoidance behavior). 

In terms of their formal model, any stimu- 
lus with reinforcement properties can function 
as an unconditioned stimulus (UCS) for an 
implicit affective response. This implicit or 
covert response has an affective quality that 
varies along the pleasant-unpleasant con- 
tinuum. Any discriminable stimulus (e.g. an 
object or person) consistently associated 
With reinforcement becomes a conditioned 
stimulus (CS) evoking the implicit affective 
response. "This implicit or covert affective 
response mediates the overt evaluative re- 
Sponse. Thus, according to this model, atti- 
tudes and other reinforcements function as 
fhe UCS and produce pleasant or unpleasant 
implicit affective responses. A stimulus person 

l'TThe results of this research were partially pre- 
md at the meeting of the Southwestern Psycho- 

gical Association, St. Louis, April 1970. This re- 
dad was supported by National Institute of Mental 

um predoctoral fellowship MH 45154-01. 

t Requests for reprints should be sent to the au- 
or, Department of Psychology, University of Mis- 


souri at St. Louis, 8001 Natural Bridge Road, St. 
ouis, Missouri 63121. 
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associated with the UCS functions as the CS, 
and the implicit affective response that is 
elicited mediates overt evaluative responses 
(eg., interpersonal attraction responses) 
toward that person. 

The Byrne-Clore model of evaluative re- 
sponses implies that when the implicit affec- 
tive response is positive, the evaluative or 
attraction response will be more positive than 
when the implicit affective response is nega- 
tive, or less positive. It is proposed that as 
this implicit response is experimentally ma- 
nipulated, that is, made more positive or 
negative, the attraction response will vary 
accordingly. In short, the more positive the 
affective state of the subject, the more posi- 
tive will be his attraction toward a stimulus 
person (the CS) associated with the affective 
induction. Mood and affect research tend to 
support this proposal. 

Wessman and Ricks (1966) collected daily 
self-ratings on the moods of Radcliffe and 
Harvard undergraduates. The Radcliffe stu- 
dents participated in the experiment for 6 
weeks, while the Harvard students were part 
of a 3-year personality and assessment 
project. They found, as hypothesized, that 
the self-concept is significantly more favor- 
able in elated moods than in depressed moods 
(p < .025), and that their subjects felt more 
interest in the social environment, more con- 
cern with peer interaction, in elation than in 
depression (p < .025). Thus, in terms of 
Q-sort data, self-descriptions given in depres- 
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sion indicated significant social withdrawal, 
isolation, and loss of self-esteem. On the 
other hand, subjects experiencing elation gave 
seif-descriptions indicating a sense of glad- 
ness and joy, high self-confidence, and an 
eagerness to communicate and participate 
with others. 

Wehmer and Izard (1962) manipulated the 
affect an experimenter expressed toward sub- 
jects—either positive or negative affect. They 
found that the subjects’ ratings of the experi- 
menter and the experiment and the subjects’ 
performances on intellectual tasks were sig- 
nificantly more positive for those subjects who 
had received positive affect from the experi- 
menter. Moreover, more positive ratings (or 
negative ratings for the negative affect treat- 
ment) were given to the experimenter the 
second time he was rated, later in the experi- 
ment. Thus, the level of the subjects’ 
(affective) ratings of the experimenter was a 
function of the duration of exposure to the 
experimenter’s affect. 

In a similar experiment, Izard (1964) 
assigned subjects to one of four conditions 
in which an actress portrayed one of the 
following four roles: a highly enthusiastic 
person; a warm, friendly person; an angry, 
hostile person; and a fearful, anxious person. 
The subjects rated their present feelings be- 
fore and after being with the actress and 
performing several intellectual tasks. As hy- 
pothesized, subjects exposed to the two posi- 
tive emotional conditions performed signifi- 
cantly better (5 < .07) on the tasks than the 
subjects exposed to the two negative emo- 
tional conditions, and they gave more positive 
affective self-ratings (p< .001) and more 
positive ratings of the actress (5 < .001). 
Thus, Izard (1964) concluded that “positive 
affect facilitates constructive behavior or ef- 
fective functioning while negative affect in- 
hibits or decreases constructive behavior |p. 
446]." Whether affect was expressed toward 
the subject or merely in the presence of the 
subject, the subject’s performance and affec- 
tive self-ratings were significantly influenced. 

Griffitt (1970) manipulated the proportion 
of attitudinal similarity between the subject 
and stranger (.25 and .75) and the effective 
temperature of the environment of the sub- 
jects—90.6 degrees (hot) and 67.5 degrees 
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(normal). As hypothesized, both variables 
significantly influenced attraction scores: the 
temperature-humidity variable (p < ./4) and 
the attitude similarity-dissimilarity variable 
(p < .001). Attraction responses were found 


to be more negative under the "ho: condi- 
tion than under the “normal” condition, and 
attraction to the stranger was positively re- 
lated to the positiveness of subjective feelings. 
Griffitt (1970) commented: “It appe..rs then 
that when one is feeling good his response to 
others will be more positive than | he is 
feeling bad [p. 243].” 

The present investigation was designed to 
test the hypothesis that attraction toward a 
stranger is a positive function of attitudinal 


similarity and of the affective state of 
subject. Thus, subjects in whom posi 
affective states are experimentally : 
give more positive attraction po! 


toward a stranger than subjects in "who 
negative affective states are induced. | 


METHOD 
Subjects 


Early in the semester, a 56-item survey 
tudes was administered to 110 female 
rolled in introductory psychology at Purdue Univer- 
sity. Females were used because they have been 
found to perform consistently better than males in 
studies involving the induction of moods (Velten, 
1967, 1968; Weitzenhoffer, 1953). That is, females 
are relatively more suggestible than males, and fe- 
males generally feel less reluctant to express induced 
affect in an experimental situation. Several months 
after the attitude surveys were administered, the 
subjects were seen in small groups for the expel 
mental session, 


Procedure 


Subjects were randomly assigned to one of 85 
experimental conditions. A 3 X2 factorial analysts 
of variance design was used. There were 3 levels 0 
attitudinal similarity between the subjects and à" 
anonymous stranger (proportion of similar attitudes 
=.15, 50, and 85) and 2 induced moods (elation 
and depression) . 

In the experimental session, small groups of sub- 
jects from each of the three conditions of attitudinal 
similarity were shown either the elation film G00 
Old Corn (Warner Brothers) or the depression film 
John F. Kennedy 1917-1963 (20th Century Fo®) 
Averill (1969) used these films successfully |™ 
inducing mirth and sadness, respectively, according 
to both psychological and physiological measures 
The copies used in this experiment were edited ™ 
accordance with Averill’s personal instructions © 


eliminate certain scenes that did not contribute 
to the desi tional effect. 

After vic their respective film, the subjects 
were given a 20-item survey of attitudes, purportedly 
filled out ionymous stranger of the same 
sex as the The stranger's attitude survey 
contained 2 to which the subjects had re- 
sponded a cginning of the semester. These 
were select their high level of topic impor- 
tance (Byrr ; press). The “stranger’s” survey of 


uly prepared by the experimenter 
e attitude conditions of .15, .50, or 
similar attitudes between the 
The subjects were instructed 


attitudes 

to conf 

85 prop: f 

Stranger ar ect 

to read the ers survey and then to evaluate 

the person or e Interpersonal Judgment Scale 

(IJS; Byrne & Nelson, 1965). The IJS is a six-item, 
| 7-point rati scale on which the stranger can be 

evaluated in ‘terms of intelligence, knowledge of 

current events, morality, adjustment, how much the 

"subject would like the stranger, and how much the 

subject would enjoy working with the stranger. The 
| 


last two items are summed to yield, and opera- 
the dependent measure of interper- 
ranging from 2 to 14, with a 
of 85. 

cvaluation of the stranger, each 
d the Today Form of the Multiple 
Check List (MAACL; Zuckerman 
The subjects checked a long list 
ccording to their present subjective 
feelings. The ACL was used to provide indepen- 
dent evidence as to whether the elation and depres- 
sion treatments produced the designated moods in 
he subjects also indicated the nature 
of their subjective feelings on six 7-point semantic 
differential rating scales. The six polar items 
used were sad-happy, pleasant-unpleasant, negative- 
and comfortable- 


tionally defin 
sonal attractio 
split-half reli 
Following 
Subject com 


& Lubin, i: 
of adjectiv 


the subjects 


Positive, bad-good, 
uncomfortable. 

The subjects in the debriefing at the end of the 
exneriment were questioned about how they felt. 
Most of the subjects said that they felt the induced 
mood slightly. After the purpose of the exneriment 
Was explained, the subjects were again asked to talk 
about their subjective feelings in order to insure 
that all negative feelings had dissipated. 


high-low, 


RESULTS 


To determine the effects of the affect 
manipulation, the MAACL Depression scores 
Were compared for the two mood treatment 
conditions, Whereas the mean of the Depres- 
Sion scores for the elation treatment was 
10.4, the Depression score mean for the 
depression treatment was 22.2. This differ- 


ence was highly significant (F= 125.79, 
df = 1/104, p < .0001); thus, the affect ma- 
hipulation was successful in inducing the re- 


Spective moods of elation and depression, or 
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PROPORTION OF SIMILAR ATTITUDES 
Fic. 1. Attraction toward a stranger as a linear 


function of the proportion of similar attitudes and 
the affect treatment condition. 


general positive and negative affective states. 

Analysis of variance was employed on the 
interpersonal attraction scores. Both main 
effects in the 3 X 2 factorial design were 
significant: attitude similarity (F = 54.88, 
df =2/104, p < .0001) and induced affect 
(F=31.16, df=1/104, p<.0001). The 
interaction was not significant. Attraction was 
plotted as a function of proportion of similar 
attitudes for the six experimental conditions 
and is shown in Figure 1. 

Because some of the subjects in each affect 
condition indicated a mood score marginal to 
the intended affect conditions, the attraction 
scores were arranged into three groups accord- 
ing to the following range of MAACL Depres- 
sion scores (Zuckerman & Lubin, 1965): 
elation (0-11), neutral (12-20), and depres- 
sion (21 and below). Analysis of variance 
again indicated two significant main effects in 
this 3 X 3 factorial design—attitude similar- 
ity (F — 68.52, df = 2/101, p < .0001) and 
induced affect (F = 29.16, dj = 2/101, p< 
0001). The interaction again was not sig- 
nificant. Figure 2 shows attraction plotted as 
a function of the proportion of similar atti- 
tudes for each of the three affect groups. The 
relevance of this analysis and Figure 2 will 
become obvious in the Discussion. 

The MAACL can be scored for anxiety and 
hostility, as well as depression. The affect 
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PROPORTION OF SIMILAR ATTITUDES. 


Fic. 2, Attraction toward a stranger as a linear 
function of the proportion of similar attitudes and 
the level of affect (elation-depression). 


treatments also produced significantly differ- 
ent Anxiety and Hostility scores. For the 
“elation” film treatment, the means for the 
Anxiety and Hostility scores were 5.74 and 
6.52, respectively; for the “depression” film 
condition, the anxiety mean was 10.95, and 
the hostility mean was 11.6. The differences 
between these means were highly significant: 
anxiety (F = 71.03, df = 1/104, p < .0001) 
and hostility (F = 44.98, df = 1/104, p < 
0001). The subjects’ Anxiety scores within 
each level of attitude similarity were divided 
into three groups: high anxiety (11-16), 
medium (7-10), and low (1-6). Analysis of 
variance of the attraction scores of the sub- 
jects divided into these groups indicated two 
significant main effects. The anxiety main 
effect was highly significant (F = 15.38, df = 
2/101, p < .001), as was the similarity vari- 
able (F= 59.60, df=2/101, p< 0001). 
When the Hostility scores were similarly 
divided into three groups within each level 
of attitude similarity, it was found that 
they also significantly influence attraction 
(F = 10.64, dj = 2/101, p < .001). 

A correlational analysis was employed as 
another approach to determining the effect of 
induced affect on interpersonal attraction. For 
the total group of subjects, attraction cor- 
related —.41 with Depression scores (p< 
.001) and .33 with the total score from the 


six semantic differential rating scales (p< 


001). The correlations between at‘raction 
and anxiety (r — —.31) and between attrac- 
tion and hostility (r — —.40) w both 
highly significant (p< .001). The correla- 
tion between the Attraction and stility 
scores with the influence of depression elimi- 
nated yields a partial correlation ficient 
of —21 (p< .05). The partial correlation 


coefficient for the attraction and 
variables, with depression held con 
very low (.06). The correlation bet 
traction and Depression scores with t 
ence of anxiety and hostility elimi 
—.27 (p< 005). It appears, theref 
in terms of the methodological ope: 
this experiment and the particular v riables 
studied, depression is the most powerful vari- 
able, anxiety is the least powerful variable, 
and hostility is of intermediate importance 
in affecting attraction scores. 

A multiple correlation coefficient was com- 
puted, with attraction as the depender varia- 
ble and level of attitudinal similarity and the 
Depression scores as independent variables. 
A value of .80 was obtained and found to be 
highly significant (F = 94.44, df = 2/107, p 
< .0001). The correlation between attraction 
and level of attitudinal similarity was high 
(r= .68, p < .0001), but the difference be- 
tween this value and the multiple-function 
value was significant (F = 58.50, df = 1/107, 
p < 001). The Hostility scores were added as 
a third independent variable to a multiple 
function, but this added variable did not sig- 
nificantly increase the multiple function. It is 
interesting to note that the induced affective 
states apparently influenced the subjects’ re- 
sponses only to the last two items of the IJS, 
liking for the stranger (r = —.30, p < .005) 
and willingness to work with the stranger (r 
= —.41, p < .001). The correlations between 
the Depression scores and the other IJS items 
were very low and not significant. 
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Discussion 


As hypothesized, and in terms of the opera- 
tions and variables of this study, interpersona 
attraction is a positive function of the sub- 
ject’s affective state. These results are thor- 
oughly consistent with results of recent unpub- 
lished research, performed before the present 
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ich moods were also manipu- 
Red Balloon (Montsouris 
selected as the elation-in- 

s, and Marketplace U.S.A. 
h, Pierce, Fenner, & Beane) 
"was chose he control film. The same gen- 
“eral experi | procedure was used, except 
‘that the ; ion of attitude similarity be- 
tween the ject and the stranger was .15. 
Unfortunately, the elation-inducing film low- 
ered the : of some of the subjects. Thus, 
the three : ! conditions of elation, neutral, 
and depre 1 could be compared. The dif- 
ference between the attraction means for the 
three moo! groups was significant (F = 
93.064, dj = 2/58, p < .05). The Attraction 
scores correlated —.31 with the Depression 
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‘ducing s! 
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Scores (/ )1). 

— Byrne and Nelson (1965, p. 662) postu- 
Plated that attraction toward a person is a 
"positive linear function of the proportion of 
" positive reinforcements received from the per- 
son. This tionship between attraction and 
the proportion of positive reinforcements was 
Ptentatively given as 


PR, 
Pr (cone) + 


Where PR, is the number of positive rein- 
forcements and NR, is the number of nega- 
tive reinforcements. Byrne and Rhamey 
(1965) extended this law of attraction by 
including a weighting factor in the equation 
to account for the effect of different reinforce- 
‘Ment magnitudes. The attraction law was “re- 
Written as 


n" 
ES x (PR, X M) ) 
Gen X M) + (NR: X M) Ts 


- OF attraction toward X is a positive linear 
function of the sum of the weighted positive 
reinforcements [p. 887]" (Number X Rein- 
forcement Magnitude or Weight) "received 
from X divided by the total number of 
Weighted positive and negative reinforcements 
Teceived from X [p. 887] Byrne and 
Rhamey found that when personal evaluations 
_ Were assigned a weight of 3 and attitude items 
- 8 weight of 1, their data could be plotted as 
- & positive linear function. In the present 
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Fic. 3, Attraction toward a stranger as a linear 
function of the proportion of positive reinforcements 
based on weighted combination of attitude items 
and mood conditions. 


study, using the least squares method, dif- 
ferent weights were assigned to the affect con- 
ditions in an attempt to show attraction as a 
positive linear function of the proportion of 
positive reinforcements. By weighting the af- 
fect conditions on a 9:1 basis compared with 
the attitude items—+9 for elation (or posi- 
tive affect) and —9 for depression (or nega- 
tive affect)—a linear function was obtained 
(Figure 3). The linear function y —8.55X 
+. 441 does not significantly differ from the 
Byrne and Rhamey formula. In fact, the sepa- 
rate linear functions obtained for the condi- 
tions of positive and negative evaluations in 
the Byrne and Rhamey study closely resemble 
the linear functions representing elation and 
depression shown in Figure 2: 

There are some interesting comparisons 
between Figures 1 and 2 of this study and 
Grifütt's (1970) results. The straight-line 
functions for Griffit's normal and hot treat- 
ments for the proportion of attitudinal simi- 
larity versus attraction fall very closely, that 
is, have similar slopes and y intercepts, to the 
functions for the elation and depression treat- 
ments, respectively, in Figure 1. This is sur- 
prising since one would expect the elation- 
treatment function to be above the normal 
treatment, that is, for the former to have a 
higher y intercept than the latter. However, 
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these results are quite consistent. Grifütt used 
the same six 7-point semantic differential rat- 
ing scales as those used in this study. The 
semantic differential mean for Griffitt’s normal 
treatment group was 32.95, whereas the mean 
ior the elation treatment of this study was 
29.95. Thus, that the functions are very simi- 
lar is quite appropriate since they have simi- 
lar affect means. The semantic differential 
mean for the depression treatment was 21.42, 
while the hot treatment group's mean was 
26.30. Since the former group's straight-line 
function lies below the latter's function, the 
comparisons are consistent with their corre- 
sponding affect means, Moreover, Griffitt’s 
normal treatment's linear function falls below 
the elation group in Figure 2, whose semantic 
differential mean was 36.15; the neutral 
group's function (27.94) falls between Grif- 
fits normal and hot treatments’ functions; 
and the depression group (19.90) falls below 
Griffitt’s hot treatment. These results from 
the present study and Griffitt’s study support 
the proposition that the height of the y inter- 
cept is a positive function of the affective state 
of the subjects. 

Since the elation and depression treatment 
groups of Figure 1 have similar affect means 
and similar linear functions to Griffitt’s nor- 
mal and hot treatment groups, respectively, 
these groups should also produce similar linear 
relationships between the proportion of posi- 
tive reinforcements and attraction. As noted 
above, the linear relation Y = 8.55X + 4.41 
was obtained when attraction was plotted as 
a function of the proportion of positive rein- 
forcements for the affect treatment groups. 
When the affect conditions in Griffitt’s study 
were also assigned weights of +9 for the 
normal condition and —9 for the hot condi- 
tion, the linear function Y = 8.32X + 4.48 
was obtained. This second formula represents 
a cross-validation of these weights derived 
empirically in the present study. Thus, it 
seems that subjects under comparable affective 
conditions give attraction responses toward a 
stranger which vary with the proportion of 
similar attitudes between the subject and 
the stranger. 

The consistency with which these experi- 
mentally induced affective states influence at- 
traction suggests that any strong positive or 
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negative affect will significantly inflvence in. 
terpersonal attraction responses. These find- 


ings support the assumption that positive 
affect is associated with more posit inter- 
personal perceptions and evaluations 2nd that 
negative affect is associated with n nega- 
tive interpersonal perceptions and eva uations. 
These data can be interpreted in a reinforce- 
ment framework; that is, positive- | nega- 
tive-induced affective states function ^s posi- 
tive and negative reinforcements, rc tively, 
which are associated in time with ano! er per- 
son (Griffitt, 1968). As Griffitt (1970) ar- 
gued: “attraction is not simply a function of 
the qualities directly attributable ‘o the 


nction 
litions 
n [p. 


stranger being evaluated, but also a : 
of additional positive or negative c 
of the [internal and external] situ 
240]." 

It is also possible to interpret the results in 
terms of drive theory (e.g, Gewirtz & Baer, 
1958). That is, these affective states represent 
different levels of a drive (perhaps social) for 
which the same reinforcing stimuli have dif- 
ferent reinforcement values. It is possible that 
affective states are differential levels of a 
generalized state of activation or arousal. On 
the other hand, these affective states may in- 
volve a specific motivational state. It is possi- 
ble that persons experiencing positive affective 
states are more sensitive to social reinforcers 
than nonsocial reinforcers, and that nonsocial 
reinforcers may have greater effeciiveness 
than social reinforcers for persons in negative 
affective states. The experimental results from 
the studies by Wehmer and Izard (1962); 
Exline and Winters (1965), and Wessman and 
Ricks (1966) indicate increased social ap- 
proach behavior associated with elation and 
social avoidance behavior associated with de- 
pression. Further research is required t0 
further clarify the nature of affective states 
and their influence on interpersonal attraction. 
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SOME EFFECTS OF FATE CONTROL UPON THE TENDENCY 
TO BENEFIT AN EXPLOITATIVE OTHER ! 


ROBERT S. WYER, Jr? anp STEVEN J. POLEN? 


University of Illinois, Chicago Circle 


Each subject (P) interacted with a simulated other (O) in a two-person game. The 
amount of fate control each party had over himself and the other was systematically 
varied over experimental conditions. Over the course of 30 trials, O typically chose 
the response that would maximize his own gain at cost to P. P was relatively less 
likely to benefit O at cost to himself when O had substantial control over his own 
outcomes. On the other hand, P's attitude toward O was more favorable under these 
conditions. Neither the amount of fate control that P had over O, the amount of 
fate control that O had over P, nor the amount of control that P had over his own 
outcomes greatly affected P's overall response probability. Although the theoretical 


amount of behavior control by P and O 


outcome matrices, the actual contingency of P's behavior upon O's behavior varied 
with the amount of fate control of the participants. There was little evidence that 
P's willingness to benefit O during the course of the interaction was mediated by 


his attitude toward O. 


Previous research on the dynamics of two- 
person interactions has attempted to clarify 
the conditions under which a person will 
benefit another, despite the risk of being ex- 
ploited (cf. Bixenstine & Wilson, 1963; Lave, 
1965; Terhune, 1968). This research has been 
devoted almost exclusively to the study of a 
particular type of interaction situation, com- 
monly called the **Prisoner's Dilemma game" 
(cf. Rapoport & Chammah, 1965). The most 
useful parameters for describing such an inter- 
action and for predicting behavior in the 
situation have not been identified and sys- 
tematically varied ; perhaps for this reason, and 
also because of a frequent failure to tie experi- 
mentation to social psychological theory and 
concepts, progress in attaining the above- 
mentioned objective has been somewhat 
limited. 

As noted by Wyer (1969), controlled two- 
person interactions may be described in terms 
of parameters suggested by Thibaut and 
Kelley's (1959) definitions of fale control and 
behavior control. Consider a situation in which 
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over one another was held constant over 


two persons, P and O, each have two respons 
(1 and 2) available to them, and in which the 
utility of their outcomes in the interaction 
depends upon the particular combination of 
responses they make. Such an interaction may 
be represented by the following diagram : 


O’s responses 


Or Os 
Py P1,01 P2,03 

P’s responses 
P» ps0» p4,04 


In this diagram, pı and o; represent the 
utilities of the outcomes to P and O, respec 
tively, if both choose 1; pə and 0; refer to the 
utility of the outcomes they receive if 
chooses 1 and O chooses 2, and so on. 

One person is considered to have fate control; 
over another if his behavior affects the othe 
outcomes. In the above diagram, the magn 
tude of P’s fate control over O may be indicat 
by the difference between the average utility 
of the outcomes available to O if P chooses 1 
and the average utility of the outcomes avail 
able to him if P chooses 2; in other words, bY 
the quantity (o1 + 0; — o; — o,)/2. The 
amount of fate control that P has over hims 
may be measured by the difference bet 
the average outcome available to P by choosing 
1 and the average outcome available to him b; 
choosing 2, that is, (pi-- p» — ps — pa)/ 
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Estimat« ©’s fate control over P and also 
over himse! 1 be obtained similarly. 

Accordi l'hibaut and Kelley, one person 
has beha ntrol over a second if the re- 
sponse the ind would make to maximize 
his own « nes depends upon the behavior 
of the fir e amount of this control might 
therefore be estimated by the extent to which 
the magnitude of the second’s control over his 
own outcomes depends upon the response of 
the first. Thus, in the above example, P’s 
behavior control over O could be represented 
by the qi 
[(o: i, C); fos 04) ]/s 

= (01 + 04 — 02 — 03)/2.4 

The typical Prisoner’s Dilemma matrix (see 
Rapoport & Chammah, 1965) may be inter- 
preted as one in which (a) each person has more 
fate control over the othersthan he has over 
himself, and (b) the response each would make 
to increase the magnitude of his own outcomes 


would decrease the outcomes available to the 
other. In such a situation, it is in O's interest if 
P can be induced. to benefit him, while at the 
same time O chooses the response that will 
reduce P’s outeomes greatly for some addi- 
tional personal gain. The present study ex- 
plored the effects of different types of fate 
control upon the likelihood of P's benefiting 
O under these circumstances. 

It seemed reasonable to expect that P would 
be more willing to benefit O if P did not 
greatly decrease the magnitude of his own 
outcomes by so doing, and if O did not harm 
P too severely by exploiting him to O's own 
advantage. To this extent, the tendency for P 
to act in O's behalf should be related negatively 
both to P's fate control over himself and to O's 
fate control over P. 

The effects of the other two fate-control 
variables upon P's behavior are less predict- 
able. An increase in O's control over his own 
Outcomes could increase the perceived legiti- 
macy of his exploitation of P, but could also 
decrease P’s feelings of responsibility to him. 
These factors could have opposing effects upon 
P’s tendency to benefit O. An increase in P's 
fate control over O might increase his attempt 


‘Division by two in this case is only for convenience 
to make the expression comparable to those used as 
indexes of fate control. 


to coerce O into acting in his behalf by “punish- 
ing” him, and thus might decrease his tendency 
to benefit O; this phenomenon is referred to by 
Thibaut and Kelley (1959) as the conversion 
of fate control into behavior control. On the 
other hand, an increase in P's fate control 
over O could increase his sense of responsibility 
to O in this situation and thus increase his 
tendency to benefit him. Which, if either, of 
these effects would predominate was unclear. 
A secondary but important purpose of this 
research was to investigate the effects of O's 
behavior in the interaction upon P's attitude 
toward him. 'There are several reasons to 
expect that the variables that influence P to 
act against O's best interests would be related. 
negatively to P’s attitude toward O. On one 
hand, dislike of O because of his exploitative 
behavior may mediate actions against es 
interests. On the other hand, a decr in 
liking for O may be the consequence ofa 
decision to behave in a way harmful to him 
(Festinger, 1957; Glass, 1964; Secord & 
Backman, 1961). In either event, information 
about the effect of independent variables upon 
attitudes toward O was expected to help clarify 
the reasons for differences in P's behavior 
toward O in the situations being studied. 


METHOD 


Introductory psychology students participated in the 
study in pairs. Although the subjects were placed in 
separate rooms, they were given the impression that 
they would interact with one another by typing mes- 
sages and responses into a teletypewriter. In fact, how- 
ever, each subject (P) interacted with a computer (O) 
which was programmed to respond in a manner to be 
described. 

P and O interacted for 30 trials. On each trial, P and 
O each had two responses available. As a consequence of 
cach of the four possible combinations of responses by 
P and O, each person received a certain number of 
points. Subjects were told that these points would be 
converted into money to be paid them at the end of 
the experiment. 

Over the series of 30 trials, the simulated other 
typically responded in a way that increased the number 
of points he personally received at P's expense. In part 
to increase the salience of O's desire that P act in his 
behalf, and in part to determine what effect verbal 
communications from O would have upon P's behavior 
under conditions in which O typically exploited him, the 
subjects in the present study were ostensibly given the 
opportunity to write messages to one another at three 
predetermined points during the interaction. At these 
points, P received fictitious messages from O that 
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either were positive in affective tone (indicating a 
willingness to act in P's behalf if P would reciprocate), 
negative in tone (threatening P with exploitation 
unless he acted in O's best interests), or contained no 
information about O's wishes, Each subject received 
communications of only one type over the course of 
the interaction. 

Nine different outcome configurations were con- 
structed which were designed to represent different 
combinations of fate control by each participant over 
himself and the other. Fate control was manipulated by 
varying systematically the points available to partici- 
pants as a result of different combinations of behavior. 
To determine the effectiveness of this manipulation, the 
utility attached by each subject to the outcomes avail- 
able to him was estimated both before and after the 
interaction. After this interaction, each subject's atti- 
tude toward the other was also measured. 

A total of 135 subjects was run, with 5 under each 
combination of three levels of control over P, three 
levels of control over O, and three levels of com- 
munication type. 


Construction of Outcome Matrices 


To manipulate the amount of fate control that P and 
O ostensibly had over themselves and one another, a 
basic outcome matrix similar in form to that typically 
described as a Prisoner's Dilemma game was selected. 
This matrix (the center matrix in Table 1) was such 
that the response which would maximize the points 
available to oneself (in each case, Response 2) would 
decrease the points available to the other. Each person's 
fate control over himself and the other is theoretically 
defined in terms of the utility of the points received by 
both participants as a result of different combinations of 
responses in the interaction. However, since utility data 
were unavailable before running the experiment, it was 
assumed that the approximate magnitude of this control 
could be estimated by considering only the points 
received by the person being controlled. Thus, in the 
matrix in question, the fate control of each person over 
the other, measured in terms of the points received by 
the other using the expressions described previously, is 
equal to [5+3 — (—2) — (—1)]/2, or 5.5, while the 
fate control of each person over himself, defined in terms 
of the points available to him as a consequence of his 
own choices, is equal to [3 + (—2) — 5 — (-1)]/2, or 
— L5. The behavior control of each participant over the 
other is equal to [3 4- (—1) — (—2) — 5/2, or —.5. 
The fate control of each party over himself and the 
other was systematically manipulated in relation to this 
“base-line” matrix by varying the points available to 
the recipients of this control, while the behavior control 
of each party over the other was held constant. To 
increase P's fate control over himself, 10 points were 
added to those available to him by choosing Response 
2; to increase P's fate control over O, 10 points were 
added to those available to O when P chose Response 1, 
"The fate control by O over himself and P was manip- 
ulated similarly. Table 1 describes the nine outcome 
matrices that were assumed to represent three levels of 
control over P and three levels of control over O. 
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Measurement of Utilities 


The essential features of the procedure í 
utilities have been reported elsewhere (\\ 


'ssessing 
r, 1969, 


1971) and need not be elaborated here. U; ntering 
the experimental situation, each subject ; given 
detailed information about the interaction in which he 
would take part, including the monetary v: f points 
to be received by each person, but not inc! ding the 
particular outcome matrix governing the i action. 
‘The subject was then asked to estimate the rability 
of each of 21 different outcomes, each defi: 1 terms 
of the number of points P would receive (^ id the 
number that the other person would re: (Ro). 
Values of R, and Ro each ranged from —10 o +10.5 
(To give a specific example, when Rp = =- 3, 
the subject was asked to indicate how desirabl: it would 
be if he received 5 points and the other pers ceived 
—3 points.) Estimates were made along 2]-point 
scale from 0 (very undesirable) to 20 (very irable). 
A description of each outcome to be rated was printed 
out on a teletypewriter, and the subjects typed their 


estimates into the teletypewriter. ' 

To obtain a general equation for estimating the 
utility attached by subjects to an outcome (//j), a 
function of the following form was assumed: 


Usi = hyRpi + Ex(1 — 9) Ry; 
+ kðaRoi + ka(l — ôi) Roi + he. C1] 


In this equation, Rp: and Re: are the numbers of points 
received by the subject and the other, respectively, as 
represented in the ith outcome; 4; and /; 
attached to points won and lost by O, respect 
the point of intercept (i.e., the utility of an o: 
which both parties receive no points); ô: 
function which is equal to 1 if Rp; > 0, and 
0 otherwise; and 52 is a step function which is 
1 if Ro; > 0, and is equal to 0 otherwise. 

A standard least squares analysis was used to deter- 
mine values of k-k; which would produce the best 
fitting function relating each subject's estimates of Ui 
to R,; and Ros over the 21 outcomes he rated. 

After participating in the interaction, the subjects 
were again asked to estimate the desirability of a 
different set of 21 outcomes using an identical pro- 
cedure. Changes in the weights attached to points won 
or lost by P and O as a result of the interaction could 
therefore be investigated. 


Procedure and Instructions 


Each subject was seated in a separate room and 
received tape-recorded instructions over a loudspeaker- 


* Since the points available in several of the matrices 
used in the present study were greater than 10, the 
range of values considered in measuring utilities should 
perhaps have been extended. However, since the 
relation between predicted and obtained estimates of 
desirability is fairly linear (Wyer, 1969) within this 
Tange, the error in extrapolating to somewhat larger 
values of R, and Ro on the basis of the data obtained is 
probably not large. 
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TABLE 1 
Description OF OUTCOME MATRICES REPRESENTING EAcH LEVEL OF FATE 
CONTROL OVER O AND FATE CONTROL OVER P 
Fate control over O 
ByO Base line ByP 
[en O: O: O: | 0: | Os r 
| 3,3 —2,15 3,3 =2,5 i 3a " —2,15 
15522 919 isa 9,—1 15,—2 9,-1 
3,3 2,15 33 -2, 3,13 —2,15 
5,—2 —1,9 5,-2 -1,-1 $,-2 —L-1 
13,3 E EL 133 ieu 13345 -2,15 
15,—2 -19 15,—3 =1,=1 15: -1,-1 


equipped with a teletypewriter, into which he 
his responses, a set of envelopes containing forms 
ed for the experiment, and the outcome matrix 
he had heen assigned. 
Subjects were told that the purpose of the experiment 
to determine how persons behave when their be- 
or affects both themselves and others. The inter- 
ion situation was then described in detail. To assist 
the explanation, a standard sample outcome matrix 
S shown, and the nature of the interaction was 
eribed. Each subject was designated “A” and was 
believe that the other was B. The outcomes re- 
ting from each set of responses were described to 
ts in detail to eliminate any possible ambiguity 
ng how to use the matrix to determine the 
ices of their behavior. 
ts were told that they would interact for 
eral trials and that they were to record their re- 
onse on each trial by typing the numbers 1 or 2 into 
teletypewriter. The teletypewriters, they were told, 
re attached to a central control unit (in fact, an 
erdata Model 3 computer) which, after both persons 
played, would inform each of them of the other's 
ese, the number of points each person had re- 
don that trial, and the total number of points each 
“Accumulated up to that time. 
- To provide an incentive for the interaction, the 
dübiects were told that each point they would accumu- 
was worth .2¢ and that it was possible to win or 
ose as much as $1 over the course of the interaction. 
were informed that the amount usually won or 
was about 30¢. 
After the subjects received this information, utility 
à wcre collected in the manner described previously. 
jects were then askcd to remove the outcome matrix 


al, data concerning these expectancies were also 
td. Each subject was asked to predict O's 


response on each trial (1 or 2) before making his own 
choice and to record this prediction on a specially 
prepared answer sheet. 

Subjects were told that they would not have a 
chance to talk to one another while making their 
responses, but that at certain points during the inter- 
action each would be given the opportunity to type a 
message to the other person. They were told that they 
would receive specific instructions for doing this when 
the time came to type their first message. 

Transmission of written communications. Subjects 
were allowed to communicate with one another after 
the ninth, sixteenth, and twenty-third trials. After each 
of these trials, an instruction to type in a message was 
printed out on the subjects teletypewriter; these 
instructions were supplemented by information posted 
on the wall beside the subject. Subjects were told to 
type in any message they wished or, if they wished to 
send no message, to type the words “No message" into 
the teletypewriter. After typing his message, each 
subject received the communication ostensioly sent by 
the other participant. 

All messages from O were fictitious and were of one 
of three types. The fjllowing three m.ssazes were 
assumed to convey positive afje.l, while urging P to 
benefit O. (Typing errors were intentional to increase 
the credibility of the messages.) : 

[After Trial 9] I TRUST YOU. IAM GOING TO 

PLAY 1 UNLESS YOU DONOT COOPERATE. 

[After Trial 16] I STII, I STILL TRUST YOU, I 

WILL PLAY 1 MORE OFTEN. 

[After Trial 23] WE MUST TRUST ONE 

ANOTHER, I WILL CHOSE 1 IF YOU WILL. 
The communications intended to convey negative affect 


were: 


[After Trial 9] I DONOT TRUST YOU, I AM 
GOING TO PLAY 2 UNLESS YOU COOPERATE. 
[After Trial 16] I STII, I SYILL DO NOT TRUST 
YOU. I WILL PLAY 2 MORE OFTEN. 

[After Trial 23] WE CLEARLY DO NOT TRUST 
ONE ANOTHER, I AM GOING TO CHOSE 2 
UNTILL YOU CHOOSE 1 MORE OFTEN. 
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Under the third, no-message condition, each com- 
munication received by subjects was simply NO 
MESSAGE. 

Behavior of the stimulated other, The response sequence 
of the other participant was programmed in such a way 
that O chose 1 on the first trial of the interaction and on 
the first trial after each message had been sent. At all 
other times, whenever P chose 2, O would choose 2 on 
the next trial. Moreover, whenever P chose 1, O would 
also choose 2 on the next trial unless P had chosen 2 on 
the immediately preceding trial. Thus, O was presented 
as generally exploitative, choosing the response most 
advantageous to P if and only if P chose 1 immediately 
after having chosen a 2. While this procedure produced 
some variation in the behavior of O over subjects, this 
behavior was not expected to vary systematically over 
experimental conditions. 

Postexperimental data. After completing 30 trials of 
the interaction described above, utility data were again 
collected. Subjects were then asked to complete a short 
questionnarie which included questions pertaining to 
the extent to which written communications from O 
influenced his responses, the desirability of participating 
in a similar experiment with the same person again, the 
favorableness of his “general attitude" toward the other 
person, the desirability of having the other person as a 
friend, and the expected favorableness of the other's 
attitude toward him. The first of these questions was 
answered along an 11-point scale from 0 (not at all) to 
10 (greatly); the remaining questions were answered 
along an 11-point scale from —5 (very undesirable or 
very unfavorable) to +5 (very desirable or very 
favorable), 

At the end of the experiment, each subject was told 
that he had not actually been interacting with another 
person and that therefore he should not take O's 
behavior or written communications as an indication of 
any person’s actual motives or attitudes. 


RESULTS 
Effectiveness of Manipulations 


Three variables were manipulated in this 
study—the type and degree of control over 
P’s outcomes, the type and degree of control 
over the simulated O's outcomes, and the 
affective quality of the communications from 
O during the course of the interaction. Pre- 
liminary analyses were performed to determine 
the effectiveness of these manipulations, 

Although fate control, if defined in terms of 
the average number of points available to a 
person as a result of either his own or another’s 
behavior, was clearly manipulated (see Table 
1), it is more appropriate to define such control 
in terms of the utility of the outcomes available 
to him, which is presumably a function of both 
the points awarded to him and the points 
awarded to the other. To determine the 


amount of fate control over P, the utility 
function derived from each subject’s preinter- 
action estimates of the desirability of various 
outcomes was used to calculate the utility of 


each of the four outcomes described in the 
matrix to which he was exposed. The mean 
utility of outcomes in each matrix is shown in 
Table 2. Although no statistical analyses were 
performed, it is clear that P’s fate control over 


himself was greater in the first row of matrices 
(M = 4.99) than in the second row (4f = 1.86) 
or the third row (M = 1.95); this difference 
was more pronounced when O's fate control 
over himself was great and under base-line 
conditions than when P had substantial fate 
control over O. Similarly, O's fate control over 
P was greater in the third row of matrices 
(M = 9,85) than in either the first row 
(M = 2.76) or the second row (M = 2.90); 
this difference, however, was least under con- 
ditions in which O had substantial fate control 
over himself. Thus, although fate control over 
P was manipulated successfully, it was some- 
what confounded with fate control over O when 
matrix values were transformed to utilities. 

The behavior control of O over P, defined in 
terms of utilities, ranged from —.41 to — 1.89 
over the nine matrices, and did not appear to 
be systematically related to either the fate 
control over P or the fate control over O. The 
assumption that behavior control was held 
fairly constant over matrices seems justified. 

The effectiveness of the manipulation of 
control over O's outcomes is more difficult to 
evaluate. In light of the apparent effectiveness 
of the manipulation of control over P's out- 
comes, this manipulation was also assumed t0 
be effective. 

The effectiveness of the manipulation of the 
affective tone of the written communications 
from O should be reflected in the subjects! 
postinteraction estimates of the favorableness 
of O's attitude toward them. An analysis 9 
variance was performed on the subjects’ esti 
mates as a function of the three experimenta 
variables investigated. The main effect of com 
munication type was significant (F — 3.96, 
df = 2/108, p < .05), and indicated that the 
subjects estimated O's attitude toward them t0 
be slightly favorable under positive affect con- 
ditions (M = .27), slightly unfavorable undef 
no-communication conditions (M =— 80); 
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who were exposed to positive affect com- 
munications should expect O to be more likely 
to choose 1 than subjects who were exposed to 
negative affect communications, Data support 
this. An analysis of variance performed on the 


and moderately unfavorable under negative 
affect conditions (M — — 1.58). 

The effectiveness of the manipulation of 
communication type should also be indicated 
by differences in the expectancy for O to 
respond benevolently. Specifically, subjects 


proportion of times P expected O to choose 1 
over the 30 trials of the interaction yielded a 
significant main effect of communication type 
(F = 1.63, df = 2/108, p < .01); subjects who 


received positive affect messages expected O to 


choose 1 more frequently (M = .437) than did 
either subjects who received negative affect 
communications (M = .329) or those who 
received no communications. (M = .331). The 
Manipulation of the affective tone of the 
communications presented therefore appeared 
to be successful. 

In this study it was possible that differences 


in the subjects’ own response patterns over the 
course of the interaction would produce 
differences in the tendency for the simulated O 
to behave exploitatively. To investigate this 
Possibility, the proportion of times that O 
chose 1 during the interaction was calculated 
for each subject, and these data were then 
analyzed as a function of the two fate-control 
variables and communication type. No main 
effects or interactions were significant. The 
overall probability that O chose 1 was .269, 
Justifying the assumption that O was typically 
exploitative over the course of the interaction. 
However, the comparability of O's behavior 
Over experimental conditions indicates that the 
effects of experimental variables upon subjects’ 
behavior are unlikely to be due to systematic 
differences in the frequency with which O 
behaved exploitatively. 


Interaction Behavior 


The 30 trials of the interaction were sepa- 
ated in four blocks—the first 9 trials preceding 
the first communication and the block of 7 
Successive trials following each of the three 
9bportunities Lo communicate. An analysis of 
Variance was then performed on the proportion 


TABLE 2 


Mean Urrnirv TO P or AvaiLABLE Ourcomes AT EACH 
LEVEL or FATE CONTROL Over O AND 
Fate CONTROL Over P 


Fate control over O 
F: 

econ By O Base line By P 

oi | o o: | o o | o 
By Pi | 7.77 5.27| 8.09 610| 8.90 6.30 
Ps | 13.35 10.24] 12:73 10.33] 14.87 10.87 
: Pi | 7.69 6.90| 7.52 5.26| 6.27 2.55 
Baseline py | 9:16 6:57] 910 5.82] 10.93 6,09 
ByO P. |12.83 7.63|15.56 4104 13.64 — 2.61 
P. |13:87 6.5 18.16 5.36|18.45 6.31 


of times each subject chose Response 1 as a 
function of trial blocks and the three between- 
subject variables under consideration. The 
main effect of trial blocks was significant 
(F = 5.10, df = 3/324, p < .01), and indi- 
cated that the tendency to choose 1 (i.e., to 
benefit O at cost to oneself) increased over 
trial blocks. This finding is inconsistent with 
the results of previous studies (cf. Wyer, 1969) 
in which “noncooperative” behavior increased 
over trials. The effect of trial blocks did not 
interact with any of the three experimental 
variables, and the results reported below were 
therefore assumed to generalize over trials. 
The main effect of communication type only 
approached significance (F = 2.92, df = 2/108, 
p < 10). The tendency to choose 1 was 
greater under both positive communication 
(M = .538) and negative communication 
(M = 423) conditions than under no-com- 
munication conditions (M = .352). The analy- 
ses of the subjects’ postinteraction estimates of 
the amount of influence these communications 
had on their choices also indicated that com- 
munications had more influence under posi- 
tive communication conditions (M — 5.83) 
and negative communication conditions (M 
= 5.71) than under no-communication con- 
ditions (M = 2.32; F= 621, df = 2/108, 
p < 901). It seems justifiable to conclude that 
communications from a generally exploitative 
O which conveyed a favorable attitude toward 
P had no greater effect upon his behavior than 
did communications which conveyed an un- 
favorable attitude toward him, and that 
although these communications differentially 
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affected P's expectations for how O would 
respond, neither type of communication had a 
substantial effect upon P’s own responses over 
the course of the interaction. 

The major concern of this study was the 
effects of different types of fate control upon 
P’s responses. The proportion of times that P 
chose Response 1, collapsed over trial blocks 
and communication types, is shown in Table 3 
as a function of the two fate-control variables. 
An analysis of variance performed on these 
data yielded only a significant main effect of 
control over O's outcomes (F = 7.17, df 
= 2/108, p < .01) ; specifically, subjects tended 
to choose 1 less frequently when O had sub- 
stantial control over his own outcomes than 
under either base-line conditions or when P 
had control over O's outcomes. These data 
support the speculation that a subject is less 
likely to act in another's behalf at a cost to 
himself when the other has control over his 
own outcomes, and thus is less in need of 
assistance. 

The other three types of fate control con- 
sidered in this study did not have significant 
effects upon P's behavior. Table 3 shows that 
P's tendency to benefit O was no different when 
he had substantial fate control over O than 
under base-line conditions. The main effect of 
control over P was not significant (F — 1.02, 
df = 2/108, p > .10), indicating that neither 
O's control over P nor P's control over his own 
outcomes had a substantial effect upon P’s 
tendency to benefit O. While the effects of 
control over P appeared to be more pronounced 
at some levels of control over O than others, 
the absence of a significant interaction of the 
two control variables (F < 1) prevents this 
contingency from being taken seriously. 


TABLE 3 


MEAN PROBABILITY OF CHOOSING 1 As A FUNCTION OF 
FATE CONTROL OVER P AND FATE CONTROL OVER O 


Fate control over O 
Fate control 


over P M 
ByO Baseline} By P 
By P .227 421 488 .379 
Base line 413 433 452 433 
By O 323 474 407 401 
M 321 443 449 


Supplementary analyses. Wyer (1969) fow 
that subjects were more likely to respond img 
manner beneficial to O (ie., to choose 1) j 
they expected O also to choose 1 on that tr 
than if they expected O to choose 2, To tes 
for this in the present study, two additions 
analyses were performed. First, the proportio 
of trials on which P expected O to choose 1 wa 
analyzed as a function of the experimental | 
variables being considered. This analysis 
yielded a main effect of communication type 


main effect of control over P (F = 
df = 2/108, p < .01). The latter result i 
cated that P expected O to choose 1 
frequently when P had fate control 
himself (M = .299) than under either base-lim 
conditions (M = .410) or when O had 
control over P (M = .388). These data in 
that not only was P more likely to behay 
way detrimental to O when O had contro 
his own outcomes, but that he expected 
behave the same way toward ni 
similar conditions. ‘a 

A second analysis of the subjects’ respons 
was performed in which differences in expect 
ancies were controlled. Specifically, the coi 
tional probabilities of choosing 1 (a) give 
expectancy for O to choose 1 and (0) give 
expectancy for O to choose 2 were both calcu 
lated for each subject over the 30 trials of thi 
game, using his predictions of O's response 0 
each trial to determine expectancies. 4 
analysis of variance was then performed on thi 
probability of choosing 1 as a function 0 
expectancy, the two fate-control variables, aml 
communication type. The main effect of co! 
trol of O's outcomes was again significant U 
this analysis (F = 4.58, df = 2/108, p < 5 
Also, the probability of choosing 1 was greatel 
when O was expected to choose 1 (M = .59i 
than when O was expected to choose | i 
(M = .285; F = 93.18, df = 1/108, p < 01 
Of primary interest, however, was the sig 


two fate-control variables (F = 28b 
—4/108, p < .05). Data relevant to this inte 
action are shown in Table 4. Separate SU 
analyses of behavior under each expectant 
condition indicated that the interaction of t 
two fate-control variables was significant wA 
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| 0 was expected to choose 1 (F = 2.68, df 
24/108, p < 05), and approached significance 
when O w: pected to choose 2 (/— 2.10, 
df=4/108, ; )). When O had substantial 
fate contro! over both himself and P, P was 
relatively less likely to benefit O on those trials 
that he expected O to act benevolently, When 
P had control over O's outcomes, the tendency 
to benefit O : : this expectancy was greater 
than base-line conditions except when O had 
fate contro! over P. 
When P expected O to choose 2, his tendency 
to choose 1 was least when both participants 
had control over their own outcomes (M 


= .097), but was next lowest when each person 
had substantial control over the other’s out- 
comes (M = .253). In other words, an expected 
exploitative response by O tended to be recipro- 
cated by P citer when both participants were 
fairly independent of one another or when they 


were highly dependent upon one another. 
The contingency of P's behavior upon his 
expectancy for how O would respond may be 


inferred frorn the difference between the prob- 
ability of his choosing 1 given an expectancy 
for O to choose 1 and the probability of choos- 
ing 1 given an expectancy for O to choose 2. 
Data in Table 4 show that when O had control 
over his own outcomes, P's responses became 
more contingent upon his expectancy for O as 
the relative amount of control he had over his 
own outcomes became greater; however, under 
base-line conditions or when P had fate 
control over O, P’s responses were more con- 
lingent upon O’s when his outcomes were 
greatly controlled by O. 


Effects Upon Attitude and Utility Estimates 


The second major question investigated was 
the effects of the interaction upon the subjects’ 
attitudes toward O. Two sets of data were 
televant: the questionnaire data collected at 
the end of the experiment concerning P's 
řeactions to O and the changes in the utility 
of the points received by P and O as a result of 
the interaction. To determine the effects of 
experimental manipulations upon utilities, two 
analyses were performed. First, the values of 
!-k; in Equation 1, calculated on the basis of 
the subjects’ estimates of the desirability of 
outcomes made before participating in the 
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TABLE 4 


MEAN PROBABILITY OP CHOOSING 1 As A FUNCTION OF 
Fate CoNTROL over O, Fate CONTROL 
OVER P, AND EXPECTANCY 


Fate control over O 


Item 
| ByO | Baseline | ByP 
A. Expectancy for O to choose 1 
Fate control over P 
ByP 588 533 659 
Base line .601 .602 .662 
By O 412 EU 602 
B. Expectancy for O to choose 2 
Fate control over P 
By P 097 358 .362 
Base line .275 .334 310 
By O 305 290 :253 
C. Difference (A — B) 
Fate control over P 
ByP 491 .180 .297 
Base line .326 .268 352 
By O 107 A27 .349 


interaction stituation, were analyzed as a func- 
tion of the two control variables and com- 
munication type. This analysis yielded no 
significant main effects or interactions except 
for a main effect of parameter (F = 292.86, 
df = 4/432, p < .001), thus indicating that 
the subjects did not differ systematically 
before taking part in the experiment. The 
initial values of k1-Fs, averaged over all sub- 
jects, were .599, .387, — .014, —.282, and 6.003, 
respectively. These values indicate that the 
subjects attached somewhat more value to 
winning points (k1) than to losing them (R3), 
and that while they attached effectively no 
value to points won by O (ks), they attached 
some positive value to points lost by O (ha). 
'To determine the effects of participation in 
the experiment on these values, changes in the 
values of k-k; from before to after the inter- 
action were calculated for each subject. Analy- 
ses of variance of these change scores yielded 
a main effect of fate control over P (F — 6.89, 
df = 2/108, p < .01) and an interaction of 
this variable with parameter (F = 4.83, 
df = 8/432, p < .01). Data relevant to this 
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MEAN CHANGE IN UTILITY PARAMETERS AS A FUNCTION 
or Fare ConTrot over P 
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TABLE 6 


MEAN RESPONSE TO ATTITUDE ITEMS AS A 
or FATE CONTROL OVER O 


Parameter 
Control over P - -— — 
|n| & [5| & | ks 
By P 176| —.050 026 | E | —.559 
Base line 130 | p =.787 
008 | Oot | = —.008 | 1275 


By O 


interaction are shown in Table 5. These data 
suggest that the utility of points either won or 
lost by O were not affected under any condition 
(nor were they appreciably affected by the 
type of communication received, as evidenced 
by the absence of any significant effects or 
interactions involving this variable; in each 
case, F < 1.35). The value of points won by P 
increased slightly unless O controlled P’s 
outcomes; under the latter circumstances, the 
value of points won by P remained unchanged, 
but the importance of points lost increased 
substantially, 

The value of the Intercept Parameter ks 
may be related negatively to comparison level, 
or to the magnitude of the outcome regarded by 
the subject as affectively neutral. If this is the 
case, data in Table 5 indicate that P’s com- 
parison level decreased when O had fate 
control over him and acted against his best 
interests, but increased under other conditions. 
If comparison level is determined in part by 
the value of the outcomes one has received in 
the past (Thibaut & Kelley, 1959; Wyer, 1968), 
this result would be expected. 

Analysis of questionnaire data, Three ques- 
tionnaire items were considered: the desir- 
ability of having O as a friend, the desira- 
ability of participating with O again in a 
similar experiment, and the favorableness of 
P’s “general attitude” toward O. Analyses of 
variance were performed on responses to each 
of these items separately as a function of the 
three experimental variables. In each case, the 
main effect of control over O was significant 
(F = 3.87, df = 2/108, p < .05). These data, 
shown in Table 6, are consistent in indicating 
that attitudes toward O were more favorable 
when O had substantial control over his own 
outcomes than under base-line conditions or 
when P had control over O's outcomes. Since 


panit y 
Fate control Dir 
ver O era frd | partidpatins s 
By O 1.84 2.02 
Base line .62 —.71 
By P EU —.24 


the tendency to choose Response 1 was lk 
when O controlled his own outcomes, 
finding runs counter to the hypothesis 

an unfavorable attitude toward O medi 
tendencies to exploit him to P's pe 
benefit. In no case did the affective tone of the 

communications to P affect his attitude toward} 
O (in each analysis, F < 1), despite the 
that it affected P's estimate of O's attit 
toward him. ; 

Correlational analyses. Collapsed over e 
mental conditions, the probability of choos 
1 was not correlated significantly with any 0 
the three questionnaire variables assumed ti 
reflect P’s attitude toward O. (In each 
N = 135, r < .125, ns) The probability 
choosing 1 was correlated .207 with the 
interaction estimates of k; (the weight atta 
to points gained by O) and .047 with ka (th 
weight attached to points lost by O); only 
first of these correlations was signifi 
(N = 135, z = 2.38, p < .05). Changes ind 
and k, were both uncorrelated with the prob 
ability of choosing 1 (in each case, r < 100 

ns). In summary, there was little indicatio 
that attitudes toward O acquired during tW 
experiment, or changes in the utility of poin 
received by O during the course of the intet 
action, were related to the likelihood that 
will behave in a manner beneficial to O. 


DISCUSSION 


factors that were expected to affect f 
willingness to act in O’s behalf at cost 
himself, even when O typically pursued | 
own interests at P’s expense. Of the variabl 
considered, the primary determinant of $ 
willingness to act in O's behalf was the deg 
of control O had over his own outcomes; P Wi 
less likely to benefit O when this control ¥ 
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great. Sinc tendency was apparent even 
when P exp: 1 O to choose 1 (see Table 4), 
P’s behavior cannot simply be attributed to his 
expectation O will exploit him when O had 
control of ! "wn outcomes. It seems more 
reasonable onclude that when P has rela- 
tively more control over O's outcomes than [0] 
has over ! wn outcomes, he feels a re- 
sponsibility to act in O's behalf regardless of 
O's behavio d thus is more likely to benefit 
him than when O can take care of himself. 

Other types of fate control appeared to have 


little effect ı P’s behavior. These negative 
findings call into question several assumptions 


which often underlie the prediction of behavior 
in game-playing situations, For example, it was 
speculated that when P had fate control over 
0, he might coercion to induce O to act in 
his behalf -æ was little indication of such 
a tendency. Data also indicate that the 
amount P could gain by exploiting O was not 
a major determinant of his behavior in this 
experiment. Finally, there was no evidence 
that when O had fate control over P, responses 
to P’s detriment were effective in coercing P 


into acting in O's behalf. If anything, O's 
exercise of fate control over P seemed to 
decrease the tendency for P to act in O’s best 
interest. 

The single best predictor of P's choice on a 
given trial was his expectancy regarding O's 
choice on that trial. While P's choice should 
depend upon how he expects O to respond when 
O has behavior control over him (ie., when 
the choice P would make to maximize his own 
Outcomes depends upon O's behavior; cf. 
Thibaut & Kelley, 1959), it is less obvious that 
this should occur when the same response 
would maximize P’s outcomes, regardless of O's 
tesponse, Subjects who are uncertain about 
how to respond in a novel situation may often 
assume that the other person in the interaction 
is responding appropriately and therefore may 
tend to imitate him, regardless of the points 
Won or lost as a result. The implications of this 
possibility should be explored. 

The difference between the likelihood that P 
chose 1 when he expected O to choose 1 and the 
likelihood that P chose 1 when he expected O 
to choose 2 may indicate the extent to which 
P's behavior depended upon O's, and thus the 
effective behavior control that O had over P 


during the interaction. Note that if behavior 
control is defined in terms of the number of 
points received by P as a result of each com- 
bination of responses, the /Aeoretical behavior 
control of O over P is constant over the nine 
outcome matrices considered in this study. 
Data in Table 4 indicate that the effective 
control that one person has over the behavior 
of another is a function at least in part of the 
theoretical amount of fate control each party 
has over himself and the other. While Thibaut 
and Kelley (1959) hypothesized that fate 
control may be "converted" into behavior 
control through the strategic use of coercion, 
this hypothesis alone cannot account for the 
several contingencies described in Table 4. 
O's effective behavior control was relatively 
great when both P and O had substantial 
control over the other's outcomes. While this 
tendency is consistent with Thibaut and 
Kelley's formulation, the tendency for O's 
control to be even greater when both P and O 
had substantial control over their own out- 
comes would not be predicted from their 
theory. 

A second question raised in this study con- 
cerns the extent to which variables that 
affected P’s behavior in the interaction were 
related to his attitude toward O, and whether 
P's attitude toward O was either a determinant 
or an effect of his tendency to act in O's behalf. 
While the data obtained were not conclusive, 
they suggested that there was little relation 
between P's attitude toward O and his ten- 
dency to benefit him. Attitudes toward O were 
relatively more favorable when O had sub- 
stantial control over his own outcomes, but P 
tended less to act in O's behalf under these 
conditions. Correlational analyses failed to 
show a clear relationship between attitudes 
toward O and choice behavior. Finally, the 
utility that P attached to points won or lost by 
O was unaffected by O's exploitative behavior 
during the experiment. Thus, while there are 
undoubtedly individual differences in the desire 
to benefit or exploit one's partner, the results 
of this study suggest that such differences may 
often be able to be determined before the 
interaction. begins and may remain fairly 
stable throughout it. 

Participation in the study primarily affected 
the utility of points won or lost by P himself 
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and the position of P’s comparison level for 
evaluating his outcomes. When P’s outcomes 
were substantially controlled by O, the utility 
of points lost became more negative, and P’s 
comparison level decreased; when P had sub- 
stantial control over his own outcomes, the 
positive utility of points won, and also the 
comparison level, increased. These data suggest 
that P came to attach more importance to the 
type of outcomes that were more salient to him 
during the course of the interaction, and also 
that his comparison level moved in the direc- 
tion of these outcomes. Future research may 
need to determine more clearly the nature of 
shifts in both utility values and comparison 
level as a function of social interaction vari- 
ables if behavior in these interactions is to be 
predicted accurately as a function of param- 
eters defined in terms of these utilities (Wyer, 
1969, 1971). 
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F CONTRIBUTION OF EMOTIONAL AND INSTRUMENTAL 
NSE PROCESSES IN ADAPTATION TO SURGERY ' 
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edispositional measures of anxiety, internal-external control, and situational 
sures of worry, fear, pain, speed of recovery, and doses of analgesics were 
btained from 62 female surgical patients. Preoperative measures of fear and 


i (b) unrelated to speed of postoperative recovery and doses of analgesics. 
tirth order (first and later) and manifest anxiety (high, medium, and low) 
fected emotionality. Later borns low in manifest anxiety were least emo- 


| 2 e " 
| worry were (a) positively and linearly related to postoperative emotionality 
1 


iter borns high in manifest anxiety and firstborns either high or low 
i est anxiety were most emotional. Internal-external control was associ- 
ted with ability to influence care. Internals obtained more needed analgesics, 


4] id if they were also firstborn, they had longer hospital stays than externals. 


ie evidence contradicts the hypothesis that preoperative emotion is causally 
d to adaptive responses in this stressful situation, The data suggest that 
tionality should be treated as a response and that these responses and 


‘strumental responses can be independent. 


This study examined the effects of patient 
and situational variables upon emotional and 
Anstrumenta! behaviors during hospitalization 
for elective surgery. Two theoretical frame- 

- Works were used to select variables for study: 
the “emotional drive" theory (Miller & Dol- 
lard, 1941; Mowrer, 1956), as applied to 

Surgical patients by Janis (1958), and the 

parallel response model proposed by Leven- 

thal (1970). The purpose was to see which 
framework would best explain patient behav- 
lor during the naturally occurring stresses of 
Surgery. 

The emotional drive model posits that in 
threatening situations, fear causes adaptive 
- behavior (Janis, 1958); the level of fear de- 
termines the adequacy of adaptation. When 
fear is of moderate intensity and situationally 


‘This research was supported by United States 
- Public Health Service Grants NU 00301 and IF-4-NU 
27,185 to the first author and NU 00302 and HS 
00436 io the second author. We wish to thank the 
directors of nursing service of the two divisions of 

e Vale New Haven Medical Center who supported 
Us in our research endeavors and provided the entre 
A the clinical setting. 

Requests for reprints should be sent to Howard 
Leventhal, Department of Psvchology, Psvchology 
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determined, it stimulates the “work of worry” 
(Marmor, 1958) that prepares the patient 
for danger, Neurotically determined, high- 
intensity fear stimulates worry about ante- 
cedent childhood threats which are remote 
from and unrelated to surgery and, therefore, 
of little or no value in preparing the patient 
for danger. When preoperative fear is low in 
intensity because of lack of information or 
denial of threat, there is insufficient motiva- 
tion for the work of worry. Thus, unafraid 
patients will also be unprepared for the dan- 
gers of surgery, and they will become emo- 
tionally disturbed when faced with postopera- 
tive distress. In summary, the emotional drive 
theory predicts a curvilinear relationship be- 
tween preoperative fear level and postopera- 
tive adjustment, with patients moderate in 
preoperative fear showing the best post- 
operative adjustment and patients either low 
or high in preoperative fear showing (for 
different reasons) the worst adjustment. 
Janis (1958) has reported results from pre- 
operative and postoperative interviews with 
22 surgical patients and from a retrospective 
survey with 149 students, which support the 
curvilinear relationship predicted by the emo- 
tional drive model, In both samples, patients 
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at moderate levels of preoperative fear re- 
ported less postoperative emotional disturb- 
ance, anger, and criticism of hospital staff 
than did patients either high or low in pre- 
operative fear. In an attempt to replicate and 
extend these findings, Leventhal (1963) con- 
ducted preoperative and postoperative inter- 
views with surgical patients and related 11 
measures of preoperative emotionality to a 
variety of indicators of postoperative adjust- 
ment. The results did not support the curvi- 
linear hypothesis of the emotional drive theory 
since patients who were lowest in preoperative 
fear were lowest in postoperative distress and 
gave more favorable postoperative evalua- 
tions of the medical staff. 

Experimental studies of patient preparation, 
stimulated by Janis’ work, also provide no 
clear support for the emotional drive hypothe- 
sis. For example, Egbert, Battit, Welch, and 
Bartlett (1964) and Johnson (1966) com- 
pared the level of postoperative distress in 
patients forewarned about postoperative dis- 
comfort with control patients receiving no 
information. While the prepared patients were 
discharged earlier than the controls, there was 
no evidence suggesting that the improved 
adaptation was mediated by emotional proc- 
esses. The emotional drive theory would sug- 
gest that preoperative information should in- 
crease fear and stimulate work of worry for 
patients who would ordinarily be unafraid or 
only slightly afraid, and this work of worry 
should cause a reduction in postoperative 
emotional response. But the results of these 
studies show no differences between prepared 
and unprepared patients in reported severity 
of postoperative pain, and, as a consequence, 
there is no suggestion of differences between 
groups in emotional reactions. The simplest 
interpretation of the findings is that the infor- 
mation, given both preoperatively and post- 
operatively, on how one would feel and what 
one should do to control discomfort and re- 
cover speedily had a direct effect upon ade- 
quacy of adjustment. This conclusion is only 
tentative because the studies did not give ade- 
quate attention to the measurement of the 
separate processes involved in instrumental 
and emotional reactions. The inferences drawn 
from the experimental studies of surgical pa- 
tients are similar to those drawn from Pervin’s 
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(1963) laboratory study on the co: 
tressing stimuli. His subjects preferred to 


receive electric shocks in a condition in which 
they administered it to themselves, rather 
than in a condition in which it was delivered 
by the experimenter. But the painfulness of 
the shocks and the anxiety that they caused 
were not significantly affected by these condi- 
tions. Geer, Davison, and Gatchel (1970) also 
reported no significant effects of perceived 
control for subjective reports of pcinfulness 
of electric shock. On the other hanc, Bowers 
(1968) reported that under some conditions, 
perceived control was found to be associated 
with increased tolerance for pain. Thus, both 
the surgical studies and the shock studies sug- 
gest that acting to control danger does mot 
always significantly affect fear or subjective 
pain ratings, and they also suggest that in- 


strumental behavior which is perceived to 
effect the stimuli may be independe of fear 
and subjective pain ratings. This same con- 
clusion is supported by studies of the motivat- 
ing effects of fear-arousing messages c» health 
attitudes and practices (Leventhal, “ones, & 
Trembly, 1966; Leventhal, Singer, & Jones, 
1965; Leventhal, Watts, & Pagano, 1967); 
different stimuli control fear reactions and 
the behavior which reduces danger. Thus, 
strengthening the fear cues in a message on 
lung cancer and tetanus increases reports of 
fear and desire to take protective action, but 
does not influence behavior; action occurred 
when any fear arousal (beyond some minimal 
level) was combined with specific instructions 
on how to obtain innoculations or reduce 
smoking. The individual acts when he is aware 
of danger and when he is given or can create 
the specific instructions needed to guide his 
behavior. 

Leventhal (1970) and Lazarus (1968) have 
suggested that emotion be considered a re- 
sponse instead of a drive, and that fear and 
Coping responses are a product of the indi- 
vidual's cognitive appraisals of environmental 
danger. The surgical patient experiences dan- 
gers from pain, reduced capacity to satisfy his 
needs, and postoperative discomforts which 
arise while performing physical acts that fa- 
cilitate his recovery. When the patient pel 
ceives that he is threatened, he will perform 
instrumental acts to reduce danger and vill 


rol of dis- 


Hn — 


ctive of his concurrent emo- 
For example, to reduce the 
complications, a patient will 
will do so whether his fear level 
te, or high. An unfearful pa- 
: "I got up because walking 
et well, and it was neither pain- 
ing," or he might refuse to get 


do this irresi 
tional rea 
possibility 

ambulate an 
is low, mod 
tient might 
will help me 
ful nor frigh 


} 


up because he sees no connection between 
walking and getting well. A fearful patient 
might say: “I got up because walking will 
help me get well even though it is painful and 
frightening." All possible combinations of the 
emotional reactions and instrumental be- 
haviors are possible. 

The parallel model also suggests that the 


individual's own behaviors can function as a 
stimulus for further action (Leventhal, 1969, 
1970). For example, when he is frightened, a 
‘patient may act to reduce emotional distress. 
fear or the distress caused by pain, 
ht ask for information, request 
ne stay with him, attempt to relax, 
or request pain medication. In summary, the 
parallel response model depicts threatening 
situations as composed of danger stimuli and 
fear stimuli, and either of these types of stim- 
uli may elicit behaviors to influence the en- 
vironment. However, achieving fear reduction 
need not insure that the dangers will be con- 
trolled, and achieving control of the dangers 
need not reduce the fear. The model seems to 
provide a framework that can encompass the 
findings from the studies of responses to sur- 
gery, electric shock, and fear communications. 
To better explore the fit of the parallel re- 
Sponse model to the surgical setting, separate 
measures are needed for predisposition to emo- 
tional reactions and predisposition to initiate 
action to influence the environment. The ex- 
tent to which patients believe that they can 
influence what happens to them can be as- 
Sessed by means of Rotter's (1966) scale of 
internal-external (I-E) control of reinforce- 
ment. “External control” (low I-E) reflects 
the belief that there is no relationship be- 
tween one's actions and one's experience; out- 
Comes are governed by luck, fate, complex 
orces, or powerful other persons. “Internal 
Control? (high I-E) reflects the belief that 
eg actions, skill, effort, etc., determine one's 
ate. 
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There is evidence suggesting that patients 
high on the I-E scale will try to control the 
environment in the hospital setting. Seeman 
and Evans (1962) compared internal and ex- 
ternal patients matched on socioeconomic 
status and hospital experience and found that 
the internal patients were better informed 
about the disease and asked more questions 
of the physicians and nurses. Though work- 
ing with a nonpatient population, Phares 
(1965) found that high I-E (internal) stu- 
dents were more successful in changing the 
opinions of others than low I-E (external) 
students. This relationship appeared even 
though the actual content of the communica- 
tions was equated for both groups. Thus, the 
I-E scale seemed to be a useful predisposi- 
tional measure for the present study; specifi- 
cally, internal patients were expected to in- 
fluence staff decisions about their care. 

'The Manifest Anxiety Scale (MAS; Taylor, 
1953) was used to measure trait anxiety. In 
accord with Janis’ (1958) findings, it was. 
expected that surgical patients with high trait 
anxiety, the tendency to react emotionally to 
stress settings, would be more likely to mani- 
fest fear behavior. A third variable, birth 
order, has a less specific relationship to the 
theories elaborated, but appears to have po- 
tential significance for response to the stress 
of surgery and postoperative recovery. The 
patient's birth order may be associated with 
both emotional responses and instrumental 
behavior. Schachter (1959) reported that in 
comparison to later borns, firstborns are more 
frightened by the threat of painful stimula- 
tion and, when frightened, prefer to affiliate 
(Gerard & Rabbie, 1961; Sarnoff & Zimbardo, 
1961; Schachter, 1959; Wrightsman, 1960). 
This effect, which is found more consistently 
in firstborn females than in firstborn males, 
has been interpreted as showing the firstborns* 
dependence upon other people for social com- 
parison, approval, and support (Radloff, 
1961; Schachter, 1959; Warren, 1966). These 
findings lead us to expect more fear and de- 
pendency among firstborn female surgical pa- 
tients. It is difficult, however, to anticipate 
the exact direction that affiliative needs would 
take in the hospital setting; dependent pa- 
tients could elect to remain in the hospital 
with the professional staff or could choose to 


E 
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return to their families if this would better 
satisfy their needs. 

In summary, the emotional drive hypothe- 
sis predicts that patients who are moderately 
fearful before surgery will show the least emo- 
tional response and have the most favorable 
postoperative adjustment; patients either low 
or high in fear before surgery will show high 
amounts of emotional reactions and poor post- 
operative adjustment. The alternative hypoth- 
esis from the parallel response model predicts 
that emotional response and instrumental be- 
havior will be associated with different pre- 
dispositional factors and that emotional re- 
sponse and instrumental behavior will gen- 
erally prove to be independent of each other. 
Manifest anxiety and birth order are the 
predispositional factors expected to be associ- 
ated with emotional response. I-E control 
was the predispositional factor expected to be 
associated with behaviors that influence the 
environment, but birth order may influence 
the goal of the instrumental actions. 


METHOD 
Sample 


Complete data were obtained for a total of 62 
female patients, 44 admitted for elective abdominal 
hysterectomy (removal of the uterus) and 18 for 
cholecystectomy (removal of the gall bladder). The 
age range for hysterectomy patients was from 25 to 
51 years (median = 40) and that for cholecystectomy 
patients was from 28 to 69 years (median = 48). 
Forty-one of the women were patients of 24 dif- 
ferent private physicians (28 hysterectomy and 13 
cholecystectomy), and the remaining 21 were clinic 
patients. Patients with known chronic health prob- 
lems, such as diabetes or cardiovascular condition, 
were excluded from the sample. Data were discarded 
for 15 patients. In 14 cases (7 of each type of sur- 
gery), data were incomplete either because of failure 
to contact some patients on the morning of surgery, 
before the administration of preoperative medica- 
tions, or because of refusal to be repeatedly inter- 
viewed early in the morning after surgery, One other 
patient was dropped because of postoperative wound 
infection. These patients did not appear to differ in 
any systematic way from those with complete data. 

Hysterectomy and cholecystectomy were selected 
for study because the operations are frequently per- 
formed and are very similar in their general post- 
operative course, including requirements for pain 
medication (Jaggard, Zager, & Wilkins, 1950; Keats, 
1956). The analysis revealed no significant differ- 
ences between the two types of surgery, and no 
further consideration was given to this factor, 


Procedure 

Patients’ names were obtained from the operating | 
room schedules the evening before surzery, and the 
patients were contacted between 5 and 9 P.M, The | 
data collectors, registered nurses and lent prac- 


tical nurses, wore lab coats and ident hemselves | 
as members of a study team. They askec the patients 
if they would volunteer for an investigation of reac- 
tions to surgery, and the patients who ¿greed com- 


pleted the personality schedules and some other ques- 
tions not of primary interest to the dais reported 
here. Only three refused. 

Between 6:30 and 7:30 a.m. on the morning of 


surgery and again on the first 4 postoperative days, 


data were collected on emotional state «nd certain 
physiological responses. Only the verba! reports of 
emotion are reported here. Between 19 AM. of: 
the fifth postoperative day, the patient was inter- 


of her experience. At this time she was 
she would not be contacted again and w 
for her participation. Additional data were 
from charts and ward records, 


viewed and asked to make some overall evaluation 
told ‘ht 


s thanke 
obtained 


Predictor Variables 


Preoperative fear level, the major predictor varia- 
ble in the emotional drive theory, was measured on 
the morning of surgery before preoperative medica- 
tions were administered. The assessment instrument: 
was the 30-item Mood Adjective Check List 
(MACL), with five adjectives for each of six moods: 
(a) well-being, (b) depression, (c) fear, (d) happi- 
ness, (e) anger, and (f) lethargy. The patient checked 
each adjective on a 3-point scale to indicate whether 
it applied not at all, somewhat, or very much. Scores 
for each mood could range from 8 to 15 

Two short personality scales and a three-item scale 
of worry were given on the evening before surgery: 
The first scale consisted of nine items selected from 
the MAS (Taylor, 1953). Items that reflected sensi- 
tivity to one's emotional response were selected: 
Items asking about physical symptoms and self- 
confidence or esteem were not included.? The number 
of "anxious" responses to items was summed for ea 
patient, and a high score indicated high anxiety. The 
second scale consisted of nine items adapted from 
Rotter's (1966) I-E control scale. These items wet 
presented singly, however, rather than in the forced- 
choice form of the original, Responses were recorde 
on 7-point agree-disagree scales for each item, Five 
of the items were worded so that agreement indicated 
feelings of internal control, and four were worded $% 
that disagreement indicated feelings of emotional con- 
trol. Scores were summed so that high scores indi- 
cated internal control. 


3 The items selected were those numbered 13, 54 
67, 77, 83, 86, 94, 138, and 183 in Taylor (195%: 
Table 1, p. 286). The items were reworded so that 
anxiety responses were indicated by agreement for 
five and disagreement for four of the items l 


> scales were shortened to increase 
nd to reduce testing time. Sim- 
d because the sample included 
levels, and brevity was required 


plicity was 


many educat 
by the deman the hospital setting. To estimate 
the equiva! the short and long forms, both 


e were administered to 50 female 
students at the University of Wis- 


versions of 
first-year nur 


consin. For th S, the entire set of items was 
administered single setting, and two anxiety 
scores were calculated: the one based on nine items 


ihe remaining items. Reliability as 
son correlation between these 
3 alue that compares favorably 
t correlations of 81 and .82 for 
ylor, 1963). Because the abbrevi- 
| Likert-type items, its nine items 
original forced-choice form. For 
o scales were given at two sessions 
weck interval. Reliability as indi- 


and the other 

determined hy the Pi 
two scores wa 
with the test 
the full l 
ated I-E s 
were repeatec 
this reason, th: 
separated by 


cated by the intercorrelation of the scales was .55, 
which falls i: the range of 48-83 for test-retest 
correlations the full scale by Rotter (1966). 


The three : on the scale of worry were: (a) 
"Are you worried about your illness?” (b) “Are you 
worried about your operation?” (c) “Are you wor- 
ried about gc inder anesthesia (going to sleep) ?” 


Each item was answered on a 7-point scale (not at 
all to extrer and scores were summed for a 
total scale sco: 

The patient's birth order was determined from 


tained during the postoperative inter- 
> divided into firstborn (including 
nd later-born groups. 


information 
view. Patient 
only children) 


Dependent Variables 


The primary measures of emotional reaction were 
the MACL and the subjective evaluation of pain. 
The MACL was given each morning from the day 
of surgery through the fourth postoperative day. 
The patient's evaluation of her postoperative in- 
cisional pain was obtained separately for the worst 
pain felt and for the feast pain felt during the post- 
Operative period. Pain severity was rated on a 100- 
Point “pain thermometer" that had anchor points at 
9 (“No pain or discomfort whatsoever”) and 100 
(“The worst pain you can imagine”). 

While most patients experience painful sensations 
during the first hours following surgery, a patient 
who lies quietly may experience little incisional dis- 
comfort by the second and third day. Consequently, 
the severity of the worst pain is likely to reflect a 
variety of physical factors, for example, movement 
after surgery, and to be a relatively poor indicator 
of emotional distress, It seems likely, on the other 
hand, that the severity of the least pain will be 
intensified by emotional disturbance and inability to 
relax. Severity of “least pain” was used, therefore, as 
an indicator of emotional response. 

Patients high in internal control (high I-E) were 
expected to influence staff decisions. This affect was 
observed for two factors: pain medication and time 
of discharge from the hospital. 
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All of the physician’s postoperative orders allowed 
the nurses to decide when the patient should receive 
medication, and  patient-initiated communication 
(verbal or nonverbal) could influence this decision, 
The analgesic prescribed, dosage, and time of admin- 
istration were obtained from each patient’s chart 
for each 24-hour postoperative period (midnight to 
midnight).4 Time of discharge from the hospital was 
decided by the patient’s physician. This decision was 
based upon many factors, and some of these, such as 
the appearance, behavior, and general attitude of the 
patient, are to some extent under her control. Since 
there were many physicians discharging patients and 
since they were unaware of our interest in the time 
of discharge, it is unlikely that systematic bias oc- 
curred because of any particular physician's discharge 
practices, The length of stay was recorded in whole 
days. 


RESULTS 
Emotional Drive Hypothesis 


Levels of preoperative fear were formed by 
dividing the subjects into three more or less 
equal groups on the basis of their fear scale 
scores from the MACL that was filled out the 
morning of surgery. Scores in the interval of 
5-6 were considered low; those in the interval 
7-9, moderate; and those in the interval 10— 
15, high (M — 7.84, SD — 248). Examina- 
tion of the relationship of preoperative fear 
levels to postoperative emotional response re- 
vealed no indication of high postoperative lev- 
els of negative emotion for patients low in 
preoperative fear (Table 1). The significant 
differences for emotional response were a 
result of the high-preoperative-fear group 
reporting high levels of postoperative emo- 
tion, There also was little or no associa- 
tion between preoperative fear level and either 
of the two indicators of control of environment 
(doses of analgesics and length of hospital 
stay; see Table 2). 


‘The amount of analgesics received by patients 
has been used as an indicator of the degree of pain 
in studies of surgical patients (Egbert et al, 1964; 
Johnson, 1966). However, in the Johnson study, the 
patients’ subjective evaluation of the severity of 
pain and amount of analgesics received were cor- 
related only for those patients who had received 
specific instruction about how to cope with pain, 
It was assumed that patients in the present study 
would not systematically receive such instruction and 
that evaluations of severity of pain and amount of 
analgesics received would not be associated. As 
expected, correlations between pain scores and 
amount of analgesics received clustered around zero, 


60 


J. E. Jounson, H. LEVENTHAL, AND J. M. Danses, Jr. 


TABLE 1 
Mean Scores ON POSTOPERATIVE INDICATORS OF EMOTIONAL RESPONSE BY PREOPERATIVE | LEVEL 
| Mood scores 
(M over 1-4 days)* Pain 
Preoperative fear level | ET ` ating 
| Well-being | Depression Fear Happiness Anger | Lethargy 
Se E | 2 =|= 
Low (24) 34, 23. 22, «29. | 435, 15, 
Moderate (24) 35, | 24, 24, 41. | 23. 36. 21«4 
High (14) 32. 304 304 344 27a 434 344 
p» ns «.001 <.001 <.05 <.01 <.01 <.05 
Note.—The means in columns with the same subscript do not differ from each other at the .05 level (Newma uls test). 


ns Sees in parenthesis, 
` 


he possible range of scores was 5-15 for each day. Summing over 4 days gives a possible range of 20-60. 


» From one-way analysis of variance. 


There were no changes in the pattern of 
findings reported above if one substitutes the 
three-item worry scale scores for the MACL 
fear scale. This was not surprising, since the 
worry scale and the MACL fear scale were 
highly correlated (r= .50, df=60, p< 
001). 

Tn summary, the above findings do not sup- 
port the curvilinear prediction generated from 
the emotional drive hypothesis; patients who 
expressed moderate fear or worry before an 
operation did not show a better postoperative 
adjustment than the patients whose preopera- 
tive fear was low. 


Emotional Responses 


Manifest anxiety and birth order were the 
main predispositional variables that we ex- 
pected to relate to the key indexes of emo- 
tional response (fear and pain rating), Tables 
3 and 4 present the means from the MACL 


TABLE 2 


MEAN Scores on POSTOPERATIVE INDICATORS 
or CONTROL OF ENVIRONMENT BY PRE- 
OPERATIVE FEAR LEVEL 


Preoperative Total doses Days post- 
fear level analgesics operative stay 
Low A rae 
Moderate (E Ts 
High 48, Wily 
p ns ns 


Note.—The means in columns with the same subscript do 
not differ from each other at the .05 level (Newman-Keuls 
test), ns are given in parenthesis. The possible range of scores 
was 5-15 for each day, Summing over 4 days gives a possible 
range of 20-60, 

a From one-way analysis of variance. 


for fear level for the first 2 postoperstive days. 
The patients were divided into first! :rns and 
later borns and into high, medium, nd low 
groups on the MAS.* The only fins ‘ng that 
was statistically significant was a m 
for the first postoperative day; pat 


differ in reported fear as their manifest anxi- 
ety level varied; in later-born patients, there 
was a linear relationship between manifest 
anxiety and reported fear on both the first and 
second postoperative days (the Pearson r for 
manifest anxiety and fear was .48 on Day ! 
and 46 on Day 2, dí—38, p< Ol, for 
each); in firstborn patients, there was a curvi- 
linear relationship between manifest anxiety 
and fear, with less fear reported by patients 
in the middle level on the MAS. But when we 
compared the firstborns to the later borns, we 
saw that the curvilinearity for firstborns was 


A 
* The levels of manifest anxiety were formed arbi- 

trarily in a manner which produced nearly equal Ms- 
The arbitrary procedure was necessary because 777 
of the MAS scores fell in the interval bordered by 
the mean plus or minus one-half standard deviation. 
The unequal cell frequencies for birth order prob- 
ably reflect the proportions in the population at 
large. When unequal cell frequencies represent popu- 
lation proportions the least squares approach to the 
analysis of variance is prefrered and therefore W3$ 
used (Winer, 1962). The Newman-Keuls procedure | 
using the harmonic mean of the ns of the cells wel? 
used for all subsequent analysis. 
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fers from low as 
test, p < .0S), W 
mean differs f 
> <.05), 


due to the high 


tive day. 
Preoperative 


postoperative 
Operative f 
shown in Tab 
ratios were 


ficant. 


TABLE 4 


MEAN REPORTED FEAR ON SECOND POSTOPERATIVE 


Day nv BIRTH ORDER AND 
MANIFEST ANXIETY 


TABLE 3 
Mean Rev EAR ON First POSTOPERATIVE 
BiRTH ORDER AND 
UFEST ANXIETY 
Manifest anxiety 
Birth order —————— A 
w | Medium | High 
Firstborn 7.4 5.7 7.8 7.1 
n 7 6 9 
Later bor: 57 | 66 74 | 66 
ee 12 142 16 
M 3 6.3 7.5 
_Note.—Manif P = 3,36, df = 2/56, p <.05; 
birth order / interaction F = 2.25, df = 
2/56, ns. Within dium manifest anxiety mean dif- 


ifest anxiety mean (Newman-Keuls 
w manifest anxiety group, firstborn 
vorn mean (Newman-Keuls test, 


levels of fear reported by first- 
borns low in manifest anxiety. Differences in 
reported fear vanish by the third postopera- 


fear was not predicted as well 
by manifest anxiety and birth order as was 
fear, The pattern of the pre- 
means were similar to those 
3 and 4, but none of the F 


Evaluations of severity of pain as the indi- 
cator of emotional response showed results 
similar to those for fear scores. Compared to 
Manifest anxiety, birth order was more closely 
linked to reported severity of “least” pain; 
firstborns reported more pain than later borns, 


Manifest anxiety 


Birth order dr 
Low Medium High 
Firsthorn 74 62 | 66 | 66 
e 7 6 9 
Later born 53 5.8 69 | 64 
"a 12 12 16 
M 6.0 5.9 6.8 
Note.—Manifest anxiety F = L77, df = 2/56, ns; birth 
p Tw yi df 2 1/88, ns; IRE F = 1.96, df = 


differ: ns. With the low manifest anxiety group, firstborn mean 


3 ftom later-born mean (Newman-Keuls test, P < .05). 


TABLE 5 


Mean Pain Ratios nv BIRTH ORDER AND 
MANIFEST ANXIETY 


| Py 
Manifest anxiety 


Birth order ——————— ü 
Low | Medium High 

Firtbom | 26 | 3 | 31 | 30 

n | 7 | 6 9 
Later born DU iis 26 17 

n 12 12 | 16 

A M 2 | ae | 
| 


Note,—Birth order F = 4.30, df = 1/56, p <.05; anxiety 
F = 1,92, df = 2/56, ns; interaction F < 1, df = 2/56, Within 
the low manifest anxiety group, firsthorn mean differs from 
later-born mean at the .10 level of probability (Newman- 
Keuls test). 


but the difference between firstborns and later 
borns was greatest for low-manifest-anxiety 
patients (see Table 5). 


Control of Environment 


Patients scoring high in internal control 
were expected to have more success in con- 
trolling their environments; it was expected 
that they could influence the doses of anal- 
gesics that they received and the length of 
their postoperative hospitalization. Because 
emotional factors associated with birth order 
could influence the goal of the patient’s con- 
trol efforts, I-E control and birth order were 
both included as independent variables in a 
two-way analysis of variance." 

Patients high in internal control received 
more doses of analgesics than did those with 
middle external scores (Table 6). An unex- 
pected finding was the curvilinearity between 
LE control and doses of anagesics for first- 
borns; firstborns who were in the medium I-E 
group took the fewest analgesics. 

Birth order and I-E control produced a 
significant interaction for days of postopera- 
tive stay (Table 7). For the firstborns, the 
higher the score on internal control, the longer 
the hospital stay; both middle and high I—E 
groups stayed longer than low (external) I-E 

9I-E scores which fell in the interval bordered 
by the mean plus or minus one-half standard devia- 


tion were considered medium, below the interval, 
low, and above the interval, high. 
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TABLE 6 had completed the work of worry prior to re 

Mean Doses oF ANALGESICS BY BIRTH sponding to our fear measure, and ‘hat if we 
Ope Axp I-E Conrxor had measured fear at an earlier poi»: in time, 
eran a, low-fear patients would show adaptive 

Bie oder m | | n postoperative behavior. While the nclusion 
Low | Medium | High | from this line of reasoning is ] isible, it 

> 1 | ia does not readily account for the inconsistency 
Mena A "n T ex between Janis’ (1958) data and tle present 
Later born 58 61 | 95.| 74 data; both studies gave a measure oí preop- 
"a 2 | E pH erative worry on the day before s "y, and 

3 ee | ; the measure used in the present : . given 
Note—L-E F = $58, d/ 2/86, p «01; birth order the night before surgery, producec ‘he same 
firstborn, medium I- mean differs from high LE mean (New. Postoperative data pattern as did MACL mea: 
mre ces Mean o aal ee tom miaii sures of fear that were given the © ning of 
surgery, The present findings are in i 

; i f an unpubli 

patients. High I-E firstborn stayed in the ji by Leventhal (1963) and a voe rl 
hospital longer than high I-E later borns. SOHO Soria] patients conduct ed by. 


Discussion 


There was relatively little support for the 
emotional drive theory prediction, that a mod- 
erate level of fear before surgery is necessary 
to stimulate the work of worry and prepare 
the patient for coping with postoperative 
stress. Instead, the relationships between pre- 
operative fear levels and postoperative dis- 
turbance were generally linear; patients who 
reported a high level of fear preoperatively 
reported high levels of negative emotional 
states during the postoperative period, while 
patients who were low or moderately fearful 
before surgery reported low levels of emo- 
tional disturbance postoperatively. It could 
be argued that our patients who were low in 
fear both preoperatively and postoperatively 


TABLE 7 


Mean Days OF POSTOPERATIVE HOSPITALIZATION 
BY BIRTH ORDER AND I-E CONTROL 


Levels of I-E control 


Birth order | — M 
Low Medium High 
Firstborn 6.4 8.1 8.9 7.8 
n LÁ 8 7 
Later born 74 73 74 Wie 
»- 13 14 1 
M 6.8 TL 7.7 


Note, —Birth order F = 3,79, df = 1/56, ns; I-E F = 2.60, 
df = 2/56, ns; interaction F = 4,17, df = 2/56, p <.05. With. 
in firstborns, low I-E mean differs from medium and high I-E 
mean (Newman-Keuls test, p < .05). Within high I-E group, 
firstborn a differs from later-horn mean (Newman-Keuls 
test, f. <.05), 


Mesas Aces Ads Addi 


Wolfer and Davis (1970); neither of these 
studies found a curvilinear relationship be- 
tween preoperative fear ratings and postopera- 
tive emotional state. 

A study by Andrew (1970) of maic surgical 
patients can also be interpréted as failing to 
support the emotional drive theory. In the 
Andrew study, an individual difference factor | 
(coping styles) was the basis for separating 
patients into groups. Before surgery, half of 
the patients heard a tape-recorded message 
about the dangers of not electing the correc- 
tive operation and the surgery process, its 
preparation, and its consequences. The group 
that benefited most from the message (shorter 
hospital stays and fewer medications than 
their control group) was the group that 
learned poorly the “most threatening" ma- 
terial in the message. This casts doubt on the 
assumption that these patients benefited be- 
cause their fear level was raised by the prê- 
paratory message and stimulated, therefore, 
the productive work of worry. The data from 
sources seem consistent, therefore, with the 
theoretical positions that consider emotional- 
ity as a response instead of a drive (Lazarus: 
1968; Leventhal, 1970). 

If we adopt a position that treats fear reat- 
tions and efforts to influence the environment 
(obtaining analgesics and length of hospita 
stay) as joint products of the appraisal of the 
threat situation, as suggested by the parallel 
model or Lazarus’ (1968) cognitive model 0 
stress, we have no difficulty with the finding 


that fear and instrumental responses form two 
| classes of responses which tend to be internally 
“consistent and independent. Thus, postopera- 
tive emotional response is related linearly to 
pri ative emotional response and, for later 
; , the best predictor of the emotional 
‘response in the total situation is the patient’s 
"— score on the MAS. However, for firstborns, 
1 the relationship between MAS scores and emo- 
tional response was not a simple linear func- 
tion, and neither was it consistently a com- 
plete curvilinear function. 

The two measures of influence of the en- 
vironment, doses of analgesics and days hos- 
pitalized, on the other hand, were related to 
scores on an I-E control scale. The belief that 
one controls his fate, which affects learning 
p (Seeman & Evans, 1962) and performance 
(Phares, 1965) in other situations, seems to 
have had its effects in surgery. But days of 
hospitalization were high, for middles and 
highs in I-E control only if they were first- 
borns; internal patients, regardless of birth 
order, obtained more analgesics than externals. 
The physicians’ orders permitted pain medi- 
cation to be repeated every 4 hours if needed, 
and on the first postoperative day (the day 
_that the most pain is experienced), medium 
and low I-E patients received 2.5 doses or 1 
‘dose every 9.6 hours. High I-E patients re- 
ceived 3.3 doses or 1 every 7.3 hours. None of 
the dosages could be regarded as extreme, and 


TE scorers among firstborns suggests that 
both the efforts to influence the environment 
Land fear mechanisms may interact with one 
F another in affecting the postoperative behav- 
Jor. This interaction can be interpreted by as- 
suming that birth order changes the patients’ 
goals; when firstborns are exposed to stress, 
d become frightened and dependent and 
orient themselves toward maintaining contact 
vith danger control authorities and remain 
hospitalized longer. But firstborns who are 
external have insufficient skill to influence 
their length of stay. Thus, they stay hospital- 
zed no longer than do later borns. Later borns 
Who are capable of influencing their stay are 
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both less dependent and less frightened and, 
therefore, do not seek to lengthen their stays. 

In summary, positions that consider emo- 
tionality and instrumental responses as prod- 
ucts of an appraisal of threat (Lazarus, 1968; 
Leventhal, 1970) seem to account for the data 
more easily than a unitary emotional drive 
model. There is also support for the idea that 
the emotional and instrumental responses are 
often parallel and independent of each other. 
However, it is also clear that these two sys- 
tems of response may interact with each other 
in complex ways; indeed, it seems that the 
activation of an emotional response system 


may change the goals toward which instru- — 


mental behavior is oriented. Further specifica- 
tion is needed for the models to adequately 
handle these interactions. 
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imilarity. 


hers have indicated that the per- 

similar attributes in another person 
Serves to increase the attraction felt toward 
that person (e.g., Byrne, Griffitt, & Stefaniak, 
1967; Griffitt, 1966). In fact, Byrne and his 
associates have postulated a “law of attrac- 
tion"—a positive linear function between at- 
action and similarity, where similarity is de- 
fined on the basis of perception of shared 
a tributes (Byrne & Clore, 1966). 

In a study of the effect of perception of 
similarity on appraisal of others and recall, 
"McLaughlin (1970) found the typical positive 
linear function between personality similarity 
attraction, as well as significantly better 
all for material associated with highly simi- 
lar rather than moderately similar or dissimi- 
lar stimulus persons. It was pointed out by 

his author, however, that a persistent prob- 
lem in studies dealing with personality simi- 
larity and attraction is that by creating a 
person differing from the subject along the 
Similarity dimension, it becomes highly prob- 
able that the experimenter has created a per- 
Son differing to the same extent along the 
likableness dimension. Subjects tend to choose 
likable traits as self-descriptive, and, conse- 
quently, the stimulus person will vary con- 
Comitantly in the possession of likable char- 
Y This research was supported by funds from the 
- National Science Foundation, Grant GS-1867. The 
author wishes to thank David Kleber for gathering 
the data for this study. 
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rhe aim of the present study was to distinguish the effects of similarity and 
lil ness on attraction and recall. Subjects were exposed to stimulus persons 
arving in similarity and likableness of personality traits in a 2 X2 design. 
nderson's ratings of personality-trait adjectives were used to manipulate 
legree of likableness. Both similarity and likableness were found to affect 
traction, though the influence of each variable depended on the nature of 
judgment. Recall was found to be influenced by likableness but not 


> 


Laughlin data supported this contention. Us- 
ing Anderson’s (1968) likableness ratings for 
personality-trait names, a mean likableness 
score was obtained for the self-descriptive 
adjectives that had served as the basis for 


acteristics. A secondary analysis of the 1 
manipulating the similarity dimension. There 


was a significant correlation between likable- 
ness and attraction, and when degree of lik- 
ableness was controlled in an analysis of co- 
variance design, the relationship between simi- 
larity and attraction was not statistically sig- 
nificant. 

Stalling (1970) arrived at somewhat the 
same conclusion in a study in which person- 
ality-trait adjectives previously rated on their 
pleasantness (evaluative dimension) and simi- 
larity to the subject’s own traits (similarity 
dimension) were used as unconditioned stim- 
uli with trigrams as conditioned stimuli in a 
Staats conditioning procedure (Staats & 
Staats, 1957). When evaluation and similarity 
were held constant across levels of each other, 
evaluation was found to predict the condition- 
ing of affective responses to trigrams, whereas 
similarity did not contribute to this predic- 
tion. Stalling concluded that it was the evalua- 
tive rather than the similarity component that 
conditioned the affective response. 

The purpose of the present study was to 
attempt to distinguish the effects of the simi- 
larity and evaluative (likableness) dimensions 
on appraisal and recall. To this end, aside 
from the typical procedure where subjects are 
exposed to stimulus persons concomitantly 
high in similarity and high in likableness, or 
low in similarity and low in likableness, some 
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subjects were asked to judge and recall infor- 
mation about stimulus persons who were simi- 
lar to themselves but relatively low in likable- 
ness, while others judged stimulus persons low 
in similarity but high in likableness. In addi- 
tion, since it was felt that personality char- 
acteristics may influence appraisal and re- 
call, the subjects were administered the Byrne 
Repression-Sensitization scale (Byrne, 1961) 
and the Marlowe-Crowne Social Desirability 
Scale (Crowne & Marlowe, 1960). 


METHOD 
Subjects 


Subjects were 64 females (median age = 18) re- 
cruited from a summer school population at the 
University of California, Santa Cruz, through public 
advertisements soliciting volunteers. Most of the sub- 
jects (75%) had not previously taken part in a 
psychology experiment, and only 7 were psychology 
majors. Each girl was paid $2 for participating in 
the experiment, 


Procedure 


Subjects reported singly for what had been adver- 
. tised as a study in personality assessment. The sub- 
ject was told that she and her partner (supposedly 
another girl in an adjoining room) were to fill out 
various personality questionnaires, The experimenter 
first asked the subject to complete a short personal 
data sheet (requesting such information as age, year 
in school major, hobbies, etc.). Then the subject 
was given a brief selí-descriptive questionnaire list- 
ing 20 adjectives from which she was to check 10 
as descriptive of herselí. She was also asked to list 
5 of the 10 as most descriptive of her personality. 
The experimenter then collected these two sheets (the 
personal data sheet and the self-descriptive question- 
naire) and told the subject that these would be ex- 
changed with her partner. The subject was then to 
fil out a personality test (the Byrne Repression- 
Sensitization scale) that she was told would not be 
shown to her partner. 

While the subject was completing the personality 
test, the experimenter left the room and constructed 
a stimulus person who varied on the similarity and 
likableness dimensions of the self-descriptive ques- 
tionnaire according to a predetermined random order. 
Once the subject had finished the personality test, 
the experimenter returned and told the subject that 
she was to read her partner's personal data sheet and 
self-descriptive questionnaire so as to assess her 
personality. A fabricated personal data sheet (the 
same for all subjects) and the self-descriptive ques- 
tionnaire (varying in similarity and likableness) 
were then presented to the subject, and she had 2 
minutes to read through them. 

After this, the subject was asked to fill out two 
questionnaires that were intended to assess her at- 
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traction to the stimulus person and her willing 
interact with her partner, The subject also 
stimulus person on similarity and likableness al 
a 7-point scale. The subject was then presented Wi 
a blank personal data sheet and asked to give 
the information she could remember that h: 
peared on her partner's sheet. Finally, the sı 
filled in a second personality test (the Marloy 
Crowne Social Desirability Scale). When the subjec 
finished this questionnaire, she was asked what sh 
thought the experiment was about and whether sh 
was suspicious at any point, She was then told 
purpose of the experiment, sworn to secrecy, 
paid for her participation. 


Experimental Conditions and Dependent 
Variables 


Similarity, In the high-similarity condition, tl 
subjects were to evaluate the stimulus person on th 
basis of a self-descriptive questionnaire that was 
same as the subject’s own in 8 of the 10 adje 


checked and in 4 of the 5 acteristic ad 
tives. In the low-similarity condition, the 
descriptive questionnaire the stimulus p 


was the same in only 2 of the 10 adjectives che 
and in only 1 of the 5 most characteristic adjecti 
Likableness. Subjects in the high-likable con 
were provided with self-descriptive questionna 
consisting of 20 adjectives with likableness rai 
ranging from 494 to 555 on Anderson’s (1968) 
of personality-trait adjectives. Subjects in the I 
likableness condition were given self-descriptive q 
tionnaires consisting of 20 adjectives with likab 
ratings ranging from 228 to 399. In the high 
ness condition, the mean rating of the 10 adje 
chosen by subjects as self-descriptive was 529.1, WI 
the mean rating for the 5 adjectives chosen as m 
descriptive was 531.2. The corresponding values ft 
the stimulus persons were 5252 and 527.6. In 
low-likableness condition, the mean rating for 
10 adjectives chosen as selí-descriptive was 3 
and the mean rating for the 5 most descriptive 
tives was 340.6. The corresponding values for. 
stimulus persons were 333.8 and 3354. None of 
differences within conditions was statistically sij 
cant, but the differences between high- and I 
likableness conditions were highly significant. 
Attraction. To assess attraction to the stimul 
person, the subjects rated 10 highly evaluative 
polar adjectives (‘intelligent-unintelligent,” “mat 
immature,” etc.) along a 7-point scale, The 
score for these ratings constituted the measure í 
attraction, with high scores indicating greater 
traction. There was no overlap between the set 
adjectives used to measure attraction and thi 
from Anderson’s list used to establish conditions 
similarity and likableness. K 
Interaction. The measure of interaction was deri 
from the subject’s responses to three questions 
supposedly dealt with “the rest of the experim 
(a) “How willing are you to work with your p 
on this last part?" (very willing-prefer not 


SIMILARITY AND LIKABLENESS ON ATTRACTION AND RECALL 


(6) “How willing are you to discuss your question- 
naire with your partner?” (very willing-prefer not 
to); (c) “How willing would you be to discuss 
matters of pe importance with your partner?” 
(very willing r not to). For each item, a 7-point 
scale was us that the measure of interaction 
could range fro (prefer not to interact) to 21 
(very willing eract). 

Recall. Recaii scores were based on the subject's 


ability to reco 
data sheet, The 
year in college, € 


t the stimulus person’s personal 
were 30 items on this sheet (age, 
hobbies, names of three siblings, 


etc.), and eact ;onse recalled correctly was scored 
1 point. 
RESULTS 

Subjects in the high-similarity condition 
rated the stimulus person as higher in simi- 
larity than did subjects in the low-similarity 
condition (¢ = 5.17, df = 62, p < .001). Sub- 
jects in the } likableness condition rated 
the stimulus person as higher in likableness 


than did sut 


in the low-likableness con- 


dition (1 = 2.71, df = 62, p < .01). Thus, the 
effort to manipulate similarity was successful 
across leve ikableness, and the manipula- 


tion of likableness was successful across levels 
of similarity. 

Mean ratings for attraction are given in 
Table 1. Significant main effects were obtained 
for similarity (F = 6.12, df = 1/60, p < .05) 
and for likableness (F = 11.70, df = 1/60, 
P « 01). The interaction was not significant. 
The data ior willingness to interact with the 
partner revealed a marginally significant ef- 
fect for likableness (subjects being more will- 
ing to interact with more likable stimulus 
persons; F = 3.90, df = 1/60, .10 > p> .05), 
while the main effect for similarity and the 
interaction did not approach significance. 

The mean recall scores for the subjects in 
the various conditions are shown in Table 2. 
The main effect for likableness was significant 


TABLE 1 
.. Mean ArTrRACTION RATINGS PER CONDITION 


Likableness 


Similarity MEANE Total 
High Low 

High | 59.89 54.75 57.32 

Low | 56.69 47.81 52.25 

Total 58.29 51.28 54.78 


Note.—High s i Si: 
: igh scores ^ "ate action, Sixteen sub- 
cte per cell! $ ores indicate greater attracti k 


TABLE 2 
Mean RECALL SCORES Per Conprrion 


1 
Likableness 
Similarity |- - Total 
| ‘High | Low 
High 23.0 26 | 28 
Low 22.6 20.5 21.6 
Total 22.8 20.6 21.7 


Note.—High scores indicate greater recall, Sixteen subjects 
per cell. 


(F = 4.74, df = 1/60, p< 05). Similarity 
had no effect on recall, and the interaction was 
not significant. 

When the subjects were divided at the 
median on the Repression-Sensitization scale 
and the data analyzed in a 2 X 2 X 2 analysis 
of variance design, those at the repression end 
showed significantly more willingness to in- 


teract with the stimulus person than did those _ 


at the sensitization end (F = 5.01, df = 1/56, 
P < .05). A similar analysis for subjects who 


scored above and below the median on the | 


Social Desirability Scale indicated that high - 


scorers were more willing to interact than 
those who scored below the median (F = 6.00, 
dj = 1/56, p < .05). Since the correlation be- 
tween scores on the Repression-Sensitization 
scale and the Social Desirability Scale was 
—.59 (p < .001), it was not surprising to find 
similar outcomes (in the opposite direction) 
for the two personality variables. In fact, 23 
of the 32 subjects in the repressor group were 
in the high social desirability group. Analysis 
of the data for attraction and recall did not 
reveal significant differences between subjects 
scoring above and below the medians on the 
personality tests. 


DISCUSSION 


Byrne and his associates have conceptual- 
ized attribute similarity as a reinforcing event 
(Byrne, Griffitt, & Clore, 1968; Byrne et al., 
1967). As Stalling (1970) has noted, however, 
this is more true of similarity in attitude 
statements than in personality traits. Simi- 
larity in personality traits may not always be 
reinforcing to a subject, since the subject may 
possess traits that are negatively valued or 
lack traits that are positively valued. In such 
cases, highly similar stimulus persons may be 
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less appealing to the subject than persons 
somewhat dissimilar. Stalling has suggested, in 
fact, that similarity does not function as a 
reinforcer beyond the extent to which it cor- 
relates with evaluative meaning. 

In the present study, however, similarity 
was found to influence appraisal across levels 
of evaluative meaning. That is, attraction 
ratings were significantly more positive in the 
high- than in the low-similarity conditions 
regardless of whether the stimulus person was 
high or low in likableness. On the other hand, 
attraction ratings were also significantly more 
positive in high- than in low-likableness con- 
ditions across levels of similarity. It may be, 
therefore, that both variables affect appraisal, 
but in different ways. 

To test this hypothesis, a further analysis 
of the data for attraction was carried out. 
Analyses of variance were performed for each 
of the 10 adjective rating scales, the sum 
total of which constituted the measure of 
attraction. Examination of the data (Table 3) 
revealed significant main effects for similarity 
on “perceptive,” “adjusted,” “moral,” and 
“warm.” Significant main effects for likable- 
ness were obtained on “dependable,” “trust- 
worthy,” “cooperative,” and “warm.” This 
suggests that interpersonal attraction is not 
a unidimensional trait and that appraisal 
is differentially affected by similarity and 
likableness. 

The multidimensional nature of interper- 
sonal attraction has been noted by a number 


TABLE 3 


SUMMARY OF ANALYSES OF VARIANCE FOR RATINGS 
or BIPOLAR ADJECTIVES: MAIN EFFECTS 
FOR SIMILARITY AND LIKABLENESS 


* Similarity Likableness 
Item 
F sz F- A 
Perceptive-not perceptive 7.97 EU! <1 
Adjusted-not adjusted 14.10 | .01 3.53 | ns 
Moral-immoral 4.19 | .05 DE 
Warm-cold 9.90 | .01 9.900 |.01 
Dependable-not dependable <1 7.73 |.01 
Trustworthy-not trustworthy 3.30 | ns 7.73 |.01 
Cooperative-uncooperative <1 37.66 | .001 
Mature-immature 3.70 ns 3,70 | ns 
Intelligent-not intelligent 3.10 ns <1 
Knowledgeable concerning 
current events-not knowl- 
edgeable «1 El 


Note.—In each case, a significant F indicates that subjects in 
high-similarity or high-likableness groups gave favorable rat- 
ings on adjective polarities, None of the interactions attained 
significance. 
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of authors (Coleman, 1961; Kiesler & 
berg, 1968; Newcomb, 1960; Triandi 
Triandis factor-analyzed questionnaire items 
relating to various aspects of attraction and 
found five independent factors. Kiesler and 
Goldberg also employed a factor-analysis ap- 
proach and extracted two factors, one repre- 
senting socioemotional and one representing 
task-related aspects of interpersonal attrac- 
tion. These two orthogonal factors closely 
approximated two of Triandis’ factors and 
corresponded to similarly named roles in 
group process (Bales & Slater, 1955), 

A distinction based on socioemotional and 
task-oriented categories of interpersonal at- 
traction seems compatible with the data of 
the present experiment. Similarity appears to 
affect judgments about the stimulus person 
along the socioemotional dimension, while 
likableness appears to affect judgments along 
the task-oriented dimension. Further support 
for this interpretation comes from subjects’ 
ratings on two statements (obtained immedi- | 
ately after completion of the appraisals): 
(a) “This person [definitely would-dchnitely 
would not] fit in with my circle of close 
friends,” and (b) “This person | definitely 
would-definitely would not] make a good 
work partner.” Similarity (but not likable- 
ness) was found to have a significant effect 
in determining whether the person would fit 
into the circle of friends (F = 11.56, df= 
1/60, p < .01), whereas likableness (but not 
similarity) made a difference when it was à 
question of work partner (F = 5.85, df= 
1/60, p< .05). Thus, in Byrne' terms, 
both similarity and likableness function as 
reinforcers, but the extent to which they 
influence appraisal of others depends on the 
particular judgments the subject is asked to 
make about the stimulus person. 

Scores on the Repression-Sensitization scale 
and the Social Desirability Scale showed no 
significant relationship to attraction or recall 
but did relate to expressions of willingness to 
interact with the stimulus person. Repressors 
and subjects in the high social desirability 
group were significantly more willing to inter- 
act with the stimulus person than were sensi- 
tizers or subjects in the low social desirability 
group. The subjects statements of willingness 
to interact may simply reflect the tendency 


Gold- 
1964). 


ain subjects (repressors and those high 
lity) to be more likely than 
ts to yield to what are per- 


d to be ihe demand characteristics of 
ituation 

ghlin (1970) found that similarity 

nced re (subjects recalling more in- 

m tion about similar stimulus persons), 

dn that study, stimulus persons high in 

ty were concomitantly high in likable- 

iss while those low in similarity were con- 

itantly low in likableness. Thus, subjects 

e earlier study corresponded roughly to 


s in the high similarity-high likable- 


e present study. A comparison of these 
oups in the present study indicates that 
is a significant variable (p < .05, 


low in similarity are compared across 
of likableness, there are no significant 
rences in recall. In. contrast, more infor- 
is recalled about more likable sub- 
"across levels of similarity. 

"this experiment, similarity and likable- 
vere manipulated separately and were 
to have differential effects on interper- 
| attraction and recall It should be 
d out, however, that there may be sev- 
classes of traits besides likableness that 
when similarity is manipulated in 
of interpersonal attraction. Further 
is needed to determine whether this 
case and, if so, how these traits affect 
personal attraction and recall. 
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MEDIATION OF EXPERIMENTER EFFECTS! 


RUSSELL A. JONES? 


University of Kentucky 


In previous research, positive correlations have been found between the number of 
experimenter-subject mutual glances and the extent to which subjects perceived 
success in the photos of Rosenthal's person-perception task. In the present study, 
two experiments are reported which were intended to obtain evidence bearing or 
the hypothesis that subjects’ projection of the feeling tone of their interaction with 
the experimenter could influence the results obtained with the person-perception 
task and that the number of mutual glances between experimenter and subject was 
an important determinant of that feeling tone. A pilot study demonstrated that 
subjects who had just experienced success prior to the person-perception task were 
more likely to perceive success in the neutral photos than were subjects who had 
just experienced failure prior to the task. The main experiment found a direct rela 
tion between the number of times experimenters glanced at subjects and both (a) 
the subjects’ self-reports of happiness, success, etc., and (b) the subjects’ tendency 
to perceive success in the neutral photos. 


Utilizing a “person-perception” paradigm 
in which subjects are shown photographs of 
human faces and are asked to rate the pictures 
on a dimension of success-failure, Rosenthal 
(1966) and his colleagues have demonstrated 
that the responses of subjects in such a situa- 
tion can be unintentionally influenced by the 
expectations and desires of the experimenters, 
by certain biosocial characteristics of the ex- 
perimenters, and by various features of the 
interaction itself. Recently, several authors 
have pointed out the need for studies to deter- 
mine what factors mediate these experimenter- 
bias and expectancy effects (Barber & Silver, 
1968; Friedman, 1967). It has been pointed 
out that until the processes are understood, 
the scope of the phenomena cannot even be 
estimated. 

Barber and Silver (1968, p. 25) have sug- 
gested several possible variables that might 
mediate some of these experimenter effects. 
The most promising leads, however, appear to 
be contained in some suggestive evidence and 
à very tentative hypothesis mentioned by 
Friedman (1967). In an analysis of filmed in- 
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teractions in which experimenters were 
ministering Rosenthal’s person-percc; ion tas) 


to individual subjects, ao zined 

significant correlations. Each f th: followin; 
variables correlated positively with the sub 
jects’ tendency to perceive success in thi 
neutral photos: (a) the number of glance 
exchanged between the experimenter and the 


ber of times the experimenter glan 
subject in the instruction period, (c) th 
the duration of the instruction period, (d) tht 
longer the duration of the prerating period 
(e) the longer the duration of the rating period, 
and (f) the longer the duration of the total 
experiment. Thus, it appears that the tendency 
for the subject to perceive success in the photos 
is related somehow to exchanging glances with 
the experimenter and to time. Friedman offered 
a rather convincing argument about what hé 
thinks the nature of these relations to be. 

Citing Gofiman and Simmel on the impor 
tance of mutual glances in social interaction 
Friedman (1967) suggested that: 


A prosaic deduction from these paeans to the mutua 
glance would be that in exchanging glances with 
experimenter, the subject is given a "good" experience 
he feels accepted; he feels that their interaction ha 
been “warm.” He has no context in the picture fi on 
which to judge whether the person has been experient 
ing success or failure. Might it not be that he uses th 
feeling tone of his interaction with the experiment@ 
in making this judgment? Having felt success, does 
perceive success [p. 55]? 


This certainly appears to be a possibility, bul 
70 


what about t! 
be, for exam; 
menter takes 
more likely t} 
in the photo 


ime variable? Why should it 
that the longer the experi- 
ieliver the instructions, the 
bject is to perceive success 
iedman (1967) pointed out 


that : 

The experi can proceed with bureaucratic 
curtness. In such a case, the subject may feel as though 
he is being tr like an object, a means to an end, 
an uninterestir ;endage to an interesting numerical 
score. To the e hat the greater length of time spent 


indicates to the subject that he is 
human being of some intrinsic in- 


in the interact 
being treated 


terest or worth, to that extent he may have a better 
feeling about . the longer (within limits) the 
experimenter and ject interact, the more likely it 
will be that į ject feels the experimenter is in- 
terested in him . And so, by the same reason- 
ing that was | to the mutual glance, the longer 
the interaction, the more likely the subject will perceive 
success in the acutral photos [p. 56]. 
As Friedman pointed out, the two interpreta- 
tions sugges projection hypothesis: “the 
subject sees in the photos what his interaction 
with the experimenter leads him to feel himself 
[p. 56]." 

There are a great deal of data documenting 
the affective significance of the mutual glance. 
For example, Exline and Winters (1965) found 


that receipt of a negative evaluation from 
another was followed by a sharp decrease in 
mutual glances exchanged with the other. Re- 
Ceipt of a positive evaluation led to a slight 
increase in mutual glances. It also has been 
Shown that the relation between eye contact 
and positive affect for another may be direct 
or inverse depending on the nature of the 
verbal exchange involved (Ellsworth & Carl- 
smith, 1968), 

What appears called for, however, is a direct 
test of Friedman’s notions concerning the in- 
fluence of “feeling tone" of the experimenter- 
Subject interaction on the subject’s perception 
of success-failure in the photos used in Rosen- 
thal’s person-perception task. Specifically, we 
were interested in whether subjects who ex- 
changed a large number of mutual glances with 
the experimenter would be more likely to 
Perceive success in the neutral photos than 
Subjects with whom the experimenter ex- 
changed only a few glances. First, however, it 
Was deemed essential to demonstrate that 
Projection of positive and negative affect can 
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be picked up with ratings on the person- 
perception task. 

To accomplish the latter objective, a three- 
condition pilot study was carried out. Prior to 
the photo rating task, subjects (45 Princeton 
undergraduates) took and were given false 
feedback (success-neutral-failure) on a **high- 
level intelligence test." The experimenter-sub- 
ject interaction was completely standardized 
by use of videotape instructions for the intelli- 
gence test and administration of the person- 
perception task. Feedback on the intelligence 
test was provided by a proctor* who handed 
each subject a card on which his score and 
norms for Princeton undergraduates were 
written. Immediately after receiving this feed- 
back, subjects began the photo rating task. 

The prediction, of course, was that those 
subjects who were told they had done very 
well on the intelligence test would be most 
likely to perceive success in the photos, while 


Laien 6... Nc 


those who had done very poorly on the test r 
would be least likely to perceive success in t 


photos. Each subject rated 10 photographs on p 


a scale of — 10 (extreme failure) to + 10 
(extreme success). A subject’s score was the 
algebraic mean of his 10 ratings. 

Tt was found that subjects who underwent 
a successful experience prior to viewing the 
photographs viewed the faces pictured in the 
photographs as being more successful (X 
— 1.39) than subjects who underwent neutral 
(X = .30) or failure (X = — .77) experiences. 
An analysis of variance performed on the data 
showed the overall effect of the feedback mani- 
pulation to be highly reliable (F = 6.09, 
df = 2/42, p < 01). 

Both the failure and success conditions were 
marginally different from the ratings of sub- 
jects in the neutral condition (p < .10). But 
when the picture ratings of subjects who had 
experienced success were compared to those 
of subjects who had experienced failure, the 
eflect on the picture rating was quite strong 
(E = 12.16, df = 1/42, p < 01). 

Our original intention was to demonstrate 
that the eye contact between an experimenter 


3 The authors would like to thank Cynthia Conroy 
for serving as the proctor in the pilot study and as 
the experimenter in the main experiment. 'They are 
also indebted to Andrea Sedlak for recording the video 
instructions. 
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and a subject is an important factor in making 
a subject feel positively or negatively about 
himself, and the feelings so generated can be 
projected onto the stimulus photographs. The 
pilot study demonstrated that a subject’s view 
of himself is projected: Subjects who were 
made to feel successful saw the photographs 
as successful, whereas subjects who had ex- 
perienced failure saw the same photographs as 
unsuccessful. Next, it is necessary to show that 
subjects who receive consistent and frequent 
eye contact from experimenters feel more 
positively about themselves than subjects who 
receive very little eye contact. As the pilot 
study suggests, differences in the way subjects 
are made to feel should be reflected in the 
ratings given to the stimulus photographs. 

An experiment was designed in which assis- 
tant experimenters, blind as to the hypotheses 
of the experiment, administered the person- 
perception task to the subjects. Half of the 
assistant experimenters were instructed to 
make frequent eye contact with the subjects, 
while the other half were instructed to avoid 
eye contact. To test the hypothesis, the sub- 
jects’ self-evaluations were assessed. It was 
predicted that subjects who had received con- 
siderable eye contact would feel more positively 
about themselves and would project their 
positive feelings onto the stimulus photographs. 


METHOD 
Subjects 


Eighty male high school students participated in the 
experiment. They came in response to advertisements 


promising to pay $1.50 for a study entitled “Picture 
Rating.” 


Procedure 


Two students were scheduled for each experimental 
session. When the experimenter arrived, she announced 
that “for purposes of control,” one of the students 
would assist in conducting the experiment while the 
other would serve as the subject. She asked the student 
who arrived first to serve as the assistant experimenter 
and ushered him into a second room to describe the 
task. 

The experimenter thanked the student for agreeing 
to help and explained that past research had shown it 
to be better if many people conduct an experiment 
rather than just one person. Consequently, half of all 
students who volunteered for the experiment were 
being asked to serve as assistants. The experimenter 

then gave the assistant a printed set of instructions and 
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commented : “In this experiment, you will be replicating 
a very well-known study. Here are the instructions 
that I would like you to read to the subject | will give 
you an opportunity to practice the instructions before 
you actually deliver them to the subject.’ the high- 
eye-contact conditions, the experimenter d: “Oh, 
by the way, it is also very important that you look at 
the subject as often as possible—let's say at least 30 
times—while you deliver the instructions. Here is an 
example of how you should read.” The experimenter 
then illustrated the technique for the assistant 

In the low-eye-contact conditions, the experimenter 
substituted the sentence, “Oh, by the way, i! is very 
important that you not look at the subject while you 
deliver the instructions." She then gave the assistant 
an example of the technique. 

The instructions that all assistant experimenters 
were to read were adapted from Rosenthal (1966, p. 
184). They informed the subjects that Princeton Uni- 


versity was in the process of developing a test of 
empathy and instructed them in the picture rating task 
and the use of the rating scales. As the assistant experi- 
menters administered the instructions to the subjects, 
the experimenter sat behind an observation screen and 
counted the number of times that the assistant glanced 
at the subjects. 

After the assistant, experimenters had explained the 
task, half of all su jects were * shown the 10 photographs 


used in the pilot study and were asked to rate them. | 


However, the other half of the subjects in the high- 
and low-eye-contact conditions heard the assistant 
experimenter add: “Before I show you the pictures, 


Thave been asked to give you this short questionnaire. 
The instructions are at the top of the page. When you 
are finished, I will show you the photographs.” The 
questionnaire was designed to assess the mood of the 
subjects. It was given to only half of the subjects since 
the possibility existed that the administration of the 
questionnaire might, in some way, interact with the 
ratings that subjects would give to the stimulus photo- 
graphs. The questionnaire consisted of seven pairs of 
polar adjectives, such as “happy-sad,” "fortunate- 
unfortunate,” and "unsuccessful-successful." Subjects 
were asked to rate their present feeling about them- 
selves on a 15-point continuum with regard to each of 
the seven pairs of adjectives. After the subjects had 
completed the scales, the person-perception task was 
administered. 

At the conclusion of the person-perception task, the 
experimenter thanked the subject and the assistant for 
their participation and,explained the true nature of the 
experiment to them; The reasons for the use of decep- 
tions were discussed. 


RESULTS 


The experimenter’s tally of the number of 
glances which the subjects received from the 
assistants revealed that the assistants had 
carried out their assignments. Assistants in the 
high-eye-contact condition glanced an average 
of 30.1 times at their subjects. On the other 


assistants in the low-eye-contact con- 
ms glanced an average of .15 times. 
if of the subjects in the high- and low-eye- 


tact conditions were given a list of seven 
$ of polar adjectives designed to assess 
ir mood after they had been exposed to the 


ation. The adjectives were 

vat an extremely positive rating 
any pair of adjectives was assigned a score 
1, while an extremely negative self-rating 
assigned the score of 15. Each subject’s 
d score was the sum of his ratings on the 
adjective pairs. Total scores could range 
rom an extremely positive self-rating of 7 to 
à 5 tive 105. 

The results showed that subjects in the high- 
tact condition felt more positively 
ut themselves than subjects in the low-eye- 
intact group. The mean rating for subjects in 
€ high-eye-contact condition was 38.3, while 
low-eye-contact condition had a mean of 
A | test revealed the difference to be 
ically reiiable (f= 2.53, df= 18, p 


ubjects’ ratings of the stimulus photographs 
e treated as a 2 X 2 factorial design. The 
eye-contact groups provided one factor, 
e being or not being exposed to the mood 
onnaire served as the second factor. The 
-sented in Table 1. In support of 
"hypothesis, it can be seen that the subjects 
the high-eye-contact conditions rated the 
mulus photographs as considerably more 
ssful than the subjects in the low-eye- 
tact conditions (F = 880, df = 1/36, 
01). The F ratios for both the interaction 
presence or absence of the mood scale were 
than 1, 
n H . 
An inspection of Table 1 suggests that the 
erence between eye-contact conditions was 
re pronounced when subjects took the mood 
questionnaire than when they did not take 
; questionnaire. Individual comparisons per- 
med on the data indicated that while the 
Allerence between the two eye-contact groups 
highly significant for subjects who were 
sed to the mood questionnaire (F = 6.21, 
= 1/36, p < .05), it was only marginally 
hificant for subjects who did not take the 
SWsstionnaire (F = 2.82, df = 1/36, p < .10). 
~ appears that being exposed to questions 
ding one's mood may have highlighted, 
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TABLE 1 
MEAN RATINGS OF THE PHOTOGRAPHS 


| Eye contact* 
Mood scale — | 
Low High 
Administered —.67 | +.80 
Not administered — 40 +.59 


Note,—The higher the number, the more successful the 
rating. 
en = 10 in each group, ; 


and therefore accentuated, the positive or 
negative feelings induced by the eye-contact 
manipulation. The interaction between the two 
variables did not approach statistical signifi- 
cance (F < 1). 


DISCUSSION " 


It appears that the hypotheses of the two. 
experiments have been supported by the data. A 
It was demonstrated that subjects who had 
received “success” feedback immediately prior 
to the person-perception task perceived greater 
success in the neutral photos than did subjects 
who received “failure” feedback prior to the 7 
person-perception task. Further, it was shown E 
that (a) when the person administering the 
instructions for the person-perception task 
maintains a great deal of eye contact with the 
subject, the subject rates himself as feeling 
happier, more successful, etc., than when the | 
experimenter does not look at him during the b 
instructions; and (b) when the experimenter 
maintains a great deal of eye contact with the 
subject during the instruction period, the sub- 
ject perceives greater success in the neutral 
photos than when the experimenter maintains 
little eye contact during the instruction period. 
Taken together, the two studies offer rather 
good support for the hypotheses. 

There appear to be at least two issues raised 
by the results, however, that require additional 
comment. The first concerns the nature of the 
process by which the “feeling tone” of the 
experimenter-subject interaction comes to be 
reflected in the ratings of the pictures, and the 
second concerns the implications the present 
studies have for the investigation of experi- 
menter bias. 

We have used the term “projection” rather 
loosely to mean a direct relation between a 
subject’s own state of happiness or success 
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and the amount of happiness or success that 
he attributes to the people pictured in Rosen- 
thal’s stimulus photos. A study by Goldings 
(1954) indicates that this is too simplistic an 
approach. Goldings found that subjects who 
claimed to be extremely happy saw others 
(“neutral photos”) as being unhappy, while 
subjects claiming to be extremely unhappy saw 
others as being very happy. Conversely, sub- 
jects who rated themselves as less extreme on 
the happiness scale saw others as possessing 
approximately the same affective state as 
themselves. 

It appears that projection is not a unitary 
concept. In fact, Holmes (1968) has called 
attention to several possible varieties of pro- 
jective experience. He has argued that there 
are at least two dimensions along which pro- 
jective phenomena may vary: (a) whether 
the trait or attribute ascribed to the other 
person is the same or different from the trait 
or attribute possessed by the projector and 
(b) whether the projector is or is not aware of 
his possession of the trait or attribute that he 
is ascribing to others. 

The subjects in our experiment were engag- 
ing in a very specific type of projection which 
Holmes (1968) has called “attributive projec- 
tion." Attributive projection is said to occur 
when the person sees others as having the 
same trait which he is aware of in himself. 
There are, as Holmes pointed out, several 
possible mechanisms underlying attributive 
projection. One such mechanism might be 
stimulus generalization; that is, subjects may 
simply generalize their own feclings and/or 
characteristics to others who are similar to 
them in certain ways. This possibility would be 
easy enough to test. Subjects first could be 
given bogus feedback, as was done in our pilot 
study, and could then be asked to perform the 
person-perception task. The stimulus photo- 
graphs of the person-perception task would be 
varied systematically in the extent to which 
the face depicted was similar to the subject 
(sex, age, race, etc.). If stimulus generalization 
underlies attributive projection, one would 
expect the most projection to occur on those 
photos that were most similar to the subject. 

While we feel that the present studies offer 
evidence that attributive projection is—and 


has been—a factor in studies using the 
perception task, we do not mean to imply th 
projection is the only mediating process or tl 
visual cues are exclusively involved. Recen 
Duncan and Rosenthal (1968) presented 
dence that unintentional voice inflections of 
experimenter can play an important role 
producing biased ratings of the photographs 
while Minor (1970) has suggested that eval 
tion apprehension may play an important 
in mediating experimenter expectancy effe 
It may be profitable for future research 
proceed in two very important directions; 
First, the role of visually mediated attributiv 
projection can be tested in the natural 


teacher expectancy phenomenon studied, 
Rosenthal and Jacobson (1968). Se 
further laboratory research is called for v 
can institute ways of standarizing th 
menter-subject interaction so t 


mediating variables can be specified. 
Ine À 
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WHEN SIMILARITY BREEDS CONTEMPT’ 


SHELLEY E. TAYLOR anp DAVID R. METTEE * 
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Female subjects interacted with a female confederate who was represented as 
I imilar or dissimilar to them and who behaved in either a pleasant or 

} s manner, It was predicted that a pleasant similar other would be 
aore than a pleasant dissimilar other, but that an obnoxious similar 
vould be disliked more than an obnoxious dissimilar other. The hypoth- 

ere supported, and the effect was found to be mediated in part by fear 
ssessing the undesirable trait and in part by the greater affective potency 
similar partner. Results are discussed in terms of anxiety reduction and 
eater salience of a similar person’s characteristics in the social comparison 


CE e x i Social Psychology 

Recent research has demonstrated that 
people are attracted to those who hold similar 
attitudes (By: 1961; Rokeach, 1960) or 


who have similar personality traits (Byrne, 
Griffitt, & Stefaniak, 1967). Novak and Ler- 
ner (1968), however, suggested that in the 
context of other motives, similarity may as- 
sume negative rather than positive implica- 
tions. They created a situation in which a 
subject found himself to be similar or dissimi- 
lar to either 2 normal individual or a former 
mental patient. Consistent with previous find- 
ings, a similar normal partner was approached 
more and liked more than a dissimilar normal 
partner, In contrast, the similar mental pa- 
tient was avoided more than the dissimilar 
mental patient, though he was still liked bet- 
ter than the dissimilar mental patient. 

The Novak and Lerner (1968) study leaves 
Several points unclear. First, as Cooper 
and Jones (1969) have noted, Novak and 
Lerner's subjects were aware that the experi- 
menter knew both that they were similar to 
their partner and that the partner was emo- 
tionally disturbed, Cooper and Jones reasoned 
that the subjects may have avoided the part- 
ner, not because of any personal anxiety on 
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their part, but solely to disassociate them- 
selves from the partner in the experimenter's 
eyes; Cooper and Jones found support for 
this reasoning. Second, the Novak and Lerner 
study addressed a person's desire to avoid a 
similar afflicted other but provided neither a 
theoretical rationale nor evidence that would 
lead one to suppose that a similar other will 
ever be liked less than a dissimilar other. A 
similar afflicted other might be avoided be- 
cause his presence would provide unassailable 
evidence that the same fate may befall any- 
one, including the subject. However, it is not 
clear from this argument that the similar 
afflicted other should be disliked, especially 
since the subject may feel sorry for him. In 
fact, the similar other was not disliked more 
than the dissimilar other in the Novak and 
Lerner study; hence, it is yet to be shown 
that the consequences of similarity may be 
negative in the sense of producing dislike. 

A theoretical framework that goes beyond 
fear of possessing the same affliction possessed 
by a similar other and predicts dislike 
of the similar other is Festinger's (1954) 
theory of social comparison. Assuming that 
individuals are motivated to evaluate the 
relative goodness of their abilities and opin- 
ions, Festinger predicts that individuals will 
compare themselves with similar others, since 
similar others furnish one with accurate in- 
formation about one’s own standing, On the 
other hand, dissimilar individuals, or persons 
who have abilities and opinions discrepant 
with one’s own, are likely to provide little 
valuable information on which to evaluate 
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one’s own characteristics. Similarity, there- 
fore, should render the behavioral character- 
istics of a person highly salient and meaning- 
ful to the other. If another’s behavior is 
highly salient and that behavior proves to be 
unpleasant, then that other’s behavior should 
be evaluated more negatively and produce 
more dislike than if it has little meaning for 
the person. Pleasant behavior as well should 
be perceived as more pleasant and conse- 
quently induce greater liking if it is salient 
than if it is not. 

In the present study, subjects interacted 
with a partner who was either pleasant or ob- 
noxious and who was represented as either 
very similar or very dissimilar to them. In 
line with the above reasoning, it was predicted 
that a pleasant partner would be liked more 
if she were similar than if she were dissimilar, 
whereas an obnoxious partner would be dis- 
liked more if she were similar rather than dis- 
similar. Reasoning that the fact of similarity 
would make the similar individual's behavior 
more salient, it was further predicted that an 
obnoxious similar other would be derogated 
more both when the similar other's obnoxious- 
ness did and did not imply that the subject 
herself might be obnoxious. 


METHOD 
Subjects 


Sixty-six female students from Quinnipiac College, 
Hamden, Connecticut, were assigned randomly to 
seven experimental groups. Three subjects were dis- 
carded from the analysis due to procedural complica- 
tions, leaving 9 subjects per group. They had volun- 
teered to participate in a study on how people form 
impressions of others and expected to spend time 
talking with a stranger and filling out questionnaires 
about their impressions of her. In all cases, the sup- 
posed stranger was an experimental accomplice. 


Preexperimental Procedure 


In volunteering for the experiment, each subject 
filled out a set of bogus personality tests that con- 
sisted of sentence-completion items, an adjective 
checklist, and true-false questions from the Cali- 
fornia Psychological Inventory. The tests were used 
as a guise for providing false similarity and dissimi- 
larity feedback in the experimental setting, 


Procedure 


When the subject arrived, she was greeted by the 
experimenter and asked if she were acquainted with 
Janet Martin, the fictitious name assigned to the 


confederate. The experimenter explained that it was 
important that subjects not know each other in ag 
impression-formation study. Shortly thereafter, the 
confederate arrived, and the two subjects were intro- 
duced. The experimenter then explaincd that one 
variable which determines the kinds oí impressions 
people form of each other is similarity~cissimilarity. 
Subjects were told that the test scorers had used their 
*personality tests" to match up pairs oí people who 


were either very similar or very dissimilar to each 


other. (The high reliability and validity of the tests 
were stressed, so that the subjects would not later 
attempt to deny their supposed relationship to the 
partner. The experimenter assured the subjects that 
she did not know their scores or if they were similar 
or dissimilar to each other. Subjects were then given 
sealed envelopes containing their own and their 
partner's scores and summary statements about their 


supposed similarity or dissimilarity. 

Subjects in the obnoxious-or ^ condition were 
given no information concerning similarity or dis- 
similarity. They were told only that the study con- 
cerned how people form impressions of others and 
that the tests they had filled out would ascertain 
how personality characteristics correlate with the 
impressions one forms. These subjects were then given 
only their own test scores, under the pretext that 
ethical restrictions on psychologists require that all 
subjects see their test results. 

Subjects in all but the obnoxious-only control 
group received their own and their partner’s scores 
on each of 15 personality traits. In the similarity 
condition, 12 of the 15 scores were very similar to 
the partner's scores, deviating an average of 4 points 
from them on a scale ranging from 0 to 
scores were discrepant with the pa 
deviating about 29 points from the par 
In the dissimilarity conditions, 12 scores were dis- 
similar to the partner’s scores, deviating an average 
of 41 points, and 3 scores were similar, deviating an 
average of 5 points from the partner's scores. In or- 
der to minimize the possibility that subjects might 
become distressed over particular scores that were 
higher or lower than their partner's, three alternate 
score forms were created. On one form, the confed- 
erate might score lower than the subjects on femi- 
ninity but higher on spontaneity, whereas on an- 
other form the confederate might score higher than 
the subject on femininity but lower on spontaneity: 
Subjects were randomly given one of the three alter- 
nate forms for whichever condition, similarity 07 
dissimilarity, they happened to be in. 

Every subject received the same set of scores re- 
gardless of whether she was similar or dissimilar t9 
her partner. The partner's scores were varied t? 
produce small discrepancies in the similar conditions 
and large ones in the dissimilar conditions. For eX- 
ample, in the similar conditions, the partner's "extra- 
version" score might be 4 points higher and her 
"sensitivity" score, 6 points lower than the subject* 
respective scores, whereas in the dissimilar condi- 
tions, her extraversion score might be 33 point 
higher and her sensitivity score, 35 points lower tha? 


Thus, while the range of the 
much greater in the dissimilar 
rall mean of the partner's scores 
espective of conditions. Regardless 
nilarity or dissimilarity to the con- 
s mean score was always slightly 
of the confederate. This was done 
ibject’s anxiety about her own 
aura of anonymity, the sub- 
code number “81,” and the con- 
1 the code number "49," 
subjects were given no-possession-possible 
, these subjects were given assur- 
lid not possess obnoxious qualities. 
ns, comments at the bottom of the 
s follows: 


Subjects similar (dissimilar) on most traits; be- 
havior appropriateness score missing for #49; 
, tactful, courteous, considerate, not 
"sive, unlikely to cause offense. 
out #49. Even if two girls are 
(similar) on behavior appropri- 
r, personality structure is more 
similar (dissimilar) to justify this 


rationale for the confederate's missing score was 
a low behavior appropriateness score might di- 
the effects of the obnoxious manipulation by 
warning the subject about her partner’s fault; a 
g score, on the other hand, would create a 
ion in the subject’s mind about the courtesy, 
sideration, etc, of the confederate without di- 
y warning her. 

y-six subjects, including all subjects exposed 


the pleasant confederate, received possession- 
ible feedback; that is, they were given no in- 
formation that might be construed as assurance that 
they did not possess the obnoxious trait. In these 
) Sonditions the comments read: 

- Subjects similar (dissimilar) on most traits; versa- 
tility score missing for #49; #81 is a very versa- 
file person. Even if two girls are very dissimilar 
(similar) on versatility, however, personality struc- 
- Üüre is more than sufficiently similar (dissimilar) 
_ to justify this match. 


The versatility score was absent in these conditions 
Bor to match the procedure with the no-posses- 
. Mon-possible conditions as closely as possible. No 
""Aavior appropriateness score appeared on either 
Posse Ssion-possible form. 
the obnoxious-only control condition, the sub- 
received only their own scores, identical to 
lose received by all other subjects. The comments 
Trad: “#81 is very tactful, courteous, considerate, 
hot especially aggressive, unlikely to cause offense.” 
- "his was done in order to match the no-possession- 

Possible conditions as closely as possible. 
posu after the subjects had received their scores, 
‘the confederate informed the experimenter that she 
“Was Missing a score, The experimenter explained 
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that a mistake had obviously been made and that 
she would try to clear it up. (This interaction was 
designed to call the naive subject's attention to the 
confederate's missing score.) About 1 minute later, a 
tape-recorded telephone ring sounded, and the ex- 
perimenter left the room, thus providing an oppor- 
tunity for the confederate to interact with the sub- 
ject. In the obnoxious conditions, the confederate 
berated the subject's college and criticized her mo- 
tives for attending it. She went on to say: 


My grades are really good and I have the 
money, so I'm transferring. I went to BU [Boston 
University] for an interview last weekend. What 
a laugh that was! I really psyched that guy out. 
You know how most people get nervous at inter- 
views? You probably do. Well, I never do. I 
figured out what he wanted me to say ahead of 
time and said it. After that interview, they'll have 
to let me in. 


In the pleasant conditions, the confederate asked 
the subject about her major, her hometown, her 
interests, and the like. She smiled a lot and agreed 
with the subject, and for the most part let the sub- 
ject do the talking. The confederate was blind to 
the subject's similarity-dissimilarity condition and 
to the no-possession-possible versus possession-possi- 
ble condition; the experimenter was blind to the 
subject's similarity-dissimilarity condition, to the 
no-possession-possible versus possession-possible con- 
dition, and to the confederate's behavior. 

Following this manipulation, the experimenter re- 
turned. She explained that for the remainder of the 
time, the subjects would alternately talk to each 
other for structured time periods and record their 
impressions of each other. She then explained that 
one set of questionnaires would be filled out immedi- 
ately in order to tap in on the impression-formation 
process at this point in time, She stressed that com- 
pletely honest and subjective responses were crucial 
and assured the subjects that she would not have 
access to their questionnaires. The experiment was, in 
fact, over at this point, and no further discussion 
took place. 

The confederate then excused herself to go to the 
ladies room. The experimenter debriefed the subject, 
using the procedure outlined by Aronson and Carl- 
smith (1968). The experimental procedure aroused no 
suspicions, and all subjects expressed interest in the 
details of the study. Prior to debriefing, a large 
number of subjects spontaneously expressed their 
feelings about their partner. In a large number of 
cases, reasons given for their feelings exactly paral- 
leled the rationale proposed by the authors. 


Dependent Measures 


Subjects filled out an impression-formation study 
scale, which consisted of a primary liking measure, 
two indirect liking questions, four manipulation 
checks, and two control questions. The direct ques- 
tion asked, “How much do you feel like your part- 
ner at this moment?” The other liking questions 
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asked, “How good a first impression does your part- 
ner make?” and “Do you think your partner is gen- 
erally well-liked by others?” Other measures were 
designed to check on the similarity manipulation, 
the obnoxiousness manipulation, and the subject’s 
belief that the experimenter would not have access 
to the testing materials. Two additional questions 
ascertained the subject’s liking for the experimenter 
and the subject’s perception of her personality scores. 
With the exception of two questions, subjects indi- 
cated their answers by checking the appropriate 
points on a 15-point continuum with labeled end 
points. Subjects indicated perceived favorability of 
their personality scores on a 10-point scale and indi- 
cated “yes” or “no” as to whether the experimenter 
would find out if the partners were similar or dis- 
similar. 


RESULTS 
Checks on the Manipulations 


The similarity question provided a means 
for assessing the effectiveness of the similar- 
ity-dissimilarity manipulation. The main ef- 
fect for similarity was highly significant (F 
= 28.19, df = 1/48, p < .001). An examina- 
tion of Table 1 reveals that perceived simi- 
larity did not vary depending on the behavior 
of the confederate; “similar” partners were 
perceived as similar, and “dissimilar” part- 
ners were perceived as dissimilar, whether the 
confederate was pleasant or obnoxious. In 
addition, subjects who interacted with a 
neither-similar-nor-dissimilar obnoxious part- 
ner rated the partner as very dissimilar. Since 
the subjects responded to the liking measures 
prior to responding to the similarity measure, 


TABLE 1 
MEAN RATINGS or SIMILARITY AND OBNOXIOUSNESS 


Obnox- Obnox- Obnox- 
Condition | Pleasant Rosas pomenian ious 
possible possible enl 
Ratings of similarity 
Similar 8.00, 7.89, 8.41, 
Dissimilar | 4.67, | 3.41, | 2.78, | 2-78» 
Ratings of obnoxiousness 
Similar 4.44, 11.114 9.33.4 
Dissimilar | 5.44 |  7.56& | 756. | 90. 


Note.—n = 9 in every cell; entries with a common subscript 
do not differ significantly from each other; higher numbers 
indicate greater perceived obnoxiousness, AL between-cell 
comparisons based on pooled variance for the seven cells, 
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it is unclear whether the perceived dissimilar- 
ity of the confederate in the obnoxious-only 
condition was due to the obnoxiousn:ss manip- 
ulation per se or was a function having 
negatively evaluated the partner. 

The analysis of the confederate’s obnoxious- 
ness reveals that the obnoxious partner was 
seen as more obnoxious than the pleasant 
partner (F = 14.73, df = 2/48, p < 001). A 
subsequent 2 X 2 analysis which combined 
the obnoxious conditions produced ::e same 
results (F= 54.22, dj = 1/50, < 001), 
thus confirming the effectiveness of the ob- 
noxiousness manipulation. This analysis also 
revealed a significant main effect for simi- 
larity (F = 4.28, df = 1/48, p< .05) with 


similar partners being rated as more obnoxious 


than dissimilar partners and a significant in- 
teraction (F=3.61, df — 2/48, p< .05). 
The interaction and similarity main efect are 
a function of there being little difference in 
perceived obnoxiousness within the pleasant 
conditions (f < 1) and the similar pariners in 
the obnoxious conditions being sec as sig- 
nificantly more obnoxious than dissinvlar ob- 
noxious partners (f= 4.67, df= p< 


001). These results are generally consistent 
with the hypothesis that ratings of a similar 
other's behavioral characteristics will bc more 
sharply differentiated than ratings of a dis- 
similar other's characteristics. 

Two questions were designed to assess the 
extent to which subjects understood and, 
more importantly, believed that their relation- 
ship to and ratings of their partners would 
not be discovered by the experimenter. On the 
question regarding whether the experimenter 
would find out whether the partners were 
similar or dissimilar, only one subject be- 
lieved that the experimenter would find out 
that she was similar to her partner. On the 
question regarding likelihood that the experi- 
menter would compare the subject's evalua- 
tions of each other, only two subjects felt that 
the experimenter would make such a compari- 
son. The analysis of this question revealed 
that mean ratings of suspiciousness for all ex- 
perimental groups were well below the mid- 
point of 7.5, with higher scores indicating 
greater suspiciousness. Finally, two additional 
analyses revealed that the subject’s experi- 
mental condition did not affect their evalua- 


sonality scores or their evalu- 
imenter. 


'er personal Attraction 


:sures of interpersonal attrac- 
verage intercorrelation of .76. 
sis of vacance of summed liking scores 
ed the same mean pattern, though more 

t found with the direct liking 

However, since the study was 
th liking per se, only the direct 
g results will be presented here (see Table 


s that ratings of similar oth- 

re pronounced than ratings of 

rs is strongly supported. Spe- 
s predicted that a similar part- 
iked more than a dissimilar 

ı the partner was pleasant and 
less when the partner was obnoxious. 
@ interaction between similarity and con- 
e ior is highly significant (F — 
18, p < .001). Further 2 X 2 
| that the pattern exists when 
possible subjects only are con- 
ects exposed to the pleasant 
df=1/32, p< 


the effect is only slightly more pronounced (F 
=19.12, df = 1/32, p< .001)8 

In addition, individual comparisons confirm 
that a similar partner is liked more than a 
milar partner when the partner is pleasant 


e for the similar obnoxious partner Oc- 
not only when there is the implication of 
sing the undesirable trait (possession 
Possible: t = 2.38, df = 56, p < .05) but even 
When there is no implication of possessing the 
Obnoxious trait (no possession possible: £ = 
‘81, df — 56, p < .01). This comparison be- 
veen the two no-possession-possible groups 


“Similar analyses performed on the indirect liking 
tes reveal a pattern which matches closely 
Of the direct liking measure with one exception: 
question, “Do you think your partner is 
lly well-liked by others?” contrasting no- 
*sslOn-possible subjects with pleasant confederate 
, the interaction effect does not reach signifi- 
F — 293, dj — 1/32, p < 40). 


L2 A Aa Ahi a a B T 


rw p” Em d 


WHEN SIMILARITY BREEDS CONTEMPT 79 


TABLE 2 


MEANS AND ANALYSIS OF VARIANCE rox Sunyect’s 
Ratines ON Dixect LIKING MEASURE 


| f Oinos | Obnos- 


Condition Pes | Dus. | loura | Otnosions 
i possible | posible 
Similar — | 106% | 333. | 456, 1... 
Dissimilar | 6.78 | $784 | 74S. | 900 
Analysis of variance 
Source |«4| ms | F 
Similarity (A) | 1] 3130] <1 
Confederate behavior (B) 2 | 80.574 | 16.41* 
AXB 2 | 64.685 | 13.17* 
Within cells | 48 4.907 


Note.— = 9 per cell; higher means indicate greater liking; 
entries with a common subscript do not differ significantly from 
n All between-cell comparisons based on pooled 
variance for the seven cells. 

* p «.001. 


demonstrates conclusively that it is not merely 
fear of possessing an undesirable trait that 
mediates derogation of the similar other; 
rather, similarity per se apparently renders a 
similar person's characteristics more salient. 
As a result, these characteristics, whether 
positive or negative, are judged more ex- 
tremely. 

A 2x2 analysis of the obnoxious condi- 
tions (no possession possible and possession 
possible) revealed that fear of possessing the 
negative trait produced less liking for the 
partner than no fear of possessing the trait 
(F = 4.56, df = 1/32, p < .05). Although in- 
dividual comparisons on the direct liking 
measure fell short of significance, on the 
combined liking measure, a similar obnoxious 
possession-possible partner was seen as more 
unlikable than a similar obnoxious no-posses- 
sion-possible partner (f = 2.16, df = 56, p< 
05); possible possession also produced 
greater, though nonsignificant, overall dislike 
in the dissimilar condition. Similar obnoxious 
possession-possible subjects’ greater overall 
dislike for the partner due to fear of posses- 
sing the obnoxious trait is understandable; 
the dynamics of the dissimilar obnoxious pos- 
session-possible situation are less clear. Very 
possibly, similarity on 3 of 15 traits and the 
lack of specific assurance of dissimilarity on 
a particularly salient negative trait, such as 
obnoxiousness, aroused some anxiety for pos- 
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session-possible obnoxious dissimilar subjects 
and led to greater dislike for the partner. 

Finally, an examination of the obnoxious- 
only control condition reveals that obnoxious 
partners are evaluated approximately the 
same as dissimilar no-possession-possible ob- 
noxious partners. This finding conclusively 
confirms that it is the similar obnoxious part- 
ner who is perceived as relatively more unlik- 
able and not the dissimilar obnoxious partner 
who is rated as relatively more likable than 
an obnoxious person. That is, the knowledge 
that an obnoxious other is similar produced 
greater dislike of the obnoxious other; rat- 
ings of dissimilar persons perceived as 
obnoxious matched the ratings given ob- 
noxious-only persons. 


Discussion 


Taken as a whole, the results showed con- 
vincingly that similarity "cuts" both ways so 
that similar others are evaluated more posi- 
tively and more negatively than dissimilar 
others. Moreover, in the present study, the 
subjects clearly believed that neither their 
personality scores nor their relationship to the 
partner would be known by the experimenter. 
Hence, the results clearly refute Cooper and 
Jones’ (1969) assertion that subjects will 
derogate a similar obnoxious partner solely to 
disassociate themselves from the partner in the 
experimenter’s eyes. However, there is a criti- 
cal difference between dependent measures in 
the two studies. Cooper and Jones presented 
subjects with an opportunity to publicly undo 
a relationship of attitudinally based similarity 
by altering their opinions on certain issues. 
Subjects who believed that the experimenter 
would not see their opinions would have no 
reason to misrepresent them since an unaware 
experimenter precludes undoing. The depen- 
dent variable in the present study was liking. 
Liking does not offer a viable means of undo- 
ing a personality-based similarity relationship, 
but rather reflects one’s internal feelings 
toward the fact of similarity. Hence, whether 
the experimenter is aware or unaware of sub- 
jects’ feelings should be irrelevant. Thus, 
while there is reason to expect that subjects’ 
misrepresentation of opinions might depend 
upon whether or not the experimenter would 
become aware of the disassociation attempt, 


s 
m 
T d 


there is no reason to expect that indi 
of liking would necessarily be contine 
the public-private dimension. ! 
Since the present study relates to Novak 
and Lerner's (1968) avoidance results, a d 
cussion of their line of reasoning is d. 
Elsewhere, Lerner (1969) has outlined 
conception of the “just world," which ma 
tains that people need to believe tha 
happens to them bears some lawful re 
ship to their behavior and that their f: 
not determined by whimsical forces 


good or bad actions. Thus, according to Novak 
and Lerner, the existence of a similar and 
emotionally disturbed person is threatening 
because it suggests that mental illness 
happen to anyone, not just to people who in 
some sense deserve it. If the just-world hy 
pothesis truly mediates the phenomenon 
ported in Novak and Lerner, then a physi 
disabled similar desi should also 
perceived as threat ening. However, a pi 
study reported in Novak and Lerner revealed 
that subjects did not avoid a similar, crippled 
partner. Therefore, just the opposite of th 
Novak and Lerner rationale, then, seems 
plausible; that is, people may need to see law: 
ful events as random, particularly if the law 
ful event is personally threatening. Thus, 
existence of a similar, emotionally disturb 
person may be threatening because it impl 
that one harbors the seeds of mental illn 
oneself, This reasoning would also apply 
the present data more aptly, in that ob 
iousness can hardly be conceived of as a ri 
dom event that must in some sense be ma 
lawful. Rather, subjects in the possessi 
possible conditions seemed to fear the possi 
ity that since a similar partner was obnoxio 
they might also be obnoxious. The expla 
tion offered by the present study, then, is thé 
avoidance or derogation of a similar, nega 
tively reinforcing partner is, in part, an 
tempt to reduce anxiety aroused by fear t 
an affliction may be lawfully related to one 
self. However, a comparison of the present 
study's rationale with that of Novak a 

Lerner is qualified by the conflicting result 
on the liking measure. It may well be tha 
mental illness, unlike obnoxiousness, is not 4 


characteristic which is “disliked”; rather, one 
may be made nervous and uneasy by an emo- 
tionally ill person, especially a similar one, 
and hence wil! avoid them but not necessarily 


dislike them (cí. Sigall & Aronson, 1969). 
In the present study, anxiety reduction ob- 
viously was not the only factor that mediated 


derogation of a similar obnoxious partner. 
Subjects who were assured that they did not 
possess the obnoxious trait nevertheless dero- 
gated the similar partner severely. The sali- 
ence notion derived from Festinger’s (1954) 
social comparison theory offers a parsimonious 
explanation for these data. It was proposed 


that similarity makes an individual’s behav- 
ioral characteristics salient. Since the simi- 
larity-dissimilarity manipulation was con- 
ducted prior to the pleasant-obnoxious manip- 
ulation, the subjects were presumably very 
attuned to the characteristics of a similar 
partner. For the subjects in the similar condi- 
tion, then, the relative pleasantness or ob- 
noxiousness of the partner was highly salient, 
leading to pronounced liking or disliking of 
the partner. This greater salience evidently 
stemmed from the general rather than specific 
comparison relevance of the similar other, 
since even in the no-possession-possible condi- 
tion, where social comparison was unnecessary 
for evaluating one’s own obnoxiousness, the 
obnoxious similar other was disliked. The rela- 
tive lack of salience of the dissimilar partner’s 
behavior is demonstrated by the finding that 
Pleasant dissimilar partners were not liked sig- 
nificantly more than obnoxious dissimilar 
partners. Finally, the finding that an obnox- 
ious-only partner is less obnoxious and more 
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likable than a similar obnoxious partner con- 

firms that it is the similar obnoxious partner 

who is disliked relatively more, rather than 
the dissimilar obnoxious partner who is dis- 
liked less. In short, irrespective of direction, 
affective evaluations of a similar person are 
sharpened by the fact of similarity, because 
the positive and/or negative characteristics of 
the similar individual are highly salient, 
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COMPARISON OF THREE THEORIES OF COGNITIVE BAL .NCE' 
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Three experiments are reported in which subjects estimated the value oí 
missing relationships in a series of triadic (P-O-X) structures. In Experiment I 
independent variables designed to vary perceived P— X and OX rela 

tionships had an interaction effect upon the subjects’ estimates of P — O 
relationships; the pattern of means obtained in this experiment was moderately 
consonant with the upper-boundary limits of Wiest's model and two formula- 
tions based upon Feather's discrepancy principle. In Experiment II, variables 
designed to vary perceived P — X and P — O relationships had an inter 

action effect upon attributions of O — X relationships. In Experiment III, 
variables designed to vary perceived P — O and O — X relationships had an 
interaction effect upon reported P —. X relationships. The results of the latter 
two experiments were best described by an equal-weights version of Wiest's 

tetrahedron model. No support was found for Newcomb's proposal concern- 


ing nonbalanced structures. 


This study was concerned with analyzing 
and testing the implications of three rival cog- 


P-0-X 
ve is a 


cording to this latter formulation, “a: 
situation in which P—O is n 


nitive balance theories: Newcomb’s (1953, nonbalanced one, often though not invariably 
1959, 1968) theory of interpersonal balance; associated with low engagement [p. 231," and 
Wies's (1965) tetrahedron model; and lack of engagement implies “only a state of 
Feather’s (1964, 1966) balance formulation little or no preference for balance or imbal- 


based upon the discrepancy principle. 
Newcomb (1959) has assumed that the 
amount of “strain toward symmetry” found 
within a triadic structure will vary, among 
other things, according to the degree of per- 
ceived discrepancy between two persons’ (P's 
and O's) coorientation toward a common 
attitudinal object (X), together with the sign 
and degree of the perceiver’s (P) attraction 
toward O (P — O). Newcomb has stated that 
when P is positively attracted to O, strain in- 
creases according to the amount of discrep- 
ancy present between P's and O's attitudes 
toward X. On the other hand, when P is neg- 
atively attracted to O, the degree to which 
strain will be induced by discrepant attitudes 
toward X depends upon O's threat potential 
and the object relevance of X to P. More 
recently, Newcomb (1968) has grouped all 
structures containing negative P — O relations 
into the single category of *nonbalance." Ac- 


1 This study is based upon a master's thesis sub- 
mitted by the first author to Vanderbilt University. 
The work was partially supported by Grant GS-2658 
from the National Science Foundation. 

? Requests for reprints should be sent to Donald 
L. Thistlethwaite, Department of Psychology, Van- 
derbilt University, Nashville, Tennessee 37203, 
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ance within the total set of relations [p. 33].” 
In sum, P-O-X situations in which the P > 0 
relationship is negative are said to elicit 
indifference, and in such situations Newcomb's 
theory predicts little or no balance modifica- 
tions. 

As Newcomb (1968) has shown, the three- 
category classification of P-O-X structures as 
balanced, nonbalanced, and imbalanced seems 
to give a better account of the data from stud- | 
ies in which hypothetical situations are rated 
for their pleasantness than Heider's (1946) 
two-category classification system. However, 
Newcomb has not yet attempted to show that 
his formulation is applicable to experiments 
in which the subject, instead of rating the 
pleasantness of hypothetical situations, pre- 
dicts a missing relationship. For example, the 
results of an experiment by Aronson and Cope 
(1968), in which a subject develops a more 
favorable attitude toward his enemy’s enemy 
than toward his enemy's friend, seem incon- 
sistent with Newcomb's contention that bal- 
ance principles are inoperative in situations | 
involving a negative P — O relationship. I? 
contrast, the rival balance formulations 9 


Feather (1966) can readily 
results. 


(19¢ reats the interelement rela- 
PO, >X, and O— X as three 
pendent bipolar continua that represent 
lative s measuring the degree of 
ction be n pairs of elements. These 
continua may be conceptualized as the 
axes of ube, Balanced structures are 
ed as t whose spatial coordinates 
Lanywhere o» or within the tetrahedron 
ed by connecting the four opposing cor- 
of the cube that represent Heider's 
) four balanced configurations. Wiest 
ed confirmable predictions from the tet- 
Iron model by “slicing” the cube along a 
dimens at points that correspond to 

US magnitudes of a relation given in a 
OX configuration. These planes pass 
the te\rahedron lying within the cube, 


"boundaries of balance" associ- 
y triad possessing the given rela- 
value. Wiest has shown that such 
ies are useful in predicting the mag- 
e of covariation between two relations in 
O-X structure as a function of the level 
third. 

More recently, Wellens and Thistlethwaite 
971) have provided algebraic formulations 
‘the upper and lower boundaries of balance 
ibed by Wiest. Using X, Y, and Z to rep- 
t the interelement relationships P — X, 
X, and P— O, respectively, it may be 
n that Wiest’s boundaries of balance are 
ed by the equations: 


Zy - k— |X —Y| 1] 
Zz, = [XR A [2] 
€ Zy and Zi, represent the upper and 


boundary limits, respectively, for one 
element relation Z; |X — Y| and |X + 
Correspond to the absolute values of the 
l difference and sum, respectively, of the 
0 other interelement relations (X and Y): 
d the integer k is a constant representing 
highest positive value on the scale repre- 
ting each interelement relation. In Section 
^ 0f Figure 1, the lines EC, ILK, MPO, RUT, 
AG represent upper boundary limits cal- 
ed on the basis of Formula 1, while EC, 

MNO, RST, and AG represent the 
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lower boundary limits calculated on the basis 
of Formula 2. 

The limits Zy and Z;, may be combined to 
yield a single prediction of Z according to the 
generalized formula: 

Z = aZv +b, [3] 
where Z represents the expected value of the 
missing relation, and a and b represent weight- 
ing factors assigned to the predicted range 
limits. The values of a and b will obviously 
vary according to assumptions made about 
the distribution of values expected along the 
range Z;, to Zy. On the basis of Osgood and 
Tannenbaum's (1955) polarization principle, 
Wiest hypothesized a tendency for configura- 
tions to cluster toward the surfaces of the 
tetrahedron; one implication of this hypothe- 
sis is that the obtained distribution of values 
of Z should be bimodal for all structures in 
which Zr, and Zy do not coincide. Regardless 
of whether the polarization principle is as- 
sumed to be operative, if it is assumed that 
the upper and lower boundaries of balance are 
equally potent as points of reference, then it 
would seem appropriate to assign equal 
weights to the boundaries of balance associ- 
ated with these surfaces. Such a procedure is 
equivalent to assigning values of .5 to both a 
and b in Formula 3; predictions based on the 
latter procedure will hereafter be referred to 
as those based upon the “equal-weights” 
tetrahedron model (Formula 3’). The solid 
lines EC, IK, MO, RT, and AG in Section A 
of Figure 1 show the predictions based upon 
this model. 

Feather’s (1964, 1966) structural balance 
model of communication effects represents a 
third extension of Heider’s (1946, 1958) 
theory. Attitudinal relationships considered in 
Feather’s model deal with a source (S) pre- 
senting a communication (C) about an issue 
(1) to a receiver (R). Only three of these 
elements will be of concern here (namely, S, 
I, and R), with relations being limited to the 
receiver's attitude toward an issue (R — I), 
the receiver’s perception of the source’s atti- 
tude toward an issue (S— I), and the re- 
ceiver's attitude toward the source (R — S). 
Following Newcomb's (1959) earlier proposal, 
Feather (1966, 1967) proposed that given the 
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Fic. 1. Predicted mean ratings of subjects’ estimates of one relation (Z), given information con- 
cerning two of the relations (X and Y) within the same triad. 


r3 


dIIVMHIGTIISIH] "] QIVNO([ ANV SN3TI3A) AdNGdOW "V 


COMPARISON OF THREE THEORIES OF COGNITIVE BALANCE 85 


sign and magnitude of two relations in such a 
triadic situation, the sign and relative magni- 
tude of the third relation may be predicted 
according to a discrepancy principle: 


t 


when two attitudina l relations within a triad are of 
the same sign the third relation in the triad will be 
positive in sign and its strength will increase as the 
discrepancy between the two given relations de- 
creases . . . when two attitudinal relations 


within a triad are of opposite sign the third relation 
will be negative in sign and its strength will increase 
as the discrepancy between the two given relations 
increases [1967, p. 107]. 

If X, Y, and Z are taken to represent the 


relations R — I, S—I, and R-S, then 
Feather's discrepancy principle may be re- 
formulated and extended, as Wellens and 
Thistlethwaite (1971) have shown, as two 
functional equations: 


Z.-hk—|X-Y [4] 
Jæ =E] 


-- oe [5] 


where Zs represents the missing relation (Z), 
given that the known relations (X and Y) are 
of the same sign; Zo represents Z, given that 
the known X and Y relationships are of oppo- 
site signs; |X — Y| corresponds to the abso- 
lute value of the real difference between the 
given values; and & is the highest positive 
integer value on the scale (ranging from —& 
to +k) used to describe magnitudes of all 
three relations. It can be seen that Formula 4 
yields a maximally positive value for Zs when 
X and Y are minimally discrepant; the lines 
EV, ILJ, MPO, SUT, and WG in Section B of 
Figure 1 show predicted Za values based upon 
this formula. Formula 5 yields the largest 
value for Zo when X and Y are maximally 
discrepant; the lines V'C, J'K, M'P'O', RS’, 
and AW’ in the lower half of Figure 1 show 
the predicted Zo values based upon this form- 
ula. Tt can be seen that when it is assumed 
that an attitude is perfectly juxtaposed be- 
tween positively and negatively affective posi- 
tions (i.e., when one holds, or is perceived to 
hold, an attitude that is located precisely at 
the “middle” position on the X or Y scale), 
both Formulas 4 and 5 may be considered 
Applicable, yielding competing predictions con- 
cerning the missing Z relation (cf. V versus 

» J versus J’, etc., in Section B of Figure 1y. 


ll 


One approach in using these formulas when X 
or Y assumes a neutral value is to use the 
average of the competing Zs and Zo values as 
the prediction for the missing relation. The 
solid lines plotted between the dotted lines in 
Section B of Figure 1 show predictions made 
using this convention. These predictions will 
hereafter be referred to as those based upon 
the “composite” discrepancy principle (For- 
mula 6), and may be regarded as a specific 
extension of Feather’s discrepancy principle. 
The following experiments were designed to 
compare the ability of each of these six alge- 
braic formulations to predict outcomes when 
two of the three interelement relations in a 
given triad were given and the subject was 
required to estimate the third relation. If 
Newcomb’s (1968) assumption concerning 
“nonbalance” is correct, the formulations 
based upon the Feather and Wiest models 
should prove to be relatively inaccurate under 
conditions of negative interpersonal relations; 
according to Newcomb’s nonbalance proposal, 
the discrepancy between P — O and px 
(or O— X) should have little or no effect 
when P—0 is given as negative. In con- 
trast, both Wiest’s tetrahedron model and 
Feather’s discrepancy principle may be in- 
terpreted as implying that other things being 
equal, it should make little difference which 
two relations in the triad are given and which 
relation the subject is asked to estimate. 


METHOD 
General Overview 


This study included three concurrent experiments. 
Within each experiment, the subjects were asked to 
make predictions concerning the sign and magnitude 
of a relation (Z) for all possible combinations of 
five levels each of two independent variables (X and 
Y); this design calls for every subject within each 
experiment to rate a total of 25 items. In the first 
experiment, the dependent variable (Z) was taken to 
be the relationship P — O; the subjects were asked 
to estimate the magnitude of this relationship given 
all possible combinations of PX ad OX 
values (X and Y). Experiment II had another set of 
subjects predict O — X, while P—X and P— 0 
were given and experimentally varied. Experiment 
III had still another set of subjects estimate PX 
values, given various P — O and 0— X values. 


Materials 


An approach similar to the one developed by 
Feather (1966) was employed for data collection. 
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Three forms of an “attitude questionnaire” were 
constructed, one form for each experiment. The 
questionnaire used for Experiment I stated that the 
study’s purpose was to investigate the “processes by 
which people form first impressions of others on the 
basis of limited information.” Each item of the 25- 
item questionnaire specified the subject’s hypothetical 
stand on a “resolution concerning war protests” (X), 
together with another person’s feeling toward the 
same resolution (Y). The instructions asked the 
subject to mark, on a $-point scale ranging from 
“Dislike” to “Like,” how he thought he would feel 
toward the other person (Z). Judgments were to be 
based solely on the information provided within a 
specific item. The questionnaires used in Experi- 
ments II and III were analogous to the one used in 
Experiment I, with the given information varying 
according to the specific independent variables being 
manipulated. In Experiment II, the subjects were 
asked to estimate how they thought the other person 
would feel toward the given issue; in Experiment 
III, the subjects were asked to estimate how they 
thought they would feel toward the given issue. For 
example, the following items—a, b, and c—illustrate 
the types of items appearing in the questionnaires 
used in Experiments I, II, and III, respectively: 


a. You are just as much in favor of as opposed to a reso- 
lution concerning war protests. 
“Person O" is strongly opposed to the same resolu- 


tion concerning war protests. 
How do you think you would feel toward this person? 
Dislike ———; 2 : ike 


b. You are strongly in favor of a resolution concerning 
war protests. 
You mildly like *Person O." 
How do you think “Person O" feels toward the same 
resolution concerning war protests? 
Opposed. iB h z E 


c. You strongly dislike “Person O." 
"Person O” is mildly opposed to a resolution concern- 
ing war protests. 

How do you think you would feel toward the same 
resolution concerning war protests? 
Opposed : p 5 k 


The 25 items in each questionnaire represented the 
25 cells of a Latin square, formed by considering 
five levels each of the two independent variables, X 
and Y. For example, in varying the P — X relation- 
ship in Experiment I, the subjects were asked in 
successive items to imagine that they were “strongly 
opposed to,” “mildly opposed to,” “just as much in 
favor of as opposed to,” “mildly in favor of,” or 
“strongly in favor of” the hypothetical issue, These 
descriptive phrases may be thought of as repre- 
senting values of X corresponding to the quantities 
—2, —1, 0, +1, and +2, respectively. The 0s X 
relationship was similarly varied in the first experi- 
ment and associated with corresponding values for Y. 

Within each experiment, every item presented the 
particular variable associated with “X” first and the 
variable associated with "Y" second. The order of 
the items in each questionnaire was determined ran- 


In Favor 


In Favor 
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domly, with the restriction that no « 
each other in the Latin square could 
positioned as items on a given par y qu 
tionnaire. The latter procedure wa: designed 
prevent any sequential incrementing of the 
pendent variables that would leac subjects | 
respond according to an externally sug 
Order was further controlled by cresti 
subject variable that varied the order 
were assembled for questionnaires 
perimental group; five such page 
cluded in each experiment. 


Subjects and Procedure 


Subjects were drawn from the in! 
chology subject pool at Vanderbilt Univers 
males and an equal number of fera! 
domly assigned to each experiment; the í 
page presentations were equally repre-cit 
experiment. To minimize reactive ari 
larger pool of subjects was actually 
out respect to sex; and excess subject 
quently randomly discarded, so that 
and the order of presentation of item 
balanced within each experiment. 


on their questionnaires and to begin ; 
as soon as they understood what they 
Any questions were answered indiv 
were asked to hold their questionn: 
one had finished, at which time all m 
collected and subjects dismissed. Each 
approximately 40 minutes. 


RESULTS 
Tests of Predictions Based on the Wiest @ 
Feather Models 


A five-way analysis of variance (Expé 
ment X Sex X Order x First Manipulated B 
lation X Second Manipulated Relation), f 
on the combined data from all three expel 
ments, showed no significant main effet 
associated with experiments or with the ord 
of item presentation. There was, howevely 
significant main effect associated with $ 
(F—618, df— 1/30, p<.05); wom 
tended to give higher overall estimates 
items than men. In addition, a significa 
triple-interaction effect was found involvil 
experiments and the two given interelem! 
relations (X and Y); although this inter 
tion was highly significant (F = 3.04, dj 


small proportion of variance (o? = .02). T 
obtained mean patterns varied significant 
but not greatly, across experiments. 


-. 
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The results of two-way analyses of variance, 
run independentiy on the data from each ex- 
periment, are nmarized in Table 1. Al- 


though the ge l X xF interaction, pre- 


iy all of the formulas consid- 


ered, was hi: significant in each experi- 
P ment, it is a ent that the proportion of 
variance in the dependent variable accounted 
for by this interaction declined progressively 


i to III. Omega-squared 
estimates for the X X Y interaction effect in 
the three experiments were .51, .49, and .43, 
respectively. ppears that the independent 
variables had what less impact when the 
dependent variable consisted, as it did in Ex- 


from Experit 


| periment ITI, of the subject's estimate of how 
he would fec! toward the hypothetical issue, 
given informa‘ion about his attitude toward 
O and O's attitude toward the issue. The main 


and Y shown in Table 1 were 
consistent with the theoretical 

d upon the Wiest and Feather 

both Wiest’s upper balance 

the composite discrepancy 

X and Y main effects with 
lues occurring for X = £1 or 
specific predictions were con- 
‘iment T, but not in Experi- 
Til. 

F igure 2 shows a plot of the obtained mean 
estimates for each experiment. It can be seen 
that the profile “peaks” predicted by Formu- 
las 1, 4, and 6 when Y = —1, 0, or +1 did 
tend to occur for triads with the appropriate 
X values in Experiment I; these peaks did 
not consistently occur in the expected triads 
in Experiments II and II. Individual com- 
parisons (Newman-Keuls) indicated that the 
Peak means of Experiment I (shown in the 
three middle graphs in Section A of Figure 2) 
Were significantly different from the other 
Means associated with Y = +1 (p < .01) and 
with Y —0 (p< .05); tests performed for 
Experiments TI and III indicated that the 
Means for predicted peak cells did not differ 
Significantly from the means for other cells 
associated with the appropriate levels of Y. 

hus, estimates obtained in the first experi- 
Ment confirmed the profile peaks predicted on 
the basis of (a) the upper boundaries of bal- 
ance described by Wiest (Formula 1) and (5) 
two applications of Feather's discrepancy 


effects for X 
only partialis 
predictions ! 

models. Thu 
boundaries < 
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TABLE 1 
ANALYSIS OF VARIANCE OF EsrruATES 
IN EACH EXPERIMENT 
Experiment. 
ERA 4f I Hu Ht 
| - — = 
DER. | E JI " 
X 4 15,6199| 93 | 146. | — | sza | 01 
Y . 4 | 9.20€** | 02 1.74 — 2.5889* | Of 
XXY | 16 | 41.38%% | i51 | 36.75%% | „49 | 26,25%% | 43 
*p <.05. 
** 5 « .008. 
++p c. 001 


principle (Formulas 4 and 6), while the esti- 
mates obtained in Experiments II and II did 
not confirm these predictions. 

Distribution analyses performed on the 25 
cells associated with each experiment showed 
that 65.6% of all estimates given by the sub- 
jects fell within the boundaries of balance 
(i.e., ranges for Z) defined by the tetrahedron 
model. This percentage remained relatively 
constant across experiments: 64.2% for Ex- 
periment I; 67.2% for Experiment IT; and 
65.4% for Experiment III. Therefore, the 
degree to which there was confirmation of 
Wiest’s general prediction, that estimates 
should fall within the upper and lower bound- 
aries of balance, did not vary appreciably 
across experiments. There was, however, little 
support for Wiest’s polarization assumption 
concerning the distribution of Z values. Part 
A of Figure 1 shows that the theoretical 
upper and lower boundaries of balance differ 
for the following nine pairs of X and Y 
coordinates: (—1, —1), (0, 21), (+1, —1); 
(ds 0), (0, 0), (F1, 0), (sU +1), (0, +1), 
and (+1,+1). Thus, the polarization prin- 
ciple predicts bimodal distributions for Z 
values in each of these nine cells. The ob- 
tained distributions of Z values in these cells, 
however, were uniformly unimodal, showing 
no support for the polarization assumption. 

Table 2 summarizes the average squared 
deviations of the obtained Z estimates and 
the ratings predicted by each of the algebraic 
expressions: Formulas 1 and 2 represent the 
upper and lower boundaries of balance, re- 
spectively, of Wiest’s model; Formula 3’ is 
the equal-weights tetrahedron model; For- 
mulas 4 and 5 represent formulations of 
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Fic. 2. Results from Experiments I, II, and III. (Interelement relations _are a res in the 
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TABLE 2 


MyzRAGE SQUARED DEVIATIONS AND CORRELATIONS OF 


OsrAmED MEANS AND MEANS PREDICTED BY 
FORMULAS BASED UPON THE FEATHER 
AND THE Wiesr MODELS 
Pre- | A r given value of ¥® 
diction = Overall | |, 
equa 1 average Ls 
tions -2 1 0 | +1 +2 
| 
Experiment I (X P =X; Y =0+X;Z =P —0) 
og d 4| 40] 13} 224 37 |.96 
2 |25 58 | 2.60 | 1.81 | .24 1.29 | 62 
y |25 36 | .30| .37| .24 .30 |.90 
à |.22 16 | .40| .16| .04 (w9| .27 |.91 
$. |.45 2 | .75 | 104] n 182 | .96 
é |.30 60 | .05 | .47| 45 En .96 
| | | 
Experiment II (X = P >X; Y =P 20;2 =0 +X) 
1 12 (ns 1.21 | 44] .10 52 |.84 
2 [120 1:37 | 128 | 10 75 |.82 
3 12 (ns | 109 | .26| .10 16 |94 
4 16 (n 1.08 | 1.21 | .60| .08 69 54 
5 19 (» 3| .79| .81| 93 67 75 
6 M (n 57 | .48| .62| .33 44 |.88 
Experiment Ill (X = P5 0; Y 20 X;Z =P — X) 
i E 79 | 1.08 | -38 | .39 .60 |.83 
zy | 3 9 1,02 | .39 183 |.84 
io 10 | .39 .24 |.95 
1 | -26 1 58 | .11 71 50 
LER 44 | 1.55 SA |. 
S54 39 | .72 .58 89 


Emu 6 represents à composite, using Formulas 4 and 5 
ard Average of the values provided by these equations where 
b SI, 1.78 
ES deviations were significant at the .05 level except where 
ee for Formulas 1, 2, 3^ and 6 are each based upon 
i ah of means; those for Formulas 4 and 5 are each based 
117 pairs of means. 


Feather’s discrepancy principle for given rela- 
tions of similar and opposite signs, respec- 
tively; and Formula 6 represents the compos- 
tte discrepancy principle (i.e. predictions 
based on either Formula 4 or 5, as appropri- 
ate, or on an average of Zs and Zo values 
E both equations may be construed as 
making competing predictions). The last col- 
umn of Table 1 shows the correlations be- 
tween the obtained ratings and the ratings 
Predicted by each formula across all cells in 
pe a given formula was applicable. In the 
T experiment, Formula 1 yielded a mark- 
di ecw overall average deviation than 
ormula 2 (.27 versus 1.29); also, For- 
mula 4 yielded a much smaller average devi- 
ation than did Formula 5 (.27 versus 82). 
a correlations for Formulas 1 and 2 tend 
EX a similar pattern (.96 versus .62). 
Us, it appears that in the task given in 


Experiment I, the subjects' estimates were 
better predicted by Wiest’s upper boundaries 
of balance than by his lower boundaries of 
balance; also, these estimates were better pre- 
dicted by Feather's discrepancy principle for 
given relations of similar signs than by his 
discrepancy principle for given relations of 
opposite signs. 

'These differences were sharply attenuated 
in Experiments TI and III when the subjects 
were not required to estimate their liking or 
disliking for another person. Thus, in Experi- 
ment II, the superiority of Formula 1 over 
Formula 2 was reduced (.52 versus .75), and 
the equal-weights tetrahedron model (For- 
mula 3') provided predictions having the 
closest fit to the data (.16). Also in this ex- 
periment, there was virtually no difference in 
the overall average deviations for Formulas 
4 and 5 (.69 versus .67). Similarly, in Experi- 
ment III, Formula 1 was only slightly supe- 
rior to Formula 2 (.60 versus .83), and For- 
ula 4 was only slightly superior to Formula 5 
(.71 versus .84). Viewing the average devia- 
tions and correlations in Table 2 for the latter 
two experiments, the equal-weights tetrahe- 
dron model (Formula 3’) seems to provide the 
best overall fit to the data. 


Tests of the Newcomb Formulation 


Table 3 summarizes the separate two-way 
analyses of variance performed on truncated 
2 x 5 matrices from Experiments IT and TI 
in which only negative P— O relationships 
were given. In Experiment II, the subjects’ 
estimates were significantly affected by given 
X (ie, PX) relationships (F — 34.18, 
p < 001); similarly, in Experiment TII, the 


TABLE 3 
ANALYSIS OF VARIANCE OF ESTIMATES FOR CELLS WITH 
NEGATIVE P — O RELATIONSHIPS IN 
Experments II AND TL 


Experiment 
Source It IH 
df F wt df F ot 
X 4 34.18* Al 1 «1 -— 
Y 1 1.14 x Ay 146372] 29 
Xocr|-4 9.52* | .08 4 6.39* | .04 
*p «001. 
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subjects’ estimates were significantly affected 
by the given Y (ie, 0—X) relationships 
(F = 16.57, p < 001). In addition, signifi- 
cant X X Y interactions were found in both 
of these matrices (F = 9.52, p< 001; F= 
6.39, p < 001). These findings are contrary 
to the predictions made on the basis of New- 
comb's (1968) proposal, and they suggest 
that balance principles are not suspended 
when negative P — O relationships are given. 


DISCUSSION 


It is apparent from Table 2 and from the 
profile comparisons in Figure 2 that the alge- 
braic formulations derived from the Wiest and 
Feather models do not describe the obtained 
P—0, O— N, and P— X mean patterns 
with equal accuracy. The pattern of obtained 
means associated with estimates of the inter- 
personal relation P — O (upper part of Table 
2) seems best described by the upper bound- 
ary limits of the tetrahedron model (For- 
ula 1) and by two formulations based upon 
the discrepancy principle (Formulas 4 and 6). 
Mean patterns associated with O — X and 
P — X (lower parts of Table 2) seem better 
described by the equal-weights tetrahedron 
model (Formula 3'), with none of the dis- 
crepancy formulations showing comparable 
degrees of fit. The variation in the applicabil- 
ity of the algebraic formulations across ex- 
periments is, of course, also reflected in the 
obtained triple interaction involving the X 
and Y variables and experiments. 

First, it appears that we can reject at least 
one explanation of this triple interaction. In 
the analyses based upon only those triads in 
Experiments II and III in which the subject 
was asked to assume that he had a negative 
P—O relation, we have seen that there 
were significant main and interaction effects 
(Table 3). Thus, for such triads in Experi- 
ment IT, the given P— X relations signifi- 
cantly affected estimates of the O — X rela- 
tion, and these effects varied according to the 
magnitude of the given negative P — O rela- 
tion; similarly, in such triads in Experiment 
III, the given O — X relations significantly 
affected estimates of the P — X relation, and 
again these effects varied according to the 
magnitude of the given negative P — O rela- 
tion. These results indicate that the change 


in mean patterns across experiments was prob. 
ably not the consequence, as Newcomb’s 
(1968) proposal might suggest, of “indiffer- 
ence" when subjects are asked to assume 


negative P— O relationships. While New. 
comb's category of nonbalanced triadic struc- | 


reting the 
of the 


tures seems to be useful in int 
earlier literature involving ratings 


pleasantness of completed P-O-> structures, 
it does not appear useful in interpreting the 
data from the present study in which a miss 
ing relation is induced. Newcomb s general 
assumption that the interelement relationships 


in a P-O-X structure are functionally dif- 
ferent may be correct, but these results argue 
that his proposed suspension of balance prin- 
ciples for structures with negative P01 
relationships is not appropriate. 

There are, however, several plausible inter 
pretations that may be given for the triple 
interaction. First, it may be part!y related to 
the greater error variance found to be associ- 


of estimating one’s liking for anot 
The variation in omega-squared : 
experiments (Table 1) might aiso indicate 
differences in the degree to which the given 
interelement relations were successfully im 
duced. A second interpretation is that balance 
may have been differentially operative acros 
experiments. Thus, Experiment I manipulated 
attitudes toward a common element (X) 
while Experiments II and III varied attitudes 
toward two different elements (O and X) 
Perhaps attitudinal discrepancy is more sa- 
lient to subjects when items are presented 
with relations converging on a common focal 
element; such an interpretation might help 
account for the poor fit of the formulas bas 
on the discrepancy principle in Experiments 
IT and IIT. Although Feather (1967) 1 
presented the discrepancy principle as 4 ge 
eral formulation “applying to all triads n 
the communication structure [p.- 108]. 
heretofore, he has consistently applied it to 
triads in which the given relations had 3 
common element. 

A third interpretation of the triple 
action is that it was partly the consequ 


intel 
enc 


‘nating influences of positivity 
Whitney, 1971; Zajonc, 
shown that such tendencies, 


conta! 
gues, 
C It can 


could be extraneous influences in Ex- 

ent I, not in Experiments II and 
can read: : incorporated into either the 

jest or Fea formulation, For example, 
better fit ined in Experiment I when 
discrepan: inciple was applied to given 


tions of simular sign (Formula 4), than to 
en relations of opposite sign (Formula 5), 
y be interpreted as the consequence of 
i t to imagine that one dislikes 
en given coorienting attitudes of 
ce requires that the subject 
iad by reporting a positive lik- 
ther person, but when given 
tritudes of unlike sign, balance 
the subject complete the triad 
it he dislikes the other per- 
refer not to imagine negative 
tons toward others, the relatively 
the discrepancy principle when 
iven relations of unlike signs is 
ie? According to this reasoning, 
: TI and IIT when the subject 
wired to estimate liking or disliking for 
her, the superiority of Formula 4 over 
nula 5 should decline, and it does. 
The generalized tetrahedron model (For- 
à 3) seems to offer an unusually con- 
nient conceptualization for dealing with the 
meous effects arising from both positiv- 
ly and agreement tendencies (Rodrigues, 
1967; Whitney, 1971; Zajonc, 1968). For 
ple, a and b in this formulation may be 
ded as parameters that are affected by 
sitivity. When the experimental task re- 
tes the subject to estimate P — O, and it 


ag average squared deviations considered here 
i5 a criterion for assessing goodness of fit have no 
tpart in the results reported by Feather and 
m associates. Experiment I in this study was similar 
| Ene of the tasks included in Feather's (1966) 

S udy, but his predictions based on the discrepancy 
nciple concerned only ordinal differences in esti- 
tes. While these predictions were better confirmed 
hen the given relations had unlike signs than when 
End like signs, later experiments reported by 
ea er (1967), Feather and Jeffries (1967), and by 

ther and Armstrong (1967) tend to support the 
""lerences between triads predicted by the discrep- 
hey principle, both when the given relations had 
and when they had unlike signs. 


COMPARISON OF THREE THEORIES OF COGNITIVE BALANCE 
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is reasonable to expect him to prefer imag- 
ining positive interpersonal relations, then a 
in Formula 3 might appropriately be assigned 
a relatively greater weight than b. Assigning 
values of .75 and .25 to a and b, respectively, 
in Formula 3 produced predictions having a 
better fit to the data of Experiment I than did 
Formula 3': the overall average squared de- 
viation of predicted from obtained values was 
.17, and the overall correlation of predicted 
and obtained ratings was .96. (Table 2 shows 
that the corresponding values for Formula 3° 
were .30 and .90, respectively.) The latter 
findings confirm those reported by Wellens 
and Thistlethwaite (1971). In Experiments IT 
and III, when subjects are not required to 
estimate liking or disliking toward another 
person, positivity forces should decline, and 
the superiority of the equal-weights formula 
in the latter experiments would be expected. 
In a similar manner, agreement tendencies 
may be conceptualized as affecting the a and 
b parameters in Formula 3 when the subject's 
task is to estimate his own (or another's) 
attitude toward an issue. When subjects are 
given the P — O and OX relations and 
are asked to estimate the P— X relation, for 
example, such an agreement tendency might 
influence subjects to assign a greater weight 
to a when O — X is positive, and a greater 
weight to b when O—X is negative. The 
data for Experiment III in Table 2 show 
that the former prediction is consistent with 
the average squared deviations for Y = +1 
(.38 < 1.02), indicating a closer fit to For- 
mula 1 than to Formula 2; however, the 
latter prediction was not consistent with the 
shared deviations for Y = A 19. < 01) 
indicating, contrary to prediction, a closer fit 
to Formula 1 than to Formula 2. 

In short, of the three theories of cognitive 
balance considered, Wiest’s model appears to 
offer the broadest conceptual framework. 
While Feather’s discrepancy principle was 
moderately applicable in one of the three 
experiments reported, Wiest’s tetrahedron 
model was applicable to all three experiments. 
Moreover, the generalized tetrahedron formu- 
lation (Formula 3) seems capable of elabo- 
ration so that it provides a formulation capa- 
ble of simultaneously taking into account 
balance, positivity, and agreement tendencies. 
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It has been theorized that high drive initiates an “autistic” perceptual process 
operating in the direction of drive gratification. In the present study, it was 
hypothesized that (a) under conditions of high drive, perceptual distortion 
of drive-relevant objects occurs principally along drive-relevant dimensions ; 
(b) the direction of distortion is toward enhancement of the object's potential 
for drive satisfaction; (c) the degree of accessibility of the object for po- 
tential drive gratification affects the extent to which drive-relevant character- 
istics of the object are enhanced. These hypotheses were tested in a person- 
perception situation in which level of sexual arousal and accessibility of the 
judged stimulus object (a prospective date) were varied. A two-factor multi- 
variate analysis of variance with three dependent measures indicated that 
arousal level of judge, accessibility of stimulus object, and dimension relevance 
interact in determining perception of stimulus person. This and additional 


findings are discussed. 


Bruner and Goodman (1947) first proposed Several theorists have noted the circum- 
t perception under- scription apparent in tests of the hypothesis. 


— to study the variations tha 
goes "when one is hungry, in love, in pain, or Secord and Backman (1964), for example, 
blem [p. 33]." Since then, many felt it necessary to point out that the hy- 


solving a pro 
d the impact of mo- pothesis “may apply not only to a series of 


investigators have examine e nay é ] 
tivational factors on perception, most particu- physical stimuli but also to any abstract di- 
larly on the perception of stimuli related to mension opening the way for the exploration 


the drive under consideration. Much of this of many variables that have great social im- 
research has focused directly on. Bruner and portance [p. 46]." Tajfel (1969), too, com- 
Goodman's hypothesis, that as motivational mented that the hypothesis has wider impli- 
level increases, drive-related objects “become cations than is implied by its experimental 
more vivid, have greater clarity or greater tests and specifically pointed out that little 
brightness or greater apparent size |P- 36].” attention has been given to the effect of need 
Although the perceptual accentuation hy- upon person perception, even though “studies 
pothesis was initially couched in general on the perception of physical characteristics 
terms, experimental investigations of the of people would seem to lend themselves par 
proposition have been confined largely to tests excellence to the demonstration of the effect 
of a special derivation of the hypothesis. of motivational factors in perception [P. 
Most experiments have been concerned with 325]." : 

the judgment of size of objects, most particu- Despite the relevance of the perceptual ac- 
larly objects associated with monetary value, centuation hypothesis to person perception, 
and among these, most especially coins. little relevant research has been conducted in 
` this area. The vagueness of the proposition’s 

1 This experiment was supported in pet NE theoretical underpinnings makes its predictive 
"em pedet ind lestitute of Mental extension difficult, a fact that may partially 
Health Research Grant NH 16661 and National account for the unwillingness of many inves- 
Science Foundation Grant CS 2932 to Walser. tigators to stray far from the domain in 
EN at the University e: Ear ug LEN which the hypothesis was first formulated and 

1 E 
inii c Roach in Social Rela- tested. (Bruner & Goodman, eg, gave only 
tions, University of Minnesota, Minneapolis, Minne- cursory attention to the raison d’étre of an 
sota 55455. accentuation effect under high drive. They 
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speculated that it was the result of “habitual 
selection," but they did not elaborate.) 

The perceptual accentuation hypothesis, 
however, bears a great deal of similarity to a 
proposition advanced earlier by Levine, Chein, 
and Murphy (1942). This earlier formulation 
does seem to readily permit extensions to 
person-perception situations. These investi- 
gators posited that high drive initiates an 
“autistic” process "operating in the direction 
of gratification.” Such an autistic process, of 
course, may account for certain instances of 
perceptual accentuation. Although Levine et 
al. also failed to elaborate on their notion, 
they clearly foresaw that some explication 
would inevitably be necessary: 


However commonplace such [autistic] processes may 
be in everyday life and in clinical experience, extra- 
ordinarily little is known about them; no one can 
define the laws determining when they will appear 
or vanish, or how they facilitate or inhibit each 
other [p. 283]. 


While the notion of autistic processes in 
person perception has survived in the social- 
psychological literature to the present day 
(sometimes under the label of “facilitative 
distortion,” cf. Jones & Davis, 1965, p. 241), 
little has been discovered concerning the cir- 
cumstances under which such autistic proc- 
esses may be expected to “appear or vanish.” 

One might speculate, however, that im- 
plicit in Levine et al.’s concept of a drive- 
initiated autistic process are the following two 
axioms: (a) Perceptual distortions will occur 
principally along dimensions of the stimulus 
object which are relevant to drive gratifica- 
tion, For example, individuals will distort the 
size of a coin, They will not systematically 
distort the color of coins. Size of coin is as- 
sociated with value and potential need satis- 
faction, while color is not. (b) The direction 
of distortion along drive-relevant dimensions 
will be toward enhancement of the object’s 
potential for satisfaction of the drive. 


i The first implication is in accord with Tajfel’s 
(1957) analysis of the results of the several studies 
that have tested the size-accentuation hypothesis and 
with his theoretical statement of the principles under- 
lying these findings. It is difficult to determine 
whether the second implication (that the direction 
of distortion will be toward enhancement of the 
Objects drive-satisfaction potential) is compatible 
with Tajfel’s theory. Tajfel (1957) predicted that 


A number of additional factors may be 


hypothesized to influence the extent :o which 
such an autistic process will operate in a 
given perceptual situation. One factor that 
may be of potential importance is tic accessi- 
bility of the stimulus object to th eiver, 
In daily life, the extent to which a stimulus 
object qualifies as a potential need «ratifier 
depends on two factors: (a) the extent to 
which the stimulus object is releven! to the 
aroused need (i.e, the extent to which the 


object has the inherent potential for sa\isfying 
the need) and (b) the extent to which the 
stimulus object is accessible to the ig 


as a means of need gratification. One mig 
speculate, , that both the inherent rele- 
vance td the acest of an object should 
determine how drive relevant the object is 
perceived to be, and thus determine the extent 
to which various features of the object should 
be enhanced. 

Previous tests of the perceptual accentua- 
tion hypothesis have been conducted in 
ations in which the stimulus objects—w bet 


b 


itu- 


judged under conditions of high or low drive 
or judged on dimensions relevant or irrelevant 
to the drive under consideration—were com- 
pletely unavailable for consumption or for 


“drive gratification.” One should expect the 
perceptual accentuation hypothesis to be dem- 
onstrated most strongly when the stimulus ob- 
ject is both relevant and accessible. 
The present experiment was designed to 
test the hypothesis that arousal, accessibility, 
relevance interact in determining percep- 
tion of a stimulus person. The hypothesis was 
tested in the context of a heterosexual dating 
situation. The two independent variables were 
varied in the following way. 


there will be an accentuation of the differences be- 

tween valued and neutral stimuli and argued that a 

“functional” mechanism underlies this effect: “Mini- 

mizing the differences entails a risk of confusion; 

accentuation then is additional guarantee of a suc- 

cessful response [p. 197].” Tajfel pointed out that 

the presumption of a functional mechanism leads only — 4-7 

to the prediction that the perceived differences of 

magnitudes between valued and neutral stimuli will 

be larger than the corresponding differences between 

two sets of neutral stimuli. “It does not, however, 1 

allow for the prediction that the valued stimuli 

would be perceived as larger than the neutral ones 

[p. 2001," or, in the present case, permit prediction i 

of the direction that distortion will take. | 
} 
| 


“Arousal: Halí of our male subjects (aroused 
jects) evaluated a girl while sexually 
used; half evaluated her while not so 
used (nonaroused subjects). 
Accessibility: Half of the males in each 
‘arousal group evaluated a girl whom they 
"were to date within the following week (ac- 
cessible subjects). The remainder evaluated a 
girl who was participating in the study but 
with whom they would never have an oppor- 
tunity to interact (nonaccessible subjects). 
Relevance: The three dependent variables 
'varied in relevance, Each man was asked to 
rate the stimulus object on a wide variety of 
‘traits. Some traits were selected because they 
seemed especially relevant for sexually aroused 
males (i.e., a sirl's physical attractiveness and 
EE her sexual receptivity). Other traits were 
© chosen because they seemed irrelevant for 
sexually aroused males (ie., the general 
worability of the girl's personality). 
In order io test for the predicted differen- 
‘tial effect of arousal and accessibility on judg- 
"ments made on relevant and irrelevant dimen- 
sions, a two-factor multivariate analysis of 
- variance with three dependent measures was 
used. 


: METHOD 
Subjects 

Ninety-two males who were enrolled in an intro- 
- ductory psychology course at the University of 

Minnesota participated in the experiment. Seventy- 
- five subjects volunteered for the experiment in return 

for receiving points to be applied to their final exam 
"grade, Due to a shortage of volunteers on this basis, 
the other 18 subjects received $2.50 in addition to 
points. Subjects were preselected to climinate all 
married, engaged, or pinned subjects, as well as those 
who were steadily dating only one girl at the time 
of the study. 


Procedure 


Subjects were scheduled to arrive at the experi- 
mental rooms in groups of four. Each subject in a 
group was randomly assigned to a different experi- 
mental condition, Each subject was seated in a 
separate booth. Subjects were not permitted to con- 
verse with one another during the experiment. 

After the subjects were seated, the experimenter 
explained that she was on the staff of the Center for 
Student Life Studies and that the present study was 
part of an effort to examine the dating practices of 
college students. She told the subjects that each of 
them would be asked to go on one blind date with 
^ a co-ed also participating in the study and that 


SEXUAL AROUSAL AND HETEROSEXUAL PERCEPTION 


their date would be selected on the basis of random j 
assignment. 

Subjects were told that they would be asked about 
their impressions of their partner both before and 
after the date. The first impression was to be made 
after they had examined information concerning the 
girl's background. (Subjects were told that in the 
current study only the males would have background I 
information provided them prior to the date. Ostensi- — — 
bly, the girls participating had agreed to go into the 
dating situation without prior information about F 
their dating partners.) Following the blind date, 
subjects would be expected to return to the labora- 
tory to fll out a number of questionnaires describ- 
ing their postdate impressions. 

‘After this description of the purpose and procedure , 
of the study, subjects were asked to indicate on a " 
"commitment" form their willingness to date which- k 
ever girl they received by process of random assign- I 
ment and, further, to hold in confidence the nature h 
of the background information they would read 4 
about each girl. All subjects agreed to the conditions 
of participation in the study of dating processes. 3 

After collecting and examining the commitment ? 
forms, supposedly to see how many subjects had ` 
agreed to the terms of the study, the experimenter j 
said that she would get the background information 
questionnaires of four girls who had also indicated 
their willingness to participate in the study and 
would make the random pairings. She assured the 
subjects that this would take only a few minutes. 
The experimenter then explained that a graduate 
student in the laboratory where the study was being 
conducted had asked to be allowed to try to enlist 
their cooperation in a pretest for his thesis during the 
time she collected the required number of files and F 
made the random pairings. At this point, the experi- 
menter went to the door and called the graduate stu- 
dent (the second experimenter), and then told the 
subjects she would be back shortly. 

The second experimenter introduced himself as a 

graduate student in the lab and explained that he 
hoped to study sexual behavior the following summer. 
He said that he was in the process of selecting 
written materials that would vary in their potential- 
ity for arousing male undergraduate college students. 
He explained that he wanted the entire continuum 
of arousal represented in his material and that, 
therefore, he needed some materials that undergradu- 
ates would find highly arousing as well as materials 
that they would find only mildly arousing. 

He asked the subjects if, while they were waiting, 
each would be willing to read a short one-page article 
and rate it on a brief form as to its arousability 
value for undergraduates. He stressed that some sub- 
jects might not wish to do this and that all were 
free to refuse if they felt any reservations whatso- 
ever about reading this type of materjal. All subjects 
were willing to read and rate an article. 

He then distributed one brief article and a short 
evaluation form to each student. Two subjects in 
each group read an article depicting a romantic 
seduction scene between two young people. (We 
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j i y whi i he first-in ion in- 

hoped that the subjects would find this article sexu- in fantasy while filling out t 
ay arousing.) The other two subjects read an ventories. Subjects then filled out the inve: 
article describing the sex life of herring gulls. (We 
considered this article to be nonarousing.) Relevant Items 

After reading the article, the subjects filled out the When a man is sexually aroused, two t in a 
evaluation questionnaire that asked the subjects to — pirl-her sexual desirability and her sexual receptiv- 
evaluate the article’s potential for capturing atten- ity—should be especially relevant to his needs (and 
tion and for producing sexual arousal. In the aroused — (hus especially susceptible to drive distortio: 
condition, the last item on the questionnaire re- A number of questions designed to tap the per- 
quested that subjects very briefly, in a few sentences, ceived attractiveness of the stimulus girl were in- 


tell what kind of article they would find sexually 
arousing. In the nonaroused condition, the subjects 
were asked to indicate what kind of story they 
would find interesting but not arousing. This phase 
of the experiment lasted approximately 10 minutes. 

When the first experimenter returned with the 
background questionnaires, the second experimenter 
collected his articles and questionnaires, thanked the 
subjects for helping him out, and departed. 

At this point, the first experimenter told the sub- 
jects that she would give each of them the back- 
ground questionnaires of two girls participating in 
the study and that they should report their first 
impressions of each. She instructed them: 


In addition to giving us your first impression of 
the girl assigned to you as a date, we are also 
going to ask you to give us your first impressions 
of a girl that you will not be dating. We are ask- 
ing you to rate two girls participating in the study 
because we want several opinions of each of the 
girls, 


Thus, all subjects expected that a date would be 
assigned to them. The subjects were told that some 
of them would evaluate their own date first, while 
others would first evaluate a girl they would not be 
dating. They were then given an envelope containing 
background information on the first girl they were to 
evaluate. 

One aroused subject and one nonaroused subject 
received envelopes that indicated that the girl they 
were to evaluate was their blind date. The other two 
subjects received envelopes which indicated that the 
ap they would be evaluating would not be their 

late. 

The envelope which the subject received contained 
a picture of the girl, a “background questionnaire” 
ostensibly completed in the girl’s own handwriting, 
and a “first-impression inventory,” on which the 
subject was to record his first impression of the girl. 
The photograph of the girl that was attached to the 
background questionnaire and the background in- 
formation included on the questionnaire were the 
same for all subjects. The photo was of a pretty, 
blonde co-ed. The girl had described herself on the 
questionnaire as active, fairly intelligent, easy to get 
along with, and moderately liberal. 

After the subjects had looked at the photograph 
and read the girl’s responses to the background 
questionnaire, the photo and questionnaire were re- 
moved by the experimenter. This was done to give 
the subjects the widest possible latitude to indulge 


cluded on the first-impression inventory. Subjects were 
asked to indicate on an 8-point scale how attractive 
they felt the girl was and also how attractive she 
was compared to girls they usually dated. In addi- 
tion, subjects were required to rate the girl on a 
number of characteristics, using a semanti fer- 
ential format. Three of these semantic diffe rential 
scales were relevant to attractiveness: beautiful ver- 
sus ugly; pretty versus mot pretty; and attractive 
versus unattractive. The responses to all five items 
were combined to form an attractiveness indc 


Questions designed to tap the perceived sexual 
receptivity of the stimulus girl were also included on 
the semantic differential portion of the inventory. 


Induded were amorous (versus nonamorous); im- 
moral (versus moral); willing (versus unwilling 
promiscuous (versus unpromiscuous); unwholesome 
(versus wholesome) ; careless (versus careful); n 
(versus nice); durable (versus delicate); and not 
inhibited (versus inhibited). Subjects’ responses to 
these nine items were then combined to form a 
sexual-receptivity index. 


Less Relevant. Items 


The semantic differential also asked the subjects to 
assess the girl on a number of traits that were in- 
tended to be of little relevance to a sexually aroused 
male. These traits were intelligence, cheerfulness, 
neatness, sensitivity, warmth, generosity, friendliness 
open-mindedness, affectionateness, self-confidence, so- 
phistication, and femininity. (Each of these irrelevant 
traits could be classified as socially desirable or un- 
desirable traits for a person to possess) Such a 
classification was made, and the subjects’ responses 
were combined to form an index of favorability of 
a girl’s personality.5 

When the subjects had completed their evaluation 
of the girl, the experimenter collected the inven- 
tories, The experimenter then asked the subjects to 
provide some information about themselves while she 


*Tt is possible, of course, that some subjects may 
have perceived some of these 12 traits to be relevant 
to sexual gratification. What is relevant or irrelevant 
depends entirely on individual perceptions, Although 
we cannot be sure that for every subject every trait 
we designated as irrelevant was perceived as totally 
irrelevant, we do have some confidence that on the 
whole the irrelevant traits were undoubtedly per- 
ceived to be less relevant to sexual gratification than 
were physical attractiveness and sexual receptivity. To 
the extent that our classification is wrong, we simply 
diminish our chances of supporting our hypothesis. 


l 
i 


i the information on the second set of girls 

to rate, The subjects were told that the 

n furnished on their own background 

vould be kept strictly confidential and would 
nished to their dates. 

the subjects had finished filling out the 

naire about themselves, the experimenter 

t she had not yet assembled all the infor- 

n on the second set of girls and suggested that 

the subjects waited they could fill out a form 

ziven out at the end of the experiment. This 

; explained that there had been a deception in- 

in the experiment and invited the subjects to 

vhat it was, They were also invited to write 

»wn any guesses that they might have as to what 

nerimenters expected to find out in this experi- 

` After the subjects had finished filling out this 

form, both experimenters debriefed the subjects. 


RESULTS 
Sexual Arousal Manipulation Check 


1t will be recalled that the rationale given 
to the subjects for reading the stories that we 
hoped would be either sexually arousing or 
nonarousing was that the second experimenter, 
the graduate student, was interested in as- 
sessing the potentiality of each story to sex- 
ually arouse college males. Thus, each sub- 
ject evaluated the arousal value of the story 
he read. These evaluations served as our check 
that aroused subjects (who read the romantic 
seduction story) were indeed more sexually 
aroused than nonaroused subjects (who read 
about the sexual behavior of the herring gull). 

The evidence indicates that the arousal ma- 
nipulation was successful. As compared to 
subjects in the nonaroused condition, aroused 
subjects found their story to be more sexually 
arousing (F = 391.76, P < .01), more interest- 
ing (F = 1797, p < 01), and reported them- 
selves to be far more desirous of engaging in 
sexual behavior after reading the article (F 
= 125.66, p < 01). Nonaroused subjects were 

© In speculating as to the nature of the deception, 
7 subjects responded that they felt it possible that 
they were not really going to get their date as had 
been promised, and 5 subjects felt that there may 
have been a connection between the two experiments, 


only 1 of these subjects correctly guessing the nature 
of the connection, The data, collected from these 


these subjects in order to determine whether or 
not excluding these suspicious subjects affected our 
experimental results in any way- Tt did not.) 
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more likely to report that the story failed to 
hold their attention (F = 60.47, p < .01) and 
that the story was not graphic enough to be 
arousing (F = 29.86, p < 01). It seems clear, 
then, that immediately prior to making their 
evaluations of the girl, aroused subjects were 
more sexually aroused than were the non- 
aroused subjects. 

Before proceeding to the test of our hy- 
pothesis, however, we might ask if these differ- 
ences in sexual arousal were maintained 
throughout the experiment or if they quickly 
dissipated. 

Some evidence pertinent to this question 
was obtained from the background informa- 
tion forms that all subjects completed after 
they had evaluated the girl and just prior to 
the termination of the experiment. The last 
few items of this form concerned the recency 
with which the subjects had engaged in vari- 
ous types of sexual behavior (from necking 
through intercourse). After indicating their 
actual level of sexual activity, the subjects 
were asked to indicate the level of sexual ac- 
tivity that they would consider to be ideal. It 
was reasoned that if aroused subjects were 
still more aroused than nonaroused subjects at 
this point in time, they should report a higher 
ideal level of sexual behavior than nonaroused 
subjects. Thus, a greater discrepancy between 
actual and ideal level of sexual behavior 
should be found in the aroused condition than 
in the nonaroused condition. This was, in fact, 
the case. Although there was no significant 
difference between the two groups in reported 
level of actual sexual activity (F = 2.13, ? 
> 10), the discrepancy between actual and 
ideal level was greater for aroused subjects 
than for nonaroused subjects (F = 4.17, p< 
05). Since the subjects filled out their infor- 
mation forms approximately 45 minutes after 
they had read the arousing materials, this re- 
sult would seem to attest to the strength and 
duration of the sexual arousal manipulation. 


Test of Hypothesis 


It will be recalled that we predicted that 
the arousal (or drive) level of the subjects 
and the accessibility of the girl would differen- 
tially affect the extent to which the subjects 
would enhance the relevant versus the irrele- 
vant characteristics of the girl. Specifically, an 
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Fic. 1. The effect of arousal and accessibility of relevant items (attractiveness 
and sexual receptivity) and irrelevant items. (The higher the number, the 
greater the perceived attractiveness or perceived favorability; the lower the 
number, the greater the ascription of traits associated with sexual gratification.) 


interaction for the relevant dependent mea- 
sures was predicted in which subjects in the 
aroused-accessible condition would rate the 
girl as more attractive and receptive than 
those in the aroused-nonaccessible condition, 
and these cells would show greater enhance- 
ment than the two nonaroused cells, between 
which little or no differences were predicted; 
for the irrelevant dependent measures, only a 
minimal interaction, if any, was predicted. As 
indicated previously, the correct way to test 
this hypothesis, given our design, is via a 
multivariate analysis of variance (cf. Hays, 
1963). 

The appropriate means are graphically pre- 
sented in Figure 1. Statistical analysis indi- 
cated that the independent variables, arousal 
and accessibility, did affect the relevant de- 
pendent variables, attractiveness plus sexual 
receptivity, differently than they affected the 
irrelevant dependent variable, favorability of 
girl’s personality (multivariate interaction F 
= 4,62, df = 1/76, p < .03). Neither arousal 
alone nor accessibility alone had a significant 
effect on perception (multivariate Fs = .48 
and .26, respectively). 

Examination of Figure 1 suggests that con- 
trary to our expectations, the two relevant de- 


pendent variables were differentially affected 
by the arousal and accessibility manipulat 
A second multivariate analysis, conduct 
test for this possibility, provided marginal evi- 
dence that there was indeed differential re- 
sponse to the two relevant dependent measures 
(multivariate interaction F = 3.13, df = 1/ 
76, p < .08). For this reason, and because the 
nature of the obtained overall interaction was 
not entirely as expected (eg., nonaroused 
subjects appear to have been affected by the 
accessibility of the stimulus girl, although in 
different fashion than for aroused subjects), 
additional analyses were undertaken to further 
examine the data. 


Separate Analyses of Three Dependent 
Variables 


Attractiveness. It will be recalled that it 
was predicted that arousal and accessibility 
would strongly interact in determining how 
much subjects enhanced the girl’s attractive- 
ness—a characteristic relevant to sexual grati- 
fication. An analysis of variance performed on 
the attractiveness data (see Figure 1) indi- 
cated that arousal and accessibility did not 
significantly interact in determining how at- 
tractive the subjects judged the girl to be 


{ 


(interaction F = 05, ns). (Thus, we have 
confirmation that our first relevant dependent 
measure did not contribute to the expected 
jnteraction.) Nor was the accessibility main 
effect significant (F = .05, ns). Rather, it ap- 
pears that the sexual arousal manipulation 
alone affected the extent to which the girl’s 
attractiveness was enhanced (F = 4.37, P< 
05). Subjects in the aroused conditions found 
the girl more attractive than did subjects in 
the nonaroused conditions. 

Sexual receptivity. It was predicted that 
arousal and accessibility would interact in 
determining how much the subjects enhanced 
the girl's sexual receptivity—a characteristic 
relevant to sexual gratification. An analysis of 
variance performed on receptivity data (see 
Figure 1) indicated that arousal and accessi- 
bility did significantly interact in determining 
how sexually receptive the subjects perceived 
the girl to be (interaction F = 6.24, P < 02). 
Subjects in the aroused-accessible condition 
were more likely to perceive the girl as being 
sexually receptive than were subjects in the 
aroused-nonaccessible condition. This differ- 
ence between the accessible and nonaccessible 
conditions was reversed for the subjects in the 
nonaroused conditions. 

Neither arousal alone nor accessibility 
alone affected perception of the girls’ sexual 
receptivity (aroused F = .09; accessibility F 
= 23)! 

General favorability of personality. ]t was 
predicted that arousal and accessibility would 
interact only weakly—or not at all—in de- 
termining how favorably subjects would per- 
ceive the girl's person: ity characteristics es- 
sentially irrelevant to sexual gratification. As 


TWith respect to analyses of the individual items 


illing than non- 
showed evi- 


showed evidence of an interaction of the type de- 
scribed for the total receptivity index. Several of 
these individual interactions were statistically signifi- 
cant: (*amorous" F= 12.95, p < 01; “careless” F= 
646, p — 05; "nasty" p-can,«€10 


F=5.26, p € 05). 
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predicted, arousal and accessibility did mot 
strongly interact in determining how favor- 
ably subjects perceived the girl to be along 
presumably irrelevant dimensions (F = 144, 
ns), and the means were not lined up in the 
direction predicted for relevant items. 
Neither the arousal mor the accessibility 
main effect was significant (arousal F = 1,99; 
accessibility F = .16).* 


DISCUSSION 


The results of this study provide some sup- 
port for Levine et al.'s (1942) general hy- 
pothesis that high drive initiates an “autistic” 
process “operating in the direction of gratifi- 
cation.” The results also suggest some guide- 
lines that may prove useful in predicting the 
occurrence and nature of such processes in 
person perception. 

If it is presumed (as seems safe in this 
culture) that sexual relations with an attrac- 
tive girl are expected to be more satisfying 
than sexual relations with an unattractive girl, 
the greater perceived attractiveness of the 
stimulus girl under conditions of sexual 
arousal is compatible with the notion that 
drive-initiated autistic distortion takes the 
form of enhancing the stimulus object’s poten- 
tial for drive gratification. 

This result supports and perhaps extends à 
similar finding in an experiment conducted by 
Epstein and Smith (1957). Although primar- 
ily interested in investigating the effect of 
sexual drive upon fantasy, these investigators 
also examined the effect of sexual drive upon 
perception of the attractiveness of stimulus 

8 No significant interaction or main effects were 
found for the following items making up the favor- 
ability index: intelligence, cheerfulness, neatness, 
sensitivity, warmth, generosity, friendliness, open- 
mindedness, affectionateness, and self-confidence. On 
the two remaining items—sophistication and femi- 
ninity—an arousal main effect was found (F for 
sophistication = 547, p < 05; F for feminine = 4.97, 
p< 05). Aroused males perceived the girl to be more 
sophisticated and more feminine than did non- 
aroused males. ‘Arousal X Accessibility interaction ef- 
fects of the same nature as that obtained on the 
receptivity index were also found (sophisticated F 
—5$47, p<.05; feminine F = 3.06, f < Q9). Sub- 
jects in the aroused-accessible condition found the 
stimulus girl to be less sophisticated and less femi- 
nine than did subjects in the arousal-nonaccessible 
condition; this effect was reversed for subjects in 
the nonarousal cells. 
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women. Rather than manipulating sexual 
arousal, Epstein and Smith estimated the 
strength of an individual’s sex drive on the 
basis of the subject’s reported rate of sexual 
orgasm. They requested their subjects to rate 
on a 5-point scale the sex appeal of three 
scantily clad women whose pictures were pro- 
jected successively on a screen. A positive re- 
lationship was found between the tendency to 
rate the women in the pictures as sexually ap- 
pealing and rate of orgasm; that is, the more 
sexually experienced the subject reported him- 
self to be, the more likely he was to rate the 
women as sexually attractive. 

One way in which the Epstein and Smith 
results may be interpreted is that the high- 
rate subjects possessed chronically high sexual 
drives; in the interest of need fulfillment, 
these subjects, like our sexually aroused sub- 
jects, saw the depicted women as more at- 
tractive than did the low-rate subjects. There 
are other possible interpretations of their 
findings, however (e.g., as has been demon- 
strated by Leiman & Epstein, 1961, high- 
tate subjects may have more accepting atti- 
tudes toward sex and thus may be less in- 
hibited in ascribing sex appeal to women de- 
picted in a laboratory setting; having just 
proclaimed their sexual prowess on a ques- 
tionnaire, high-rate subjects may feel a need 
to make their ratings of sexual appeal con- 
sonant with their extensive experience by rat- 
ing the women as sexually appealing, etc.). 

In the context of our own study, we investi- 
gated the effect of sexual experience versus 
inexperience on the tendency to find the 
stimulus girl attractive, We found that the 
more sexually experienced males in our study 
(determined by self-report of frequency and 
recency of sexual experience) found the stim- 
ulus girl less attractive than did the inexperi- 
enced males (r = —.21, p < .05), Thus, while 
sexual arousal was definitely related to per- 
ceived attractiveness of the stimulus girl, sex- 
ual drive as measured by reported magnitude 
of sexual experience was related to perceived 
attractiveness of the stimulus girl in the oppo- 
site fashion, This result suggests that sexual 
arousal may have mediated the Epstein and 
Smith finding; that is, sexually experienced 
males may have been aroused more easily by 
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scantily clad women in a laboratory setting, 
and arousal then led to enhanced atiractive- 
ness. 

The absence of an Arousal Accessibility 
interaction on the attractiveness index is per- 
plexing, however, since it seems quite clear 
that accessibility does make a difference in 
whether or not subjects perceive the girl as 


potentially receptive to engaging in sexual 
behavior on the first date. 
In this connection, we must comment on the 


nature of the sexual receptivity interaction, 


We had predicted that since accessible gb- | 
jects have more relevance for drive gratifita- 


tion than inaccessible objects, distortion along 
drive-relevant dimensions in the arousal-ac- 
cessible condition should be greater than in 
the arousal-nonaccessible condition. This was 
the case. We had thought, however, that under 
conditions of nonarousal, the accessibility or 
inaccessibility of the stimulus would have 
little effect on perception of the stimulus. This 
was not the case. Contributing to the sexual 
receptivity interaction was a tendency for 
nonaroused subjects to perceive their date as 
less immoral, promiscuous, etc., than the girl 
they were not to date. If it is assumed that it 
is not generally socially desirable for a girl to 
be immoral, promiscuous, or nasty, the non- 
aroused subjects showed a tendency to en- 
hance the general favorability of the girl they 
were to date as opposed to the nondate (al- 
though on other nonsexual items, there was no 
general effect strong enough to reach signifi- 
cance), 

Such general favorability enhancement due 
to anticipated contact with another might be 
expected from the results of other studies 
(eg. Darley & Berscheid, 1967). But the 
present results clearly suggest that anticipated 
interaction alone does not inevitably lead to 
perceptual enhancement of the favorability of 
the person. Therein, perhaps, lies the useful- 
ness of the general principle that it is the 
stimulus object's potential for drive gratifica- 
tion that is enhanced, not simply its general 
favorability. 

In most cases, of course, social interaction 
With a person possessing characteristics re- 
garded as socially desirable may be expected 
to lead to greater reward than interaction 


with a person possessing socially undesirable 
“traits. In the current case of aroused sexual 
“need, however, we have perhaps stumbled onto 
— one of those cases in which interaction with a 
| person possessing à few socially undesirable 
traits, of a prescribed nature, is likely to be 
more immediately rewarding than interaction 
with a completely socially desirable person. 
- The “sexual revolution" notwithstanding, the 
Ew Characteristics associated with sexual relations 
- on a first date may be still generally unfavor- 
able and may fall into the “bad girl” syn- 
drome (cf. Wolfenstein & Leites, 1950). Thus, 
the distortion that takes place, while in the 
“direction of enhancing the object's potential 
for drive gratification, may occasionally be at 
the same time in the direction of decreasing 
the general favorability of the object. It might 
be noted that the results of an additional item 
included on the first-impression inventory sup- 
- port the notion that aroused subjects per- 
— ceived their dates both as potential objects of 
"sexual gratification and perhaps less favorably 
overall than did those who were nonaroused. 
Subjects were asked how likely they would be 
to ask the girl out again, assuming that the 
=) ‘ first date “went well.” An Arousal X Accessi- 
bility interaction was obtained such that 
aroused-accessible subjects felt they would be 
less likely to ask her out again than aroused- 
nonaccessible subjects; the reverse effect was 
obtained for nonarousal conditions (S T32 


p « 01). 
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EFFECTS OF FREQUENCY OF EXPOSURE OF CHANGING 
AND UNCHANGING STIMULUS PAIRS ON 
AFFECTIVE RATINGS * 


ALBERT A. HARRISON,; ROBERT M. TUTONE, Ax» D. GAYLE McFADGEN 


University of California, Davis 


From the response competition interpretation of the attitude-enhancing effects 
of exposure, it was hypothesized that exposure of a stimulus coupled with 
consistent and redundant contiguous associates should enhance favorability, 
whereas exposure coupled with inconsistent and changing contiguous as- 
sociates lowers favorability. In Experiment I, ambiguous names were pre- 
sented 2, 4, and 16 times with either two identical associates for each presenta 
tion or with two incompatible associates that were changed for each presenta- 
tion. Names paired with consistent and redundant associates received more 
favorable ratings (P < .01). Experiment II suggested that these results are 
not explainable in terms of a "transfer of liking" from repeatedly presented 
associates. In Experiment III, ideographs were presented 2, 4, and 16 times 
with (a) the same associate, (5) no associate, (c) one of two associates, or (d) 
a different associate for each presentation. In support of the response com- 
petition hypothesis, ideographs in Conditions a-d received successively lower 
ratings ($ «.025). The overall frequency effect (p <.01) suggested pairing a 
stimulus with inconsistent and changing contiguous associates impairs, not 


reverses, the exposure-favorability relationship. 


Since the time of Fechner (1876), nu- 
merous but sporadic studies have related the 
frequency of exposure of a stimulus to the 
favorability ratings accorded it. Some of these 
studies were concerned with the formation of 
aesthetic preferences (e.g, Krugman, 1943; 
Maslow, 1937; Moore & Gilliland, 1924; 
Mull, 1957; Pepper, 1919; Verveer, Barry, & 
Bousfield, 1933). They found, for example, 
that exposure to musical selections led to im- 
proved ratings of them (e.g., Moore & Gilli- 
land, 1924; Mull, 1957; Verveer et al., 1933). 
Other studies were conducted within the con- 
text of consumer motivation, Becknell, Wilson, 
and Baird (1963) found that female subjects 
had increasingly favorable reactions to ficti- 
tious brand names as these brand names were 
made familiar. Still other studies attempted to 
relate word frequency and goodness of mean- 
ing in order to better understand perceptual 
threshold phenomena (e.g., Frincke & John- 


1 This research was supported by National Science 
Foundation Grant GS-2791 to the first author. The 
authors are indebted to Dennis Mahar and Joseph 
Stewart for their assistance. 
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*Now with the Kaiser Foundation, 


son, 1960; Johnson, Thomson, & Frincke, 
1960; Wilson & Becknell, 1961). For the rost 
part, they found a close positive relationship 
between word frequency and rated goodnes: of 
meaning. 

The results of such studies led Zajonc 
(1968) to formulate the hypothesis that 
“Mere repeated exposure of the individual to 
a stimulus enhances his attitude toward it [p. 
1].” At the time of Zajonc's writing, many of 
the earlier studies could be considered the 
product of outmoded methodology. Conse- 
quently, Zajonc reported a series of his own 
studies which provided (a) correlational evi- 
dence that things or events with a high prob- 
ability of occurrence in the environment re- 
ceive more favorable ratings and (5) experi- 
mental evidence that variations in the number 
of exposures of a stimulus produce concomi- 
tant variations in favorability ratings, such 
that rated favorability increases as a. function 
of the logarithm of the frequency of exposure: 
A tentative explanation of this “exposure ef- 
fect” has been offered by Harrison (1967, 
1968b, 1969), Harrison and Zajonc (1970), 
and Matlin (1970). They suggest that when 
the perceiver is first confronted with a novel 
stimulus, a number of antagonistic response 
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FREQUENCY OF EXPOSURE ON AFFECTIVE RATINGS 


are tenuously elicited, producing a 
ite (response competition). As a 
tate, response competition produces 
of the stimulus and motivates ex- 
behavior aimed at eliminating the 
competition, Subsequent exposure to 


the ius provides opportunity for some of 
the onse tendencies to be strengthened, 
while others are weakened or crowded out. As 


one onse gains dominance, response com- 
pe is decreased, reducing both disliking 
ar ivation for exploratory behavior. 


esent, the response competition hy- 
p : is supported by evidence that as re- 


S competition decreases, attitudinal rat- 
in e enhanced (Harrison, 1968b; Harrison 
& -aionc, 1970; Matlin, 1970) and that 
curiosity as assessed by “approach” behavior 
is inversely related to liking, as assessed by 


semantic rating scales (Harrison, 1968b; 
Ha-rison & Hines, 1970). Thus far, only Mat- 
ıs successfully attempted to influence 
response competition. Matlin hypothesized 
that as the number or strength of alternative 
responses is increased, response competition 
will be enhanced and attitudinal ratings sub- 
rently lowered. In one experiment, Matlin 
! subjects learn one, two, three, or six re- 
ises to initially novel stimuli. She found, 
expected, a relationship such that stimuli 
iated with fewer response alternatives 
:eived more favorable ratings. In a second 
experiment, Matlin presented nonsense words 
and had her subjects pronounce them aloud. 
Some nonsense words were pronounced the 
same way and others in different ways over 
successive presentations. Again, she found an 
inverse relationship between number of re- 
sponses to the stimulus and favorability of 
stimulus ratings. In both of these experiments, 
the subjects were forced to make overt re- 
sponses. The experiments reported here also 
attempted to influence response competition, 
but within a context where the role of the 
subjects is relatively passive. Under these lat- 
ter procedures, any contributions to affective 
ratings of task difficulty independent of re- 
sponse competition should be minimized. 

The exposure research that led to the 
postulation of an exposure-liking relationship 
was of one type, that is, “mere” exposure, 
which involved presentation of test stimuli in 
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varying frequencies and then obtaining affec- 
tive ratings. In these experiments, attempts 
were made to hold all other stimuli constant, 
With such procedures, each successive expo- 
sure was likely accompanied by relatively few 
or weak new associations, with the result that 
the reduction of response competition was 
facilitated. But in the natural setting, re- 
peated exposure to a stimulus is likely to be 
accompanied by varying environmental condi- 
tions, that is, by new stimuli that elicit 
numerous and perhaps strong associations. 


These contiguous stimuli may or may not — 


facilitate the reduction of response competi- 
tion and negative affect. (a) Exposure of a 
stimulus may be accompanied by contiguous 
stimuli that are consistent, in the sense that 
they are unlikely to elicit competing responses 
over successive exposures. For example, the 
exposure of one stimulus may always be 
associated with Contiguous Stimulus A, while 
the exposure of another is always associated 
with Contiguous Stimulus B. Exposure accom- 
panied by consistent contiguous stimuli, which 
provide unchanging cues, should aid in the 
selection of a response to the original stimu- 
lus, thereby helping to reduce response com- 
petition. (b) Exposure may be accompanied 
by contiguous stimuli that are inconsistent, 
that is, likely to elicit new and competing 
responses over successive stimulus exposures. 
For example, a stimulus may first appear 
with Contiguous Stimulus A, on second ap- 
pearance with Contiguous Stimulus B, and on 
third appearance with Contiguous Stimulus C. 
Under these conditions, the contiguous stimuli 
should, in effect, send “mixed messages," 
thereby inhibiting the reduction of response 
competition and impairing the enhancement 
of liking over successive stimulus exposures. 

Thus, an implication of the response com- 
petition hypothesis is that repeated exposure 
of a stimulus that is accompanied by con- 
sistent or unchanging contiguous stimuli 
should enhance attitudinal liking, while expo- 
sure coupled with inconsistent or changing 
contiguous stimuli should retard the exposure- 
liking relationship. In the present experiments, 
emphasis was thus on the role of contiguous 
stimuli with which “test” stimuli of varying 
frequencies were presented. 
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A requirement for testing our hypotheses 
was the selection of stimuli for exposure that 
were highly ambiguous and likely to elicit 
conflicting responses. Under these conditions, 
consistent contiguous stimuli should prove 
most helpful, and inconsistent contiguous 
stimuli should prove most harmful for the 
process of reducing response competition. In 
this first experiment, the test stimuli con- 
sisted of first names that were ambiguous, in 
that when spoken aloud, they were equally 
suitable for males or females. The contiguous 
stimuli were  tachistoscopically presented 
graduation portraits from a high school year- 
book. 

In the consistent condition, as a given name 
was presented, two identical pictures of the 
same person were simultaneously presented, 
and this same pair of portraits was again 
presented on each subsequent presentation of 
the name. This procedure was expected to 
aid in the reduction of response competition 
and enhance liking. In the change condition, 
as the name was presented, two pictures of 
different people (male and female) were 
simultaneously presented, and this pair of 
portraits was changed for each successive 
presentation of the name. This second proce- 
dure was expected to hamper the reduc- 
tion of response competition and retard the 
exposure-liking relationship. 


EXPERIMENT I 
Method 


Subjects. Subjects were 36 volunteers from intro- 
ductory psychology classes at the University of 
California, Davis, who were given class credit for 
participation if they so requested. None had earlier 
participated in exposure research. 

Stimuli. The test stimuli presented in varying fre- 
quencies and subsequently rated were ambiguous first 
names. Boy or Girl? Names for Every Child 
(Thomson, 1961) was searched to select first names 
which, when spoken, were ambiguous as to the sex 
of the referent. Thirty-eight such names were found 
which the authors agreed fulfilled this criteria, This 
pool included both names that are spelled alike when 
applied to either gender (such as Pat) and homo- 
phonic pairs of names that are pronounced the same, 
although spelled different, according to gender (such 
as Gene and Jean). To reduce this pool to the 12 
required for the experiment, names that the authors 
considered bizarre and names that could be applied 
to either gender but which, in the judgment of the 
authors, generally were not, were arbitrarily elimi- 
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nated. To ensure that aesthetic preferences did mot 
govern this selection, the original pool of 38 names 
was rated on 7-point like-dislike scale a panel 
of 36 independent judges. Ratings accord ur sub- 
sample of 12 clearly reflected the di tion of 
ratings accorded the original 38 names 

To provide relevant stimuli to be contiguously 


presented with the names, an obscure /f-state 
high school yearbook was selected. Indiv gradu- 
ation portraits measuring 185 X2/& in were 
photostatically reproduced and then preps n two 
forms. First, they were grouped togett n two 


ad-lib 


] mea- 


large pieces of cardboard for a preexp: 
exploration session which served as a cont 
sure later to be described. Second, they we unted 
in pairs on index cards for tachistoscopie) »rcsenta- 
tion when the associated name was verbalized. For 
the consistent condition, two identical pictures of the — 
same individual were centered on the index card. 
Half of these pairs were pictures of the same male; 
the other half, pictures of the same female. (Since 
pairs were required in the change condition, pairs 
were used in the consistent condition to control for 
the amount of visual field containing stimulus ma- 
terials.) For the change condition, a pool was cstab- 


lished by mounting a picture of one male one 
female on the center of the card. For half of the 
cards in the change pool, the male was on the left; 


for the other half, the male was on the right 

Equipment. The pairs of portraits were presented 
by means of a Cambridge tachistoscope, bavi a 
visual field area of 3 X 5 inches with a bright of 
approximately 5 footlamberts. The tachistoscope was 
mounted on a table, protruding through a partition 
which separated the subject and experimenter. To 
obtain ratings, 7-point like-dislike and dislike-like 
scales were prepared. 

Procedure, A within-subjects design was used. Each 
subject heard four names in each of the frequencies 
2, 4, and 16. Within each írequency category, two 
names were presented with redundant and repetitious 
consistent stimuli and two names with inconsistent 
and varying change stimuli. Thus, within each fre- 
quency, for each subject, there were two replications 
of the consistent and change treatments. In the 
consistent condition, the contiguous associates were 
the two identical portraits of the same person pre- 
sented tachistoscopically each time that the name | 
was verbalized. For one replication, the paired 
identical portraits were of the same man; for the 
other replication, they were of the same woman. 

In the change condition, the contiguous portraits 
were of different people, one male and one female. 
They were replaced with a novel pair over succes- 
sive repetitions of the name. For one replication of & 
the change condition, the male portrait was on the 
left; for the other, the male was on the right. 

A counterbalanced design was used to control 
initial favorability independent of treatment, Each 
name appeared in each replication of the consistent 
and change conditions under each frequency of expo- 
sure an equal number of times. Order of presenta- 
tion of the names was initially randomized and then 
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a tant across subjects. Contiguous stimuli 


miy selected for each subject with the 


were 
restri that once selected, the pairs of identical 
portra d in the consistent condition had to 
appear cach frequency category before returning 
to th | 

Subjects were brought into the experimental cubicle 
and seated before the tachistoscope. In this and the 
subst ;t experiments, all subjects were first read 
the chology department's rules governing the 
rights of human subjects in psychological research. 
Sut were then told that the experiment would 
be cied in several parts. First, they were given 
car ntaining all of the portraits later used in the 
“exposure” phase of the experiment, and a period 
oí ad-lib exploration began that was terminated when 
the subject indicated “feeling comfortable with the 


faces” by returning the cards. This phase was in- 
cluded to provide some control on the familiarity 
of the portraits, hopefully extracting from the subse- 
quent affective ratings of the names contributions 
made by the differential familiarity of the contigu- 
ous portrait, Next, the exposure phase commenced. 
Instructions were to listen and watch while the 
experimenter read the names aloud and presented 
the pictures. The experimenter, who had been 
coached in pronouncing the names neutrally, spoke 
the names and simultaneously presented the appro- 
priate pair of portraits by means of the tachistoscope 
which had been set for 40 milliseconds. About 3 
seconds elapsed between presentations. 

Following exposure, half of the subjects were 
given the like-dislike scales and the remainder the 
dislike—like scales. Subjects were told that the experi- 
menter would call out each name and that the sub- 


presentation of the names was random and unrelated 
to any ordering sequence during the exposure phase. 
During this phase; portraits were not presented. 


Results 


Affective ratings were scored from 0 to 6, 
with higher numbers indicating greater favor- 
ability. Figure 1 presents the mean affective 
ratings of the names as à function of (@) fre- 
quency of exposure and (b) the “conflict 
condition,” that is, having been paired with 
consistent and change contiguous stimuli. Sig- 
nificance was tested by means of an analysis of 
variance. Tested against the Subjects X Con- 
fict interaction, the difference between con- 
sistent and change conditions was signifi- 
cant (F = 12.22, df = 1/33, p < 01). Tested 
against Subjects X Frequency, the main effect 
for frequency was 3.01 (df = 2/66, ns). The 
Frequency X Conflict interaction Was also 


insignificant (F < 1). 
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Fic. 1. Ratings of ambiguous names as a function 


of exposure frequency and earlier pairing with con- 
sistent or changing portraits. 


Discussion 


In this experiment, the main effect for fre- 
quency of exposure approached, but did not 
quite attain, significance. One possibility, with 
greater than usual probability, is sampling 
error. Prior to the experiment, a pilot study 
was conducted which addressed several issues, 
one of which was whether or not ratings of 
the ambiguous names could be enhanced as 
a function of repeated stimulus exposure. Al- 
though we did not consider all these names 
to be novel, we thought that their ambiguity 
would initially arouse strong response com- 
petition which would subsequently be re- 
duced over successive stimulus exposures. In 
this pilot study, 36 subjects similar to those 
used in Experiment I heard 6 names twice, 
6 names 4 times, and 6 names 16 times, and 
then rated them. With this sample of 18 
names, drawn from the original pool of 38 
names, higher frequency of exposure was 
associated with more favorable ratings at all 
data points; this was significant when tested 
against the Subjects X Frequency interaction 
(F = 7.08, df= 2/68, p < .01). Perhaps our 
procedures for «refining? and reducing this 
pool from 18 to 12 for use in Experiment T 
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attenuated the exposure-liking relationship, or 
perhaps unidentified individual differences 
weakened the effect. However, Figure 1 sug- 
gests a very strong “trend” toward favorabil- 
ity enhancement as a function of repeated 
stimulus presentations, particularly under 
consistent conditions. 

The substantial differences between the 
consistent and change conditions in Experi- 
ment I make it clear that within the context 
of the present experiment, stimuli presented 
with redundant and unchanging contiguous 
stimuli received more favorable ratings than 
stimuli presented with inconsistent and 
changing contiguous associates. This finding 
is in full accordance with the response 
competition hypothesis. 

There remains, however, an alternative inter- 
pretation of this result that demands con- 
sideration. Although the contiguous associates, 
that is, portraits, were presented for 40 milli- 
seconds per exposure, the results of Harrison 
and Hines (1970) suggest that even 40- 
millisecond exposures can enhance ratings. 
This raises the possibility that as a function 
of mere successive exposures, the consistent 
portraits were affectively enhanced, and that 
this favorability was reflected in subsequent 
ratings of the associated consistent names. In 
the change condition, where the names were 
presented with ever-changing portraits, such 
enhancement could not occur for the portraits 
and thus could not be reflected in the subse- 
quent ratings of the change names. "Thus, in 
the consistent condition, the subjects could 
develop liking for the portraits over succes- 
sive exposures, and this positive regard might 
transfer to the associated name. If transfer 
of affect were likely, our results would be 
less clearly in favor of the response competi- 
tion hypothesis. We would have, instead, 
merely a “double” exposure effect, 

In Experiment T, the preexposure, ad-lib 
exploration phase was included as a control 
on this possibility. In general, the curve for 
the exposure-liking relationship sharply de- 
celerates (Zajonc, 1968). For this reason, if 
a subject were shown an initially familiar 
stimulus 16 times, we would expect the incre- 
ment in favorability to be minute, In the 
above experiment, the portraits were familiar 
before the names were presented, since prior 
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to the exposure phase, the subjects looked at 


the portraits until they felt ifortable 
with them. However, the possibiity of a 
"transfer of affect” interpretation c! Experi- 
ment I still seemed of sufficient rest to 
warrant a test of its plausibility. 
EXPERIMENT II 

In Experiment II, it was asked w^c!her the 
affective ratings of fest stimuli which had 
been presented a constant number times 
would be affected by earlier pai with 
familiarization stimuli which had been pre- 
viously presented a varying number times. 
That is, a deliberate attempt was wade to 
influence the affective ratings of stimuli by 


pairing them with other stimuli of differing 
exposure frequencies. If the affective «tings 
of the constantly exposed test stimiei? are 
enhanced by earlier exposed familiarization 


stimuli, the transfer-of-affect interpretation 
gains plausibility, for “transfer” between two 
variably exposed stimuli would be quite 


possible too. 
Method 


Subjects. Subjects were 48 students from the intro- 
ductory psychology classes at the University of Cali- 
fornia, Davis. They were given class credit for their 
participation if they so requested, 


Stimuli. Twenty-four high school graduation por- 


traits were photostatically reproduced, mounted on 
index cards, and inserted individually and in pairs 
into plastic holders. 

Procedure. In this experiment, the subject sat 


across from the experimenter, separated by a 
shoulder-high partition. First, two familiarization 
portraits were presented in each of the frequencies 
of 1, 2, 5, 10, and 25, while two 0-frequency por- 
traits were held in reserve for the rating phase. The 
design was counterbalanced, such that each familiari- 
zation portrait appeared in each frequency an equal 
number of times. In the second phase, the differen- 
tially exposed familiarization portraits were pre- 
sented in pairs, with novel test portraits 3 times 
each, the order of appearance of the pairs being 
randomized. Over the course of the experiment, each 
portrait served as both a familiarization and test 
stimulus in each frequency; and across subjects, 
pairings were continually changed to produce as 
many combinations of pairs as possible. Finally, 
there was the rating phase, where all subjects first 
rated test stimuli and then rated the familiarization 
stimuli, with alternate subjects using alternate forms 
of the scales. The order of rating of the test and 
familiarization stimuli was not counterbalanced, since 
multiple ratings of associated stimuli are suspected 
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encing each other, and the critical ratings 


of the stimuli were to be kept uncontaminated 
Res: 

Results of Experiment II showed that the 
ratines of the test stimuli which had been 
exp a constant number of times were not 
afiected by the earlier pairing with familiari- 
zation stimuli which had been exposed differ- 
en umbers of times. There was not the 
vi systematic ordering of means (F< 1, 
d 188). On the other hand, for the 
familiarization stimuli, there was an “expo- 
sut ct" despite the fact that the famil- 


jarization stimuli were always rated last 
(F — 2.51, df = 5/488, p < 05). The mean 
renes of familiarization stimuli exposed 5, 
1 nd 25 times were higher than the mean 
ratings of stimuli presented once, twice, or 
not at all. 


Discussion 
Experiment II, where a transfer of 
affect was sought, pairing of a stimulus ex- 
posed for a constant number of times with 
a different stimulus earlier exposed for a vary- 
; number of times had no effect. We by no 
3 means suggest that a transfer of affect from 
a high- to a low-frequency stimulus is impos- 
sible. Tf frequency of exposure were increased, 
if the pairings of the associates were strength- 
ened by forcing the subject to make overt 
responses, or if the two stimuli were presented 
simultaneously for rating, such an effect might 
| emerge. But in Experiment I, the maximum 
number of exposures Was less than in Experi- 
ment II; there was no effort made to force 
the subject to learn the pairings, just as there 
was no such effort in Experiment TI; and the 
names, like the test stimuli in Experiment Il, 
were presented alone for rating. For these 
reasons, the results of Experiment TI de- 
crease the plausibility of a transfer-of-affect 
interpretation of the high ratings accorded 
consistent stimuli in Experiment L 


ExrERIMENT IH 


Experiment III was conducted to corrobo- 
rate and extend the findings of Experiment I. 

In Experiment HI, we asked if by using en- 

tirely different stimuli and procedures, results 
similar to those in Experiment I could be pro- 
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duced. To extend the earlier results, Experi- 
ment III involved a comparison of four 
degrees of conflict (ie, consistency and 
change). Each of these treatments was defined 
by the nature and type of the contiguous 
stimuli, In all conditions, the variably exposed 
test stimuli were Chinese ideographs, and the 
contiguous associates, nonsense words. In the 
consistent condition, presentation of a given 
ideograph was always accompanied by the 
simultaneous presentation of the same non- 


sense word. Because they were to be associ- 


ated with unchanging contiguous stimuli, 
which should aid in the reduction of response 
competition, these ideographs were expected 
to receive the highest affective ratings. In 
the control condition, the ideograph was al- 
ways presented alone. Since in the control 


condition no attempts were made to aid or 


hinder the reduction of response competition, 
it was expected that control stimuli would 
receive lower ratings than the consistent stim- 


uli but higher affective ratings than those 


accorded stimuli in the two remaining condi- 
tions. In the first of these conditions (alter- 
nating), on half of its presentations the ideo- 
graph was associated with one nonsense word, 
on the other half, with another. This was done 
in such a way that the two nonsense words 
would alternate on successive presentations 
of the ideograph. In the final, or change, 
condition, the contiguous nonsense word was 
replaced with an entirely new one for each 
presentation of the ideograph. We expected 
the ideographs associated with alternating 
and change nonsense words, respectively, to 
receive the lowest affective ratings. 


Method 


Subjects. Subjects were 84 undergraduates from 
the University of California, Davis. Subjects were 
drawn from introductory psychology classes and, 
upon depletion of that pool, from residence halls. 
Excluded were students who had participated in 
earlier research of this type and all subjects claiming 
knowledge of oriental languages. 

Stimuli. Stimuli consisted of the 12 Chinese ideò- 
graphs used by Zajonc (1968) in his initial research 
on the exposure-affect relationship. Each ideograph, 
measuring 1 X 13 inches, was photostatically repro- 
duced and mounted on index cards, positioned equi- 
distant from the sides, but only 3 inch from the top. 

A total of 31 contiguous associates was required: 
3 for the consistent condition, 6 for the alternating 
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condition, and 22 for the change condition. To pro- 
vide this pool, the 12 trisyllabic, seven-letter non- 
sense words used by Zajonc (1968) in his earlier 
exposure research were used, as well as the remainder 
of the pool from which these words had been drawn 
(Solomon & Postman, 1952). Since this still did 
not prove sufficient, the investigators fabricated the 
remaining words, adhering as closely as possible to 
procedures designed to ensure that all words were 
visually and auditorially distinct from one another. 
Each word was then typed in capitals on a slip 
of paper, and each slip of paper was adhered to 
an individual clear plastic template to be placed over 
the cards containing the ideographs. 

Apparatus. The apparatus consisted of the Cam- 
bridge tachistoscope used in Experiment I. Again, 
the tachistoscope was mounted on a table protruding 
Írom a partition which hid the experimenter from 
the subject. In this experiment, the tachistoscope 
was not used in its capacity for very brief exposures, 
but it was used as a convenient device for displaying 


stimuli for a constant period of time. 


Procedure. A within-subjects design was used. Each 
subject saw four ideographs each in the frequencies 
of 2, 4, and 16. Within each frequency, one ideograph 
was associated with each manipulation. In the con- 
sistent condition, each ideograph was accompanied 
by the same nonsense word on each exposure. In 
the control condition, the ideographs were presented 
alone. In the alternating condition, each ideograph 
Was accompanied by one nonsense word on succes- 
sive even-numbered trials and by a second nonsense 
word on successive odd-numbered trials. In the 
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change condition, the associated nonsense word was 
changed for each presentation of the ideow: iph. The 
design was counterbalanced, such that each deograph 
appeared in each frequency under each condition an 
equal number of times. 

To form pairings between ideographs and words, 
words were initially shuffled and then drawn from 


the pack for assignment. Once drawn for the con- 
sistent or alternating conditions, however, the words 


had to appear in each frequency within that condi- 
tion. Counterbalancing in this latter case was such 
that while over subjects the ideographs were rotated 


clockwise, the words were rotated countercloch wise, 
thus minimizing the number of times any gii 
graph appeared with any given word. ( 
presentation of the pairs was initially randi 
and then held constant across subjects. 

The subject was seated before the tachistoscope 
and told to watch the scope. The exposure phase 
then commenced, with each stimulus or stimulus 
pair being presented for 2 second per exposure, Pair- 
ing of ideographs and nonsense words was accom- 


n ideo- 
ler of 
mized 


plished by placing the transparent films containing 
the words over the opaque cards containin the 
ideographs and introducing them simultaneously into 
the istoscope; words always appeared c 

and directly beneath the ideographs. As the exper 
menter tripped the exposure, he pronounced the 
nonsense word, if any, taking extreme care to pro- 
nounce it as neutrally as possible. Verbal presentation 


of the nonsense word was included to make sure 
that the word did not go unregarded; earlier, 
Harrison (1967) found that subjects spend more time 
looking at the ideographs than the words. Inter- 
exposure interval was 2-3 seconds, 

After exposure, affective ratings were obtained 
following the techniques of Zajonc (1968). Half of 
the subjects rated on good-bad scales, the remainder 
rated on bad-good scales, Order of presentation 
during testing was randomized for each subject. The 
ideographs were presented alone for rating. 


Results 


Affective ratings were scored 0-6, with 6 
indicating highest “goodness of meaning.” 
Affective ratings as a function of frequency 
of exposure and treatment conditions appear 
in Figures 2 and 3. Significance was tested 
by means of an analysis of variance, At the 
outset, it is clear that independent of condi- 
tions, affective ratings were enhanced as a 
function of repeated stimulus exposures. 
Tested against Subjects x Frequency, this ef- 
fect was highly significant (F = 35.29, df= 
2/140, p < .01). However, also clearly, the 
conflict treatment had an effect (Figure 3). 
Ideographs in the consistent condition re- 
ceived the highest affective ratings; those in 
the control condition, intermediate affective 
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‘ratings; and those in the alternating and 
"change conditions, the lowest affective ratings. 
Tested against the Subjects X Conflict inter- 
action, this effect was significant at the 025 
level (/ 3.68, dj = 3/210) and closely ap- 
proached the .01 level. Using the Newman- 
Keuls method (Winer, 1962), post hoc com- 
parisons were made. With significance set at 
the .05 level, these comparisons showed that 
ideographs in the consistent condition re- 
ceived significantly higher ratings than those 
in either the alternating or change conditions. 
The Frequency X Conflict interaction did not 
approach significance. 


Discussion 


Results of Experiment III are consistent 
with those of Experiment I, despite the con- 
siderable differences in format. In both experi- 
ments, stimuli accompanied by consistent and 
unchanging contiguous associates received 
higher affective ratings than stimuli accom- 
panied by inconsistent or changing contiguous 
associates. In Experiment III, this was true 
when two types of procedures aimed at inhib- 
iting the reduction of response competition 
were tried. The inclusion of the control treat- 
ment in Experiment III also clarifies the 
results of Experiment I. 

Experiments I and III thus support the 
response competition hypothesis. However, 
the lack of a Frequency X Conflict interaction 
in Experiment III requires comment. We 
speculated that response competition is a 
function of (a) the relative strength of com- 
peting responses and (b) the number of re- 
sponse alternatives. Our procedures, unlike 
Matlin's (1970), left the subject relatively 
*passive" during exposure. Tt is thus unlikely 
that our procedures exerted appreciable in- 
fluence on the strength of the competing 
alternatives. On the other hand, since the 
contiguous stimuli in the change and alter- 
nating conditions should themselves elicit re- 
sponses, our procedures should have been 
quite congenial for increasing or restricting 
the total number of response alternatives. 
Indeed, a comparison of the means for the 
four conditions, independent of frequency, 
supports this interpretation. However, in Ex- 
periment III, affective ratings were enhanced 
over exposures for each condition. For the 
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INCREASING RESPONSE COMPETITION 


Fic. 3. Ratings of ideographs as a function of 
earlier pairing with consistent, alternating, or chang- 
ing nonsense words. 


change stimuli, this becomes a potential 
source of embarrassment. Clearly, the change 
stimuli presented 16 times received higher 
affective ratings than the change stimuli pre- 
sented 2 or 4 times. Yet, the change stimuli 
presented 16 times were presented with 16 
different associates; those presented twice, 
with only 2 different associates. Could we not 
expect favorability ratings of change stimuli 
to decrease over exposures? 

Earlier research (Harrison, 1968b; Har- 
rison & Zajonc, 1970; Matlin, 1970) indi- 
cated that subjects can and do form associa- 
tions to stimuli such as the words and ideo- 
graphs. Postexperimental interviews suggest 
that this was also true in the present experi- 
ments. Any response elicited by the changing 
nonsense words would be pitted against re- 
sponses elicited by the ideograph. But, while 
the responses to the ideograph would be elic- 
ited each time the ideograph was presented, 
responses to the ever-changing contiguously 
presented words would be summoned far less 
often. If, as this earlier research suggests, 
repetition of a response increases response 
strength, strong responses to the more fre- 
quently presented ideographs would be 
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pitted against weak responses to the less fre- 
quently presented nonsense words, and over 
exposure, the more frequently elicited response 
to the ideograph would gain in relative 
strength. In this way, the reduction of re- 
sponse competition would be slowed, but not 
stopped, by alternating or changing contigu- 
ous stimuli. On the other hand, in the con- 
sistent condition, the responses elicited by the 
first presentations of the paired ideographs 
and words would be elicited on each subse- 
quent exposure: new, weak associations are 
thus not later introduced. Here, right from 
the first exposure, we would expect the un- 
changing contiguous stimulus to provide con- 
sistent and reundant information, rather than 
to add to the confusion. Consistent contigu- 
ous associates could result in a narrowing 
down of the number of response alternatives 
from a large pool, which would satisfy the 
requirements of the ideograph, to a narrower 
pool, which would satisfy the joint require- 
. ments of both the ideograph and nonsense 
word, thereby reducing the range of responses 
and lowering response competition. 

Finding that contiguously presented stimuli 
merely help or hinder the reduction of re- 
sponse competition suggests how the response 
competition hypothesis can take into account 
the correlational evidence of the exposure- 
liking relationship. This correlational evi- 
dence suggests that things or events with a 
high probability of appearance in the envi- 
ronment receive more favorable affective 
ratings. Many of these “natural” stimuli must 
have appeared under conditions where con- 
tiguous stimuli varied highly, and yet 
the exposure-liking relationship prevailed. 
Changes in the natural contiguous stimuli, 
like changes in the contiguous stimuli in 
Experiment III, could also hinder, not stop, 
the reduction of response competition, 

In addition to the response competition 
interpretation of the exposure effect, at least 
three other interpretations have had some 
experimental testing. Suedfeld and Epstein 
(1970) have shown that attempts to create 
response sets can encourage or hamper the 
exposure effect. A possible implication is that 
the exposure effect reflects experimenter bias. 
But it is not clear how this explanation ac- 
counts for exposure results after attempts 
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are made to minimize set and make instruc- 
tions as neutral as possible, nor how the 
same set of instructions and rating proce- 
dures can produce different exposure fayor- 


ability relationships depending on su 
as the sequence of stimulus pr: 
(Berlyne, 1970; Harrison & Cr 
press). A second interpretation was < 
Burgess and Sales (1971), who sug; 


stimulus repetition leads to the form of 
associations between initially nov: inuli 
and elements associated with the experimental 
situation, elements which, for college sub- 
jects, have favorable affective conn: ns 
(e.g, “science,” “psychologist,” and ‘ ri- 
menter"). However, it is intriguing thal re- 
inspection of earlier response competition data 


(Harrison, 1968b; Harrison & Zajonc, 1°70) 
suggests that when subjects free associs'« to 
stimuli exposed in different frequencies, j 
almost never mention such things as sci 
psychologist, or experimenter, never mind with 
different probabilities as a function of s 

lus exposure. Both the Suedfeld and Epst: 
(1970) and Burgess and Sales (1971) intei 
pretations seem rather hard pressed to ac: 

for the growing amount of correlational, a 
mal, response latency, and the manipulat 
conflict data generated by the exposure | 
pothesis, and neither would seem to he 
important implications for the present patt: 
of results. 

Somewhat more problematical in the pre 
ent context is the Berlyne (1970) two-factor 
theory of the effects of exposure. The two 
factors are positive habituation and tedium. 
Both factors increase over exposure. Early in 
the course of exposure, positive habituation 
preponderates, leading to an enhancement of 
favorability as the organism becomes oriented 
to the stimulus. Later in the exposure se- 
quence, the tedium factor gains the upper 
hand, with the result that rated favorability 
declines. Several variables have been identi- 
fied that may affect the positive habituation 
and tedium factors (Berlyne, 1970; Harrison 
& Crandall, in press). One of these is 
initial stimulus complexity. Favorability en- 
hancement should be faster for relatively 
simple stimuli, If, within the limits of the 
present research, positive habituation oper- 
ated, the results of Experiment I are sup- 


FREQUENCY OF EXPOSURE ON AFFECTIVE RATINGS 


portive of the two-factor theory if the con- 
sistent names are defined as simple and the 


change names as complex. However, the re- 
sults ci Experiment III are less easily handled 
by i wo-factor theory. Complex stimuli 


are : defined as a total stimulus con- 
;, while simple stimuli are defined 


as a art of the configuration. According 
to t -finition, the consistent stimuli, which 
inv the ideograph, printed word, and 
spoken word should be more complex than the 
con stimuli which involved ideograph 
alone. However, our results indicated that 
the relatively complex consistent stimuli 
te: to receive higher ratings than the 
hypothetically easier to habituate to simple 
co stimuli. This difference did not attain 
sis: Dcance, however, and the relative com- 
ple of consistent, control, alternating, and 
chiese stimuli seems a likely subject for 

future debate. 

REFERENCES 
Beck sent, J. Cy Ja, Wrtson, W. R, & Baio, J. C. 
J effect of frequency of presentation on the 
of nonsense syllables, Journal of Psychology, 
i 56, 165-170. 

Ber D. E. Novelty, complexity and hedonic 


value, Perception and Psychophysics, 1970, 8, 279- 

Burcrss, T. D. G, & SALES, S. M. Attitudinal ef- 

jecis of “mere exposure”: A reevaluation. 

(Carnegie Mellon University Rep. 70-28). Pitts- 

burgh, Pa.: Carnegie-Mellon University, 1971. 

Frcnwex, G. T. Vorschule der Aesthetik. Leipzig: 

Breitkopf & Hártel, 1876. 

Fuwckr, G., & Jouwsow, R. C. Word value and 
word frequency in homophone pairs. Psychological 
Reports, 1960, 7, 470. 

Harrison, A. A, Response competition and attitude 
change as a function of repeated stimulus expo- 
sures, Unpublished doctoral dissertation, University 
of Michigan, Ann Arbor, 1967. i 

Harrison, A. A, Exposure, favorability and item 
endorsement, Psychological Reports, 1968, 23, 
1970. (a) 

Harrison, A. A. Response competition, frequency, 
exploratory behavior, and liking. Journal of Per- 
sonality and Social Psychology, 1968, 9, 363-368. 
(b) 

Harrison, A. A, Exposure and popularity. Journal of 
Personality, 1969, 37, 389-377. : 

Harrison, A. A. & CRANDALL, R. Heterogeniety- 
homogeneity of exposure sequence and the atti- 


nt 


tudinal effects of exposure. Journal of Person- 
ality and Social Psychology, in press. 

Harrison, A. A., & Hines, P. The effects of fre- 
quency of exposure at three short exposure times 
on affective ratings and exploratory behavior. 
Proceedings of the 78th Annual Convention of the 
American Psychological Association, 1970, 4, 391- 
392. (Summary) 


Harrison, A. A, & Zayonc, R. B. The effects of ' 


frequency and duration of exposure on response 


competition and affective ratings. Journal of Psy- — 


chology, 1970, 75, 163-169. 

Jouwsow, R. C. THomson, C. W., & Fricke, G. 
Word values, word frequency, and visual duration 
thresholds, Psychological Review, 1960, 67, 332— 
342. 

Krucman, H. E. Affective response to music as a 
function of familiarity. Journal of Abnormal and 
Social Psychology, 1943, 38, 338-392. 

Mastow, A. H. The influence of familiarization on 
preference. Jourmal of Experimental Psychology, 
1937, 21, 162-180. 

Martin, M. W. Response competition as a mediating 
factor in the frequency-affect relationship. Journal 
of Personality and Social Psychology, 1970, 16, 
536-552. 

Moore, H. T, & Grtumanp, A. R. The immediate 
and long term eífects of classical and popular 
phonograph selections. Journal of Applied Psychol- 
ogy, 1924, 8, 309-323. 

Mutt, H. K. The effect of repetition upon enjoyment 
of modern music. Journal of Psychology, 1957, 
43, 155-162. 

Pepper, S. C. Changes of appreciation for color 
combinations. Psychological Review, 1919, 26, 
389-396. 

SoroMow, R. L., & Postman, L. Frequency of usage 
as a determinant of recognition thresholds for 
words, Journal of Experimental Psychology, 1952, 
43, 195-201. 

Surpretp, P., & Ersrrmw, Y. M. “Mere exposure"— 
Plus set. Paper presented at the meeting of the 
American Psychological Association, Miami Beach, 
September 1970. 

Tuomson, C. Boy or girl? Names jor every child. 
New York: Arc Books, 1961. 

Verveer, E. M., Barry, H., JR, & BOUSFIELD, W. A. 
Change in affectivity with repetition. American 
Journal oj Psychology, 1933, 45, 130-134. 

Wuson, L. R., & BEcKNELL, J. C. The relation be- 
tween the association value, pronounceability, and 
affectivity of nonsense syllables. Journal of Psy- 
chology, 1961, 52, 47-49. 

Winer, B. J. Statistical principles in experimental 
design. New York: McGraw-Hill, 1962. 

Zayonc, R. B. Attitudinal effects of mere exposure. 
Journal of Personality and Social Psychology 
Monograph Supplement, 1968, 9(2, Pt. 2), 1-27. 


(Received October 15, 1970) 


ee 


Personality and Social Prychology 


Journal o d 
d 20, No. 1, 112-117 


1971, Vo 


NONVERBAL MAINTENANCE OF CONDITIONED VERBAL 
BEHAVIOR FOLLOWING “INTERPRETATIONS,” 
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The differential conditioning effects of 
firmatory words, interpretations, and 


stimulus (600-cycles-per-second tone) to acquire secondary reinforcing properties 
in a verbal operant conditioning paradigm were studied using 64 college students. 


A control group established the tone as 


experimental groups were reinforced for critical responses with one of the three 
verbal reinforcers paired with the tone. 


continued to receive the tone alone for 


occurred with each of the three types of verbal reinforcers, but only the experimental 
groups that received the tone during extinction maintained the frequency of the 
previously conditioned response, thereby establishing the tone as a secondary 


reinforcer. 


In recent years, the verbal operant condition- 
ing paradigm has been increasingly used to in- 
vestigate varied aspects of verbal behavior, as 
well as employing it as a model for the study 
of psychotherapy process variables. One of 
the important parameters of learning that has 
received little attention in verbal conditioning 
studies, however, is that of higher order con- 
ditioning, or what might be comparable to 
secondary reinforcement. Using hospitalized 
psychiatric patients as subjects, Timmons 
and Noblin (1962) paired a 600-cycles-per- 
second (cps) tone, shown to have been a 
previously neutral stimulus, with mild affirma- 
tory words to study the effect of “secondary 
reinforcement” in a verbal conditioning frame- 
work, The tone was found, after pairing, to 
have acquired sufficient reinforcing properties 
to meet the two basic criteria of a secondary 
reinforcer; that is, the tone was capable of 
maintaining the strength of a previously con- 
ditioned response and of reinforcing new 
learning. Levin and Sterner (1966) found, 
with similar subjects, that when a pencil tap 
was paired with “right” during a paired- 
associates learning task, the tap functioned as 
a reinforcer on a subsequent verbal condition- 
ing task. 

! Requests for reprints should be sent to Thomas D. 


Kennedy, Department of Psychology, Arizona State 
University, Tempe, Arizona 85281. 
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three types of verbal reinforcers (mild af- 
reflections) and the ability of a neutral 


a neutral stimulus. During acquisition, six 


During extinction, three of these groups 
critical responses. Significant conditioning 


Evidence that verbal behavior in therapy 
or quasi-therapy settings can be modified has 
led to experimental investigations of clinical 
phenomena and to attempts to explain certain 
processes occurring in psychotherapy (Kanfer, 
1968; Krasner & Ullmann, 1965). While most 
verbal conditioning studies have investigated 
the effects of mild affirmatory words (social 
reinforcers), such as “good,” “fine,” or “mm- 
hmm” on verbal behavior, others have been 
concerned with more complex or theoretically 
oriented interventions. For example, Kanfer, 
Phillips, Matarazzo, and Saslow (1960) used 
psychoanalytic-type interpretations as a rein- 
forcing stimulus and found that such interven- 
tions produced a reduction in length of speech 
with a subsequent elevation after they 
had been terminated. Noblin, Timmons, and 
Reynard (1963) and Timmons, Noblin, Adams, 
and Butler (1961), using hospitalized patients 
as subjects, and Adams, Butler, and Noblin 
(1962), using college students as subjects, 
demonstrated that psychoanalytically derived 
interpretations may be used as reinforcers in a 
verbal conditioning paradigm. In each of these 
studies, however, interpretations were less 
effective than mild affirmatory words in in- 
creasing the frequency of the selected response 
class. Employing a Taffel (1955) procedure, 
Timmons, Noblin, and Reynard (1964) sig- 
nificantly conditioned pronoun responses of 
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college dents when non-directive-type re- 
flecti« sr paraphrases) were used as reinforc- 
ing s . Reflective-type reinforcers have 
effective in conditioning affective self- 
in quasi-therapy situations (Mer- 
1963; Merbaum & Southwell, 1965). In 
these studies it was found that reflec- 
iphrases) were even more effective 
tioning affective self-references than 


also I 
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mild :atory words or echoic responses. 
One purpose of the present study was to 
compare the differential reinforcing effects of 
psychoanalytic-type interpretations, of re- 
flections of a nondirective nature, and of mild 
affirmatory statements in raising the level of a 
selected response class during acquisition. A 
second aim was to determine whether the 
phenomenon of higher order conditioning, or 


sight be comparable to secondary rein- 
ont, can be demonstrated within the 
conditioning paradigm using college 
s as subjects. 
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METHOD 
Subjects 


The subjects were 64 college student volunteers 
attending undergraduate psychology courses. Each of 
two control and six experimental groups consisted of 
8 subjects. To insure that all groups would be reasonably 
matched at the operant level, the subjects were as- 
signed to groups on the basis of their performance 
during the nonreinforced operant phase (first 30 trials). 


Apparatus 


The experimental room was arranged to eliminate 
the possibility of extraneous reinforcing cues from 
the experimenter (such as nods, smiles, etc.) by having 
the subject sit in a chair with the experimenter behind 
him and slightly to his left. A Heathkit signal generator 
(Model AG-10) tuned to 600 cps and a loudspeaker 
used in presenting the tones were located out of the 
subject's sight. All tones were presented at a level of 
45 decibels. : 
A modified Taffel (1955) procedure was used in 
presenting the stimulus materials. These materials 
were one hundred and twenty 5 X 8 inch cards with 
two pronouns printed at the top of each card—one first 
person (I or we) and one second or third person (you, 
he, she, or they). Below the pronouns was a sentence 
fragment which, when preceded by either of the pro- 
nouns, made a complete sentence. The pronouns were 
randomized as to position over the entire sertes of cards, 
and no two sentence fragments were the same. The 
cards were handed one at a time in the same order to 
all subjects; after reading the sentence aloud, the sub- 
ject returned the card to the experimenter. 
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TABLE 1 


EMPLOYMENT OF THE THREE VERBAL REINFORCERÉ 
AND THE 600-CPS Toxe ror CurrICAL Re 
SPONSES DURING ACQUISITION AND 


EXTINCTION 

Group Acquisition Extinction 
Nothing Nothing 
Tone only Nothing 
Tone + mild affirmatory words | Nothing 

"Tone + mild affirmatory words | Tone 
| Tone + interpretations Nothing 

| Tone + interpretations Tone 
Tone + reflections Nothing 

Tone + reflections Tone 


Note,—n = B. 
3 Control groups; all others are experimental groups. 


Procedure 


Each subject, seen individually, received 120 trials. 
The instructions to the subject, and the questions for 
assessing awareness following the experiment, were 
essentially the same as those reported by Noblin, 
‘Timmons, and Kael (1966). First-person pronouns 
(I or we) were designated as the critical (reinforced) 
response class, and the remaining pronouns (you, he, 
she, they) constituted the nonreinforced response class. 
Each subject's operant level for selecting the first- 
person pronoun was established on Trials 1-30, during 
which the experimenter remained silent. The next 60 
cards (Trials 31-90) constituted the acquisition phase, 
and the final 30 cards (Trials 91-120), the extinction 
phase. A 90% variable ratio schedule of reinforcement 
was employed to minimize the possibility of the sub- 
jects verbalizing the reinforcement contingency. 

Types of verbal reinforcement administered and 
uses of the 600-cps tone during acquisition and ex- 
tinction for each group are shown in Table 1. As can 
be seen in Table 1, neither of the two control groups 
(N-N and NT-N) received any type of verbal reinforce- 
ment throughout the experiment. However, Group 
NT-N received the 600-cps tone during acquisition 
when a first-person pronoun was selected. Each of the 
experimental groups (word, interpretation, reflection) 
received the appropriate verbal reinforcement paired 
with the tone during the acquisition phase following a 
critical response. For example, if a subject in either of 
the mild affirmatory word groups (WT-N or WT-T) 
chose a first-person pronoun, the experimenter presented 
the tone for approximately 1 second, followed im- 
mediately by a mild affirmatory word such as good, 
fine, or mm-hmm. The interpretation or reflection 
used as a verbal reinforcer on any given trial during 
acquisition was typed on the back of the stimulus card. 
Thus, if a subject in an interpretation group (IT-N 
or IT-T) chose "T" rather than “she” to complete the 
sentence * — bid too high for such a prize," the ex- 
perimenter responded by sounding the tone and saying 
“You show a basic mistrust and hostility toward 
others." For this same sentence stem, if a subject in a 
reflection group chose “I,” the experimenter responded: 
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Fic. 1. Performance curves of the two control groups 
and performance curves of the two experimental groups 
reinforced with mild affirmatory words on the acquisi- 
tion trials. 


“You feel you offered more than it was worth," follow- 
ing the presentation of the tone. 

Table 1 also shows that during extinction, one of the 
two experimental groups within each reinforcement 
category (i.e, WT-N, IT-N, and RT-N) was ex- 
tinguished in the normal manner. That is, these groups 
received no reinforcement during the last 30 trials. ‘The 
other experimental group within each reinforcement 
category (ie, WT-T, IT-T, and RT-T) continued to 
receive the 600-cps tone when a critical pronoun was 
selected during the extinction trials, 


RESULTS 


Three subjects, each from a different group, 
who correctly verbalized the contingency of 
reinforcement, were not included in the data 
analysis and were replaced by other volunteers 
in order to maintain a balance of 8 subjects 
per group. Of the 64 subjects included in the 
following data analysis, none had previously 
participated in a verbal conditioning study, nor 
did any of them correctly verbalize the experi- 
mental principle in response to the brief 
questionnaire used. Figures 1-3 present the 
mean percentage of first-person pronouns 
selected during the experiment by the various 
groups. On all figures; the 60 acquisition trials 
are plotted in two blocks of 30 trials each, in 
order to depict more clearly the course of the 
dependent variable during the acquisition 
phase and to provide an equal base line of 30 
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trials for each division analyze 

the statistical analyses presented in the follow- 
ing discussion are based on the last 20 acquisi- 
tion trials (unless otherwise indicate’), since 
this block of trials reflects the fine! level of 


conditioning. All ordinates on th« res are 


lowever, 


plotted according to the same scale for com- 
parative purposes, and all statisti: sts are 
from Siegel (1956). 

The two control groups (N-N, NT-N) 
are presented in Figure 1. Inspection of the 
figure clearly indicates that both groups failed 
to condition. A Friedman two-way a sis of 
variance by ranks for each group reves «d that 
there was no significant overall effect ( .20) 
for either group throughout the experiment. 
Further, Mann-Whitney U tests indicated that 
these two groups did not differ significantly 
from each other during any phase, Thus, 
Group NT-N, which received onl; 


600-cps tone for critical responses 
acquisition, demonstrated that the to 
was not capable of inducing condit: 
establishing the tone as a neutral stimulus. 

The two “word” groups (WT-N, WT-T) 


receiving mild affirmatory words during ac- 
quisition are also shown in Figure 1. An in- 


crease in frequency of the dependent variable 
was displayed by both groups during acquisi- 
tion. Figure 1 also shows that the performance 
of Group WT-N decreased to the operant 
level during extinction. A Friedman analysis of 
variance of the data obtained from this group 
indicated that the overall effect was significant 
beyond the .01 level of probability, A series of 
Wilcoxon matched-pairs signed-ranks tests 
was employed in analyzing the origin of effect 
between operant, acquisition, and extinction 
phases. This analysis revealed that the per 
centage of first-person pronouns selected during 
acquisition was significantly greater than was 
selected during the operant phase (p < .05), 
and that response frequency during extinction 
was significantly lower than during acquisition 
(p < .05). Figure 1 further shows that Group 
WT-T, which continued to receive the tone 
during the final 30 trials, failed to show an ex- 
tinction effect. Friedman’s analysis of variance 
revealed a significant overall effect (p < .01) 
for this group. Although a Wilcoxon analysis 
indicated that the frequency of critical re- 
sponses during acquisition was greater than 
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during the operant period at a probability level 
beyond .01, no difference was found between 
acquisit and extinction trials (p > .05). A 
signifi: ference (p < .05) was obtained 
between the operant and extinction phases, 
indica ihe continuing high proportion of 
critic ponses produced during the extinc- 
tion Mann-Whitney U tests showed 
that < two word groups did not differ from 
each luring the operant period nor during 
acquisition. However, a direct comparison of 
the vs during extinction reveals the effect 
of mdary reinforcement," in that the 
tor ne maintained the frequency of the 
pre iv conditioned response in Group 
wI ¿a level significantly higher (p < .01) 
than that of Group WT-N. 

performance of the two interpretation 
gi (IT-N, IT-T) is presented in Figure 2. 
Both control groups were combined and are 
shown in Figures 2 and 3 for visual comparison. 
As can be seen in Figure 2, both experimental 
groups displayed an increase in frequency of 
the dependent variable during acquisition. 
Analysis of variance showed an overall 


effect 
and 
(p 
opera 
(p 

157 
70 
55 


50 


55 


MEAN PERCENTAGE CRITICAL RESPONSES 


Fic 


beyond the .05 level for Group IT-N, 
Wilcoxon analyses revealed an increase 
_05) in critical response frequency from 
nt to acquisition phase, and a decrease 
.05) from acquisition to extinction. 


à T-N 
à it-T 
@ CONTROLS 
1-30 31-60 61-90 ~ 94-120 _ 
OPERANT T ACQUISITION EXTINCTION 


. 2. Performance curves of the two experimental 


groups reinforced with psychoanalytic-type interpreta- 


tions on the acquisition trials. 


group 


(The combined control 
is alsozshown.) 
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Fic, 3. Performance curves of the two experimental 
groups reinforced with non-directive-type reflections on 
the acquisition trials. (The combined control group — 
is also shown.) 


Figure 2 shows that as before, the response 
level was maintained in the interpretation — 
group (IT-T) which continued to receive the | 
tone during extinction. Though there was no el 
significant overall effect (p > .05) for Group — 
IT-T, the use of Wilcoxon tests showed that | 
a significant difference (p < .05) was obtained 
between operant level and acquisition, and no 
difference was found between acquisition and 
extinction (p > .05). A significant difference 
at the .05 level emerged between the operant 
and extinction phases. Mann-Whitney U tests 
indicated that these two interpretation groups 
did not differ significantly in the number of 
critical responses emitted during the operant 
phase or during acquisition. Although Group 
IT-T failed to extinguish, there was no sig- | 
nificant difference in extinction (p> -10) 
when the last block of 30 trials was compared 
between the two groups. As a result, the last 
10 trials in each group were compared for a final 
extinction effect. The result was significant 
beyond the .05 level of probability, again 
demonstrating the ability of the tone alone to 
maintain the frequency of the previously 
conditioned response. 

Figure 3 shows that both reflection groups 
(RT-N, RT-T) demonstrated a pronounced 
conditioning effect during acquisition, and 
that Group RT-N extinguished in the usual 
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manner, while Group RT-T, which received 
the tone during extinction, maintained the 
previously conditioned response, Analysis re- 
vealed a significant overall effect (p < .01) 
for Group RT-N. Further analyses indicated 
a significant increase (p < .01) in critical 
response frequency from operant to acquisi- 
tion and a decrease (p < .01) from acquisition 
to extinction. The values obtained for Group 
RT-T were the same as for Group RT-N, 
except for the lack of difference between ac- 
quisition and extinction (p > .05) and the 
subsequent difference between the operant and 
extinction periods (p < .01). Mann-Whitney 
U tests indicated that the two reflection groups 
did not differ significantly during the operant 
period or during acquisition, but the effect of 
the 600-cps tone in maintaining the previously 
conditioned response was again clearly demon- 
strated in extinction, where the difference be- 
tween the groups was significant beyond the 
01 level. 

Although inspection of Figures 1-3 reveals 
that the reflection and word groups yielded a 
considerably higher frequency of first-person 
pronouns during Trials 61-90 of the acquisition 
phase than did the interpretation groups, 
these differences were not significant. A 
Kruskal-Wallis one-way analysis of variance 
yielded a probability greater than .20. In 
addition, U tests were used to compare all of 
the experimental groups, two at a time. A 
significant difference (p < .05) was obtained 
between Group IT-N and Group RT-T; all 
other comparisons failed to yield significance, 


Discussion 


Results of the present study clearly support 
the proposition that frequency of occurrence 
of selected responses in an operant condition- 
ing paradigm may be increased through the 
use of the three types of verbal reinforcers 
employed, and that when any of these rein- 
forcers are paired with a previously neutral 
stimulus, higher order conditioning, or what 
might be comparable to secondary reinforce- 
ment, may be demonstrated. 

With respect to the relative efficacy of the 
different type verbal reinforcers in raising the 
frequency of the selected response class, 
psychoanalytic-type interpretations were found 
to be less effective (though not significantly 
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less effective) than mild affirmatory words or 
reflections. This finding is in agreement with 
other investigations where the comparative 
effects of psychoanalytically derived inter- 
pretations and mild affirmatory words were 
tested (Adams et al., 1962; Timmons et al, 
1961). Adams et al. found that when response 
latency was measured, the latency of the sub- 
jects’ response was significantly decreased by 
mild affirmatory words, but significantly in- 


creased by interpretations during acquisition, 
with a corresponding decrease after the 
interpretations were discontinued during ex- 
tinction. In attempting to explain the in- 
creased latency, these authors hypothesized 
that interpretive statements may serve as a 
class of noxious stimuli and, hence, may have 
an effect similar to that of mild punishment. 
With psychiatric patients, Timmons ct al. 
(1961) found that interpretations raised the 
frequency of the selected response class less 
rapidly than mild affirmatory words, but the 
frequency continued to increase slightly, even 
when the interpretations were terminated 
during the extinction period. These investiga- 
tors suggested that mild affirmatory words and 
interpretations do not appear to operate ac- 
cording to the same principles. That is, mild 
affrmatory words used as verbal reinforcers 
can be considered discriminative stimuli—cues 
which, in the past, have been associated with 
reinforcement. Thus, the use of these words 
by the experimenter as a reinforcer carry an 
inherent meaning of approval or favorable im- 
pression, On the other hand, Timmons et al. 
(1961) argued that if the interpretations they 
used were somewhat “threatening” or “anxiety 
provoking,” as they seemed to be, then it is 
possible that a summation of the positive 
aspects of obtaining any response from the 
experimenter, combined with the mild punish- 
ing aspects of the interpretation, produced the 
comparatively less effective conditioning. 

On a qualitative level, Adams et al. (1962) 
noted that subjects reinforced with interpre- 
tive statements often became emotional; that 
is, they seemed annoyed and anxious. Similar 
behavior was observed in the present experi- 
ment. Consequently, each subject in the two 
interpretation groups was “debriefed” in an 
interview following the awareness question- 
naire in order to allay any anxiety that may 


„roused by the use of the interpreta- 
e 16 subjects in these two groups, 
pressed surprise at the “accuracy” 
oretations, and some expressed dis- 
vith them and/or relief that they had 
been discontinued (in extinction). These sub- 
jects also ppeared to become more personally 
or emotionally involved in the task than the 
subjects in the other experimental groups. 
Thus, additional evidence was produced, lend- 
ing support to the tentative hypothesis that 
interpretive statements may serve as a class 
of mildly noxious stimuli. By contrast, sub- 
jects reinforced with mild affirmatory words 
or reflections seemed to be relaxed, less tense, 
and less threatened. Since the reflections were 
as efficacious and 4nonthreatening" as the 
— mild affirmatory words, it appears that they 
T too can be considered discriminative stimuli, 
possibly expressing à sense of acceptance, 
agreement, and understanding. According to 
Merbaum and Southwell (1965), the para- 
phrase intervention (similar to the type of 
reflection used in the present study) adds an 
additional aspect to the verbal interaction, 
“a different emotional word, thus providing 
discriminative stimuli for the emission of re- 
lated verbal associations [ p. 186].” 

The other issue for the present study was 
that of higher order conditioning or secondary 
reinforcement. It was demonstrated that a 
previously neutral tone, after being paired 
with different type verbal reinforcers, ac- 
quired sufficient reinforcing properties to act 
as a “secondary reinforcer” in maintaining the 
level of a previously conditioned response. The 
secondary reinforcing effect obtained in the 
present study lends support to this phe- 
nomenon reported earlier by Timmons and 
Noblin (1962), who suggested that verbal 
reinforcers are very potent cues for modifying 
the verbal behavior of humans and, in fact, 
appear to be powerful enough to serve for 
people the role that food serves in animal ex- 
periments designed to demonstrate secondary 
reinforcement. : 

The findings of the present investigation 
point to the utility and versatility of the verbal 
conditioning paradigm for investigating param- 
eters of human learning and for scientifically 
evaluating the nature and effect of verbal 
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techniques commonly used in psychothera- 
peutic relationships. « 
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ATTITUDE TOWARD TROOP WITHDRAWAL FROM 
INDOCHINA AS A FUNCTION OF DRAFT NUMBE! 


DISSONANCE OR SELF-INTEREST? * 


WILLIAM W. NOTZ, BARRY M. STAW, and THOMAS D. COOK ° 


Northwestern University 


In December 1969, the Selective Service System informed eligible men with 
draft numbers between 1 and 122 that they would almost certainly be drafted: 
men with draft numbers between 123 and 244 were told that they might be 
drafted; and men with numbers between 245 and 366 were told they almost 
certainly would not be drafted. More draft-eligible men in this last category 
advocated total troop withdrawal from Indochina. This relationship was ob- 
tained in two experiments and for three different dates by which withdrawal 
should take place. However, the draft lottery was not random, and higher 
draft numbers were assigned to persons born in the first half of the year. In the 
second experiment reported here, the increased tendency to advocate total 
withdrawal was found only among draft-eligible men with Numbers 245-366 
and with birthdates in the first half of the year. Dissonance theory seems to 
be one plausible explanation for the results: being assigned a draft number 
between 245 and 366 was conceptualized as a reward (gaining freedom from 
the draft), and being assigned this reward by chance or because of a faulty 
randomization procedure was conceptualized as insufficient justification for 
the reward. Such dissonance could be reduced by advocating an end to the 


Indochina war and thereby freeing more persons from the draft. 


On December 1, 1969, the Selective Service 
System introduced a random draft lottery to 
determine which draft-eligible males should 
serve in the armed forces. Slips of paper bear- 
ing the dates of the leap year were drawn 
from an urn, and the first date selected was 
assigned a “1,” the second date was assigned 
a “2,” and so on to 366. Thus, each birthdate 
corresponded with a randomly chosen draft 
number. 

The Selective Service System divided these 
draft numbers into three major categories. 
Males with Numbers 1-122 were told that 
they would certainly be drafted if they 
became eligible for the forces, males with 
Numbers 123-244 were told that they might 


1 The ordering of authors is due to chance and 
does not reflect differences in contribution, The study 
was made possible by a grant from the Research 
Council of Northwestern University to the third 
author, We should especially like to extend our 
thanks to Harold P. Welsch for his help in data 
collection and tabulation and to Richard Bootzin 
and Philip Brickman for their comments on an 
earlier draft of this manuscript. 

? Requests for reprints should be sent to Thomas 
D. Cook, Department of Psychology, Northwestern 
University, Evanston, Illinois 60201. 
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be drafted, while the remainder were to 
it was unlikely that they would be 
However, by June 1970, the time 
present studies, Draft Number 170 h 
ready been called, and it was widely believed 
that most numbers would be called by De- 
cember. Hence, the official tripartite draft 
Classification was probably less psych 
cally meaningful than a bipartite classific: 

tion based on a cutting point at about Draft 
Number 244. A study by McArthur (1970) 
supports this, for she found that the percep- 
tion of external fate control was highest 
among draft-eligible men with Numbers 245- 
366 and that it did not differ between men 
with Numbers 1—122 and 123-244, 

It is not clear how draft categories should 
affect behavior. One relationship is suggested 
by the following quotation from the New 
York Times of December 8, 1969, A young 
man was reported as saying: “It’s going to 
radicalize those people from 1 to 122 and I 
suppose it will make the ones in the (highest) 
third more complacent [p. 51]. A self- 
interest notion is implicit in these comments. 
It is implied that men in the categories most 
likely to enter the forces would increase their 


Aw 


¥ 


w^ E "n 


* efforts to avoid being drafted and that draft- 
ye gible persons in the safe 245—366 category 
would be less interested in changing foreign 
military policies. This implies that persons 
with Draft Numbers 1-244 (or 1-122) should 
have a greater self-interest in advocating 
troop withdrawal from Indochina than should 
persons with Numbers 245—366. 
same New York Times article (De- 
r $, 1969) indirectly indicates a quite 
different reaction to learning one's draft 
€ yry. One interviewee with Draft Number 
3 was reported as saying “I definitely 


d want to go into the Army, but I don’t 
fe ood at all about beating the system 
[». 1]. Why should he feel badly? One 
possibility is that he had just obtained a large 
reward (virtual exemption from military 
service) without having justified it. The 


reward was an accident of his date of birth, 
ar! was not the result either of his labors 
(as it is with some conscientious objector 
is), his special talents (as with some 
essional deferments), or his obvious in- 
ity to serve (as with medical defer- 
ts). Insufficiently justified rewards are 
upposed to arouse dissonance (Carlsmith, 
C ; & Helmreich, 1966; Cook, 1969; 
Festinger & Carlsmith, 1959), and the dis- 
sonance aroused by gaining an unjustified 
freedom from the draft could be reduced by 
extending this same freedom to more persons. 
One way of doing this is to advocate the with- 
drawal of greater numbers of troops from 
Indochina, thereby reducing the overall need 
ior United States forces. According to this 
extrapolation from dissonance theory, draít- 
eligible males with Numbers 245-366 should 
advocate greater withdrawal from Indochina 
than should draft-eligible males with Numbers 
1-244, This prediction directly conflicts with 
the self-interest prediction. 


METHOD 
Subjects 
The respondents for these experiments were male 
students of United States nationality, enrolled at 


Northwestern University. The first experiment was 
conducted in the Technological Institute, where the 
large majority of students are majors in natural 
sciences, engineering, or business; and the second 
experiment was conducted in buildings that are 
occupied by the social science and liberal arts depart- 
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ments. The second experiment took place 3 days — 
after the first, and after the data from the first | 
experiment had been provisionally analyzed. 

In both experiments, a booth was set up, and 
copies of a petition addressed to President Nixon 
were placed upon it. The petition was in three brief 
parts. The introduction, which was slightly modified 
for Experiment II, read: 


Dear President Nixon, 


You often appear to hold stereotypic views of 
university students. However, in actuality, all 
students are not alike, and many views are repre- 
sented on campus. For this reason, I want you to 
know exactly how I feel, as am individual, con- 
cerning the 435,000 American troops presently in 
Indochina. L 


The second part permitted the respondents to indicate 
their attitudes about the size of desired troop in- 
creases and decreases in Indochina (see “Dependent 
Variables”). The final part asked for a signature 
and demographic information—sex, date of birth, 
class, major, and draft status. 

In Experiment I, there were three persons at the 
booth, and each of them approached passersby and 
asked them to sign the petition. In Experiment I, 
booths were set up at three locations, two persons 
served at each booth, and these persons solicited 
petition signers. Males were oversolicited relative to 
females, but females were also approached so that 
the purpose of the study would not seem obvious. 
Both experiments took place less than 2 weeks after 
the National Student Strike, when demonstrations, 
rallies, and petition signing were still relatively fre- 
quent on campus. Nothing indicates that the col- 
lection of signatures was seen as anything other than 
a strike-associated activity, Furthermore, the persons 
soliciting petition signatures could not have known 
the draft numbers of passersby, and so experimenter 
expectancies did not affect the data (Rosenthal, 
1966). 


Independent Variable 


The major independent variable was draft cate- 
gory, and its three levels were determined by whether 
a respondent’s draft number fell into Category 1-122, 
123-244, or 245-366. Draft numbers were calculated 
by taking each respondent's reported date of birth 
and then simply looking up the corresponding draft 
number as published by the Selective Service System. 

A second independent variable was deferment 
classification. Persons who were 1-A (fit for duty) 
and 2-S (educational deferment) were considered 
potentially or actually draft eligible, while persons 
with no draft status or with status other than 1-A 
or 2-S were considered draft ineligible. 

The last independent variable was the date by 
which the percentage of troops in Indochina should 


3 Persons with a 2-S deferment were given a draft 
number in 1969 even though they would not be 
eligible for the forces until they left school. Thus, 


be increased or decreased, In Experiment I, the 
respondents indicated the percentage of increase or 
decrease that there should be at a date 6 months 
from signing the petition, while in Experiment II, 
the respondents indicated the increase or decrease 
that there should be at a date both 1 month and 
12 months from the time of signing the petition. 
The date of withdrawal was, of course, confounded 
with subject differences, in that the 6-month question 
was asked only of the sample in the Technological 
Institute. 


Dependent Variables 


One dependent variable was the frequency of 
petition signing. If this differed between draft cate- 
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gories, it would indicate that draft ries af- 
fected the extent to which responden! ere inter. 
ested in advocating their views about t strength 
in Indochina. Of course, this conclusion id only 
be warranted when it was established : the data 
on frequency of petition signing did r have the 
same distribution as the population fr which the 
sample was drawn. 

The second dependent variable was eported 
percentage of troops that should be wit! n from 
Indochina or that should be added to the Indochina 
forces by a given date. Respondents in F riment I 
read, after the brief introduction addressed to 


President Nixon: 


In my opinion, the number of American military personnel in 
Indochina (including ground troops, advisors, logistics and 


support) should be: 
Ds] increased 
[] decreased 


by. — percent, between now and December 31, 1970. 


(Insert number) 


The dependent variable was somewhat different in 
Experiment II, where respondents read, again after the 


100% between now and June 30, 1971 


introduction : 
In my opinion, during the following two time periods, the number 
of American military personnel in Indochina (including ground 
troops, advisors, logistics and support) should be: 
LI increased 
[C] decreased 
by 0% 20% 40% 60% 80% 
(please circle one) 
and, more immediately, 
by 0% 20% 40% 60% 80% 


(please circle one) 


This dependent variable would be difficult to inter- 
pret if there were a difference in the number of 
petition signers in each draft category, since it 
would be unclear whether differences were due to 
draft category or to category-related differences in 


their number would only become relevant to induc- 
tion during the future year when they lost their 
deferments. The situation was slightly different for 
1-A students, who presumably knew that they would 
be eligible for the army in 1970, Vet, since 2-S 
students’ best estimate of future draft calls was 
presumably present draft calls, and since 2-S and 
1-A students did not differ on any of the measures 
to be reported, 1-A and 2-S were considered a single 
classification of persons who knew that they were 
not permanently ineligible for the draft. 


100% between now and June 30, 1970 


the kind of persons who signed the petition. The 
variable is more interpretable, of course, if the 
number of signees is equal across draft categories. 


Final Sample for Data Analysis 


The number of petitions returned in Experiment I 
was 300. Forty-one were returned by females or 
foreigners; the draft status was not ascertained for 
another 20; and there were no answers to the troop 
question for another 7. The final sample in Experi- 
ment I consisted of 108 draít eligibles and 124 draft 
ineligibles. 

In Experiment II, the number of petitions returned 
was 880; females and foreigners returned 193; a 
further 66 had incomplete or unintelligible responses 
to the questions about birthdate or draft status, 


ít eligibles and 9 draft ineligibles did not 
either of the troop questions. The final 

the 1-year question consisted of 283 draft 
eligibles 2 draft ineligibles, Since some persons 
comp! only the 1-year question, the final sample 
for t month question consisted of 274 draft 
eligib! vd 314 draft ineligibles. 
The from all the petitions were transcribed 

cd, and all the petitions plus a summary 
uits were forwarded to President Nixon. 


while 7 d 
comple! 
samp: 


RESULTS 


Few respondents wanted to freeze or in- 
crease the number of troops in Indochina. 
In Experiment I, 7 draft eligibles (6.5%) 
and 6 draft ineligibles (4.8%) wanted no 
change or more troops by the 6-month dead- 
“line. In Experiment II, 4 draft eligibles 
(14%) and 4 draft ineligibles (1.2%) 
wanted no change or more troops within a 
year, while 20 draft eligibles (7.39%) and 16 
draft ineligibles (5.1%) wanted no change or 
more troops within 1 month. Since these 
percentages are low, it is clear that any find- 
ing cannot be generalized beyond a very 
dovish sample. 


Frequency of Petition Signing 


Chi-square analysis showed that the fre- 
quency of petition signing did not differ by 
deferment classification, draft category, or by 
experiment. The only relationship to approach 
significance was that 1-A persons with Num- 
bers 1-122 tended to sign less than 1-A per- 
sons with Numbers 123-244 or 245-366 
(uncorrected y2= 5.31, df= 2, p« 10). 
This held only for Experiment II. The rela- 
tionship may have resulted from a higher rate 
of induction into the armed forces among 1-A 
persons with Numbers 1-122, a lower rate 
of petition signing among these persons, OF it 
may be a chance phenomenon resulting from 
multiple analyses of the data. 

Percentage of advocated troop withdrawal. 
The number of males is rectangularly dis- 
tributed across draft numbers and across cate- 
gories based on draft numbers. Furthermore, 
since the modal response for all three ques- 
tions was to advocate 100% withdrawal, a J 
curve best describes the relationship of the 
number of respondents to the percentage of 
advocated withdrawal. The steepness of the 
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slope varied with the date by which troops 
should be withdrawn. The slope was least 
steep for the 1-month question (where about 
40% wanted total withdrawal) and was most 
steep for the 1-year question (where over 
80% wanted total withdrawal). These distri- 
butions are grossly nonnormal and preclude 
the sensitive use of conventional regression 
statistics. 

The distribution problem was partially 
solved by first dichotomizing respondents into 
those who advocated total or less than total 
withdrawal and then by relating these fre- 
quencies to the three draft categories for each 
question, This dichotomization can be justi- 
fied on two grounds; for the 1-month and 6- 
month questions, it approximated a median 
split with no gross distortion between the 
proportion of respondents. advocating total 
withdrawal or less; and it seems meaningful 
to distinguish between persons who want total 
withdrawal and those who are prepared to 
see troops remain in Indochina. 

The data from the analysis are in Figure 1. 
One feature of the figure is the effect of the 
time interval by which troops should be 
withdrawn. About 409% of the sample in 
Experiment II would like total withdrawal 
within 1 month, over one-half of the sample 
in Experiment I would like it within 6 months, 
and over 8096 would like it within 1 year. 

A more important feature of Figure 1 is 
the larger proportion of draft-eligible men 
with Numbers 245-366 who advocated total 
withdrawal. This effect was tested statistically 
by contrasting draft-eligible men with Num- 
bers 245-366 to (a) draft eligibles with Num- 
bers 1-122; (b) draft eligibles with Numbers 
123-244; and (c) draft ineligibles with Num- 
bers 245-366. The resulting chi-square values 
are shown in Table 1, and they indicate a 
small but replicable effect. The statistical size 
of the effect increases when the independent 
chi-square values from each experiment are 
summed (McNemar, 1962, p. 222). For the 
contrast between draft eligibles in Category 
1-122 and Category 245-366, the two 
summed chi-square values were 8.20 (p< 
02) and 5.07 (p< .10); for the contrast 
between draít eligibles in Category 123-244 
and Category 245-366, the summed chi- 
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1. The proportion of persons advocating total troop withdrawal as a function of draft category, 
draít eligibility, and proposed date of withdrawal. 


ere 6.92 (p< .05) and 7.75 
for the contrast between draft 
raft ineligibles in Category 
immed chi-square values were 
and 8.77 (p < .02). All of 
s are two-tailed and involve two 
iom. No other within-question 
significant.* 
teworthy feature of Figure 1 is 
U-shaped relationship between 
‘jes and advocated withdrawal 


‘degrees of í: 


ef, ho statis 
at the 1-122 category differed from the 
'gory. Thus, the evidence for the 
J shape is visual and not statistical, and the 
phenomenon would not be mentioned were it 
"mot for McArthurs (1970) finding of an 
apparent curvilinear relationship between 
draft categories and belief in the external- 
"internal control of respondents’ fates. 
| Effects of the biased national lottery. There 
s considerable evidence from data on the 
-1969 draft lottery that numbers put into the 
S ur last were drawn out first (Fienberg, 1971; 
Rosenblatt & Filliben, 1971). Data from 
Northwestern samples revealed this same 
stematic bias. However, there is presumably 
ess systematic bias within half years than 
between them. Hence, the relationship of 
aft category to the proportion of persons 
vocating total withdrawal was examined 
| within half years for Experiment II (see 
Table 2). The number of persons wanting 
total withdrawal was related to draft cate- 
gory in four chi-square analyses, one for each 


4 These differences in the proportion of persons 
- advocating total troop withdrawal are not due to the 
j^ decreased proportion of petition signers with Num- 
bers 1-122 and 1-A draft classification in Experi- 
- ment II. First, the troop withdrawal findings were 
similar in both experiments, whereas the decreased 
etition signing was only found in Experiment II; 

nd second, the decreased signing Was found only 
in the 1-122 category, whereas the differences in 
- the proportion of advocated troop withdrawal in- 
volved the 245-366 category. Furthermore, like the 
; 245-366 category, the 123-244 category showed no 
^ reduced petition signing, but, unlike the 245-366 

- Category, it also showed no increased advocacy of 
| total troop withdrawal. 
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TABLE 1 
CHI-SQUARE VALUES RESULTING FROM CONTRASTS BR- 


TWEEN DnarT-Ericintr MEN iN Drart CATEGORY 
245-366 wir (a) Drart-Evicmie MEN IN 
Drart Carecory 1-122, (b) Dmarr- 
ELIGIBLE Men IN Dmarr CATE- 

Gory 123-244, anp (c) Drarr- 
INELIGIBLE MEN IN DRAFT 
Catecory 245-366 


i 


| Category 245-366 of draft. 


eligibles contrasted with: 
Time period for | FE 5 N 
withdrawal Draft Draft | Draft 
eligibles in | eligibles in incligibles in 
Category Category at 
1-122 123-244 245- 
6 months | go | s879 | 1.80 
(Experiment I) | 
1 year 4.13" 2.05 2.95* 
(Experiment IT) 
1 month 1.00 2.88* 5.82** 
(Experiment H) 


* p <.10, df = 1, two-tailed. 
+ p <.05, df = 1, two-tailed. 


combination of half year and question. For 
the 1-month question, attitude varied with 
draft category for the first half of the year 
(2 = 5.45, df=1, P< 025), but not for 
the second half. For the 1-year question, atti- 
tude tended to vary with draft category for 
the first half of the year (corrected x? = 3.17, 
dj = 1, p < .10), but not for the second half. 
These analyses suggest that the advocacy of 
total withdrawal in Experiment II was greater 
among those persons with Numbers 245-366 
who were born in the first halí of the year. 
Unfortunately, the low number of subjects in 
Experiment I prevented a sensitive analysis 
of the effects of draft numbers within half 
years, and all that can be said of Experiment I 
is that total withdrawal was most preferred 
by persons with Numbers 245-366, irrespec- 
tive of birthdate. 


Alternative Analyses 

The distribution of raw scores limited the 
major data analysis to nonparametric tests. 
This makes it difficult to generalize the find- 
ings beyond the use of a single cutting point. 
‘An arc-sine transformation was used to take 
advantage of the full range of scores and to 
reduce the correlation between treatment 
means and variances. Any analyses based on 
this information can be only supplementary, 
since the correlation between means and vari- 
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TABLE 2 
Disremevtions Or Responses as A Fusction or Dgarr Carecory, Harry Yeap 
Birtu, AND Proposen Date or WiTHDRAWAL FOR EXPERIMENT II 
3 | Respondents born January-June | Respondents born J 
Time period Response | Draft numbers | Draft nur 
] 1-244 248-366 1-244 
| ——————— 
Total withdrawal | — 64 (81%) 55 (93%) 87 (81%) 31 (845 
1 year | Less than total | 15 (19%) 4 (7%) 21 (19%) 6 (16 
Sum | 59 108 
| 
Total withdrawal | 26 (33%) | 29 (545%) | 44 (42%) 15 
1 month | Less than total | — 52 (67%) | 25 (46%) 61 (58%) 22 
| Sum | 78 | 34 105 3 


ances was not reduced to zero by the trans- 
formation. However, we should have greater 
confidence in any findings that are supported 
by both parametric and nonparametric analy- 
ses, irrespective of the shortcomings of any 
one analysis. 

Table 3 presents the transformed means 
and the results of one-way analyses of vari- 
ance for each question. More troop with- 
drawal was consistently advocated by persons 
with Draft Numbers 245-366 than by per- 
sons in the other draft categories. Six ¢ values 
result from comparing Category 245-366 with 
Categories 1-122 and 123-244 for each of the 
three questions. Three of these values are 
significant at p < .05 and one at p < .10. 


TABLE 3 


Mean PERCENTAGE OF Troop WITHDRAWAL AFTER 
ARC-SINE. TRANSFORMATION 


Draft eligibles in category: 
Time period for withdrawal 


1-122 | 123-244 | 245-366 
6 months 
(Experiment I) 
1,10 98 1.358 
SD 58 43 
1 year 
(Experiment II) _ 
X 1.36 1.40 1.505 
SD 49 46 123 
1 month 
(Experiment III) 
DE x 86 -98° 
SD 64 .60 .60 


a One-way F = 4.22, df = 2/105, p < .02. 
b One-way F = 2.70, df = 2/280, p < 07. 
c One-way F = .97, df = 2/271, ns. 


For the 1-year question in Experiment II, 
the transformed mean percentage of troop 
withdrawal for those born in the first half of 
the year and with Draft Numbers 1—244 was 
1.39, and it was 1.53 for those with Draft 
Numbers 245-366. But, for respondents born 
in the second half of the year, the respective 
means were 1.43 and 1.44. The interaction 
between draft category and date of birth was 
only marginally significant (F = 2.55, dj = 
1/279, p < .11). However, the difference be- 
tween categories among early borns was sig- 
nificant (p < .02), while the difference be- 
tween draft categories among later borns was 
not significant. The same pattern was dis- 
cernible for the 1-month question. The inter- 
action F was 1.68 (df = 1/270, p < .17); the 
difference between draft categories within the 
early borns was marginally significant (Xs = 
.84 and 1.02, p < .10), while the correspond- 
ing difference with the later borns was not 
significant (Xs = .94 and .91). These analyses 
indicate that the findings from the nonpara- 
metric tests can be replicated when subjects 
do not have to be divided into those who 
advocate total or less than total troop with- 
drawal. 


DISCUSSION 


Which theoretical explanation of attitude is 
best supported by the strong finding that 
attitude varies with draft category and by 
the much weaker finding that attitude might 
be influenced by the interaction of draft 
category and half year of birth? 


‘Sel{-[nicrest Hypothesis 

The self-interest hypothesis was based on 
persons with Numbers 245-366 knowing that 
they would not be drafted, and so having lit- 
tle interest in advocating a large amount of 


withdrawal. However, persons with safe 
numbers advocated more and not less troop 
withdrawal than others. Thus, the hypothesis 
was corroborated. 


troop 


Graduaté students and seniors with 1-A or 
-S classifications might be presumed to have 
mmediate self-interest in advocating with- 


drawal {rom Indochina, since the state of the 
war will be a major determinant of whether 
they enter the forces when they lose their 
deferments during 1970. Sophomores and 
juniors with 2-S deferments presumably have 
< less immediate interest in a quick end to the 
war, since they will not lose their deferments 
for over 1 year. Because of this reasoning, the 
data from draft-eligible males in each experi- 
ment were broken down by draft category and 
by academic class. Attitude was not related to 
academic class. 

Another self-interest explanation is that 
draft-eligible men with Numbers 245-366 felt 
that they would be called into the forces 
(despite official assurances to the contrary), 
and so they strove more than the others, who 
were resigned to the prospect of military 
service, to lower their chances of being in- 
ducted. What this ex post facto self-interest 
explanation cannot easily explain is why total 
troop withdrawal was advocated in Experi- 
ment II only by those persons with Numbers 
245—366 who were born in the first part of 
the year. 

A more subtle self-interest explanation is 
that draft-eligible males with Numbers 1-244 
believed that they would probably enter the 
forces and might serve in Indochina. Tf this 
were to occur, they could best protect their 
lives by advocating smaller withdrawals, since 
a large army in Indochina would offer better 
protection. The major problem with this ex- 
planation is that draft-eligible males with 
Numbers 1-244 did not differ from their 
quasi-experimental controls (draft-ineligible 
males with Numbers 1-244) in the amount of 
advocated withdrawal. Yet, it was only the 
draft-eligible men who had a stake in protect- 
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ing themselves. The assumption being made 


is that draft-eligible and draft-ineligible men 
would have advocated the same amount of 
troop withdrawal in the absence of draft num- 
bers. However, if the base lines were different 
and draft-eligible men advocated more with- 
drawal, then the present data could be in- 
terpreted in terms of the Draft Numbers 
1-244 causing a reduction in the amount of 
desired withdrawal, and not in terms of the 
Draft Numbers 245-366 causing an increase 
in the amount of desired withdrawal, Obvi- 
ously, we prefer the latter interpretation, 


Dissonance Hypothesis 


The dissonance hypothesis was postulated 
on respondents in Draft Category 245-366 
thinking that their reward (virtual exem] 
from the draft) was insufficiently j 
(since it was due to chance and not to work, 
talent, or physical incapacity). It was further 
postulated that one way of reducing the dis- 
sonance that resulted from insufficient justifi- 
cation would be to help others escape the 
draft. The higher proportion of persons in 
Draft Category 245-366 who advocated total 
withdrawal in Experiments I and IT supports 
the dissonance prediction. 

It is possible that in Experiment Il, draft 
category and date of birth interacted to de- 
termine the amount of advocated troop with- 
drawal. To gain a reward by chance probably 
does not justify the reward psychologically. 
There is even less justification for gaining 
the reward because of others’ incompetent 
randomizing and at the expense of persons 
with later birthdates. Hence, dissonance may 
have been especially high among persons born 
in early months. This interpretation assumes 
that the inadequacy of the Selective Service 
System’s randomization procedure was widely 
known among draft-eligible men. This is 
highly likely given their interest in the draft; 
given the frequency of comments about non- 
randomization in the national, Chicago, and 
Northwestern press; and given that news- 
paper articles describing the biased lottery 
were posted prominently in the Northwestern 
University library for over 2 months prior to 
the time of these experiments. 

In addition to the dissonance and self- 
interest hypotheses, à personality interpreta- 
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tion of the data is possible. Signing the peti- 
tion may have attracted one kind of person 
in the Draft Category 245-366 and a different 
kind of person in the other categories or 
among the draft ineligibles. If such a person- 
ality difference affected attitudes toward 
troop withdrawal, then category-related differ- 
ences in attitude might be due to the person- 
ality variable and not to draft category. Un- 
fortunately, we have no data on the char- 
acteristics of persons who did and did not 
sign petitions. However, any personality ex- 
planation of the present findings lacks plausi- 
bility. It is difficult to conceive of a person- 
ality variable that would, first, be differen- 
tially affected by petition signing across draft 
categories and that, second, would cause dif- 
ferences in the percentage of advocated with- 
drawal—but only among persons born in the 
first half of the year! 

Tt is worth noting that several assumptions 
had to be made in ruling out the self-interest 
and personality interpretations, and this is 
normally a less satisfactory logical procedure 
than having control groups or collecting post- 
test data that rule out alternative explana- 
tions. Thus, while we are confident of a dis- 
sonance explanation, we are not certain of it. 
Quite apart from considerations of falsifiabil- 
ity, the lack of certainty comes because nat- 
ural treatments cannot be tailored to a par- 
ticular theory or theories. They are given by 
nature or by social changes, and the experi- 
menter has to fit theory to treatment, rather 
than vice versa. This makes it inevitable that 
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there will be more than the usual di 
ories about intrapersonal processe 
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repancy 
between the operational definition of an inde- 3 
eferent, 
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s will be 
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pendent treatment and its theoretica 
and it also makes it almost inevitable 
than with experimenter-controlled t: 
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motivated or merely “innocent.” 


An ez study (Lerner & Simmons, 1966) 
began v ihe assumption that people have a 
need to eve they live in a just world—a 
world ix \ nich deserving people are rewarded 
and the indeserving are appropriately de- 
prived punished. Given this need, the 
awarenes. of someone who is suffering through 
no fault of his own creates a conflict for the 
observer. The observer can either decide that 
the worlc is not so just after all or go through 
the efío: of persuading himself that the 
innocent” victim actually merited his suffer- 
ing. One relatively comfortable way the ob- 


server can resolve this conflict is by deciding 
that the victim, though innocent by deed, de- 
Serves his fate by virtue of his undesirable 
personal attributes. 

As a test of these ideas, Lerner and 
Simmons placed students in the position of 
Observing a peer (innocent victim) partici- 
pate in a paired-associate learning task. The 
victim, as a result of making the usual errors, 
apparently received negative reinforcements 
(severe and painful electric shocks). After 
these observations, the students devalued the 
Victim when they believed they would con- 
tinue to see her suffer in a second session 
and when they were unable to help her. An 
additional finding was that the rejection was 


.'The author would like to express his apprecia- 
tion to Ron M. Kisch and Ray Freeberg for their 

tance in gathering and analyzing the data for 
these. studies, Requests for reprints should be sent 
to the author, who is now at the Department of 
Psychology, University of Waterloo, Ontario, Canada. 


OBSERVER’S EVALUATION OF A VICTIM: 
JUSTICE, GUILT, AND VERIDICAL PERCEPTION 


MELVIN J. LERNER ! 


Department of Behavioral Science, University of Kentucky 


Three separate studies are reported in which subjects reacted to someone they 
viewed on closed-circuit television (actually a videotape). Those subjects who 
elieved the victim was actually being shocked gave consistently lower evalua- 
ons than those who believed the victim was acting, or than those who 
believed that though the victim was shocked, she would be compensated for 
ier suffering. This general finding held true whether the subjects were male 
or female, a class of-advanced students oriented to a helping profession, or 
exposed to the situation in experimentally heterogeneous rather than homogene- 
ous groups. The same effects were noted whether the victim was altruistically 


strongest when the victim was presented as 
suffering for the sake of the observer (martyr 
condition). 

Although the findings were nicely compati- 
ble with the initial theoretical notions, other 
plausible explanations could be entertained. 
One of these points to the “guilt” of the ob- 
server of an innocent victim's suffering. Most 
people have internalized the obligation to 
defend the innocent and punish the wicked. 
This moral obligation can require acts that 
are uncomfortable or costly in terms of other 
goals. It is reasonable to conjecture, then, 
that the observers in the Lerner and Simmons 
(1966) study may have felt guilt for not 
intervening, especially in the martyr condi- 
tion where the victim agreed to suffer for 
their sake (so they could receive needed 
“Jab” credit). To eliminate a feeling of 
guilt without having to intervene—risking the 
consequence of intervening—the observers 
condemned the victim. 

Another equally plausible explanation as- 
sumes that the observers did not need to alter 
or distort their perceptions of the victim to 
condemn her, After all, the victim allowed 
herself to undergo a rather extended period 
of severe electric shocks merely for the sake 
of meeting a requirement that she participate 
in one experiment or another. Anyone, how- 
ever, with a normal amount of integrity 
would have refused to participate after the 
first severe shock. An observer might infer 
that the failure to effectively resist indicated 
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an objectionable weakness in the victim’s 
character. The observers merely reflected 
these unadmirable qualities in their ratings 
of her. 

A third alternative explanation rests on 
questions of method. There was considerable 
opportunity for the experimenter to influ- 
ence, albeit unconsciously, the reactions of 
the observers—especially since the instruc- 
tions were administered verbally (Rosenthal, 
1966). A variant of this issue points to the 
fact that the observers participated in small 
groups, All of the subjects in each group were 
in the same condition. It is possible, then, 
that the observers tended to influence one 
another by nonverbal signals to react in a 
relatively homogeneous fashion, thus art'fi- 
cially increasing the likelihood of finding 
differences among experimental conditions. 

Last, there is an entire class of explana- 
tions that stems from the characteristics of 
the observers employed in the initial study. 
They were all female undergraduates who 
were similar to the victim in many respects, 
including the fact that they all were taking 
introductory psychology courses and partici- 
pating in an experiment to meet a require- 
ment. These attributes would point to a sense 
of identification as a mediating variable. Also, 
most of the observers were white, Protestant, 
and from central Kentucky. Possibly more 
sophisticated, urbane observers would exh bit 
less judgmental reactions and be less influ- 
enced by “fundamentalist” orientations to 
rewards and punishments. 

The three studies reported here provide 
some evidence concerning one or another of 
these explanations. 


Srupv I 


This study compared the reactions of male 
and female undergraduates to the “innocent 
victim." In addition, the hypothesis that the 
victim's behavior in the situation was inher- 
ently reprehens!ble—because of her failure to 
resist her persecutor—was tested by having 
some observers aware that the victim was 
merely acting. The observers in this condi- 
tion (denatured) were to judge the victim 
and predict how naive observers would react 
to her. Also, the reaction of the observers to 
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the opportunity to intervene by substituting 
themselves for the victim was explore 
Method 

Subjects 

The observers in this study were 31 m^ 30 
female undergraduates from various sec an 
introductory sociology course. They we ed 
as volunteers to participate in a study of r- 
ception of cues of emotional arousal. The a- 
tion was encouraged by their instructor, no 
way was related to their grade in the cours ey 


were exposed to the experimental situation 
groups comprised of both males a 


subjects in each group were in the same ex ntal 
condition. 
Procedure 

The procedure was similar to that emp! | in 
Lerner and Simmons (1966). There was at tro- 
ductory statement by the experimenter outlining the 
value of obtaining information about the w cople 
appear under stress. The subjects then learn: at 
they would observe a student from a c J- 
ductory psychology who was participati er 
experiment—concerned with the effect lg 


negative reinforcement on paired-ass 
The observer's task was to look for s 
as the subject performed. They were the 
plausible reason why it was necessary 
Observe the session via closed-circuit 
subjects watched what was actually 
videotape, during which the victim receive 
apparently painful electric “shocks” for i 
responses and reacted to them with express 
pain and suffering. The victim was not being 
shocked, but merely gave an effective performance. 

The various experimental conditions were created 
in the following manner: 

Denatured. The subjects in this condition were 
told at the outset that they were observing a video- 
tape of someone who was acting as if she were 
being shocked. This tape was to be used in other 
studies of the effect of suffering on impression 
formation, and their task was to provide base-line 
data concerning the impression that this subject 
gives of herself. After viewing the videotape, the 
observers rated the victim's attributes and then 
were asked to predict how she would be rated by 
observers who believed she was suffering and would 
continue to suffer in a second session. 

Midpoint. This condition is directly comparable 
to the one employed in the earlier study (Lerner & 
Simmons, 1966). After watching the videotape, the 
observers were told they would watch the victim 
in a second session of equal length. They then rated 
the “personality” of the victim, after being told that 
one of the purposes of the study was to determine 
the way people of different personalities reacted 
under stress. 


Opport v to substitute, This condition was 
Similar t midpoint condition except that the 
Subjects \ told that they might find it helpful, in 
darifyin: hunches about the cues people exhibit 
under st hey could watch someone other than 
the origi ject (victim). They were then given 
the opi to indicate, privately, on a slip of 
paper w they would be willing to be the 
"subject ic next session. The experimenter 
would ti : one among those who volunteered. 
The subj en rated the personality of the victim. 
"This ratir urred prior to a public disclosure of 
the volu: 


Measu: 


After g the experimental instructions, the 
subjects ed some forms, including a rating 
Tof the nality of the victim. This measure 
consisted 15 highly evaluative birolar scales 
T (intelligent elligent; likable-unlikable; unco- 
T operativ ative; bossy-easygoing; immature- 
mature ; sinative-unimaginative; irresponsible- 
F responsi! nervous-calm; patient-impatient; 
reasonab!e—:nreasonable; rigid-flexible; courteous- 
rude; se15-^-unselfish; warm-cold; sincere-insincere) . 
These rauines were combined to yield an overall 
index of ottractiveness. The range of possible scores 
Was fror to 135. The higher the score, the more 
Positive the rating, The subjects then rated the 
average college student” on the same set of scales 
for purposes of comparison. The main dependent 
variable s derived by subtracting the average 
college student rating of the victim. 


Results 


P The analysis of variance (see Table 1) 
= indicates that the only significant effect is 
attributab'e to the experimental condition 
| (F— 4,71, p< 02). Neither sex of observer 
nor the interaction between sex of observer 
é and experimental conditions approached sig- 
nificance. Apparently, the ratings in both the 
Opportunity-to-substitute and midpoint condi- 
tions were more negative than those in 
the denatured condition (midpoint condition 
versus denatured condition, t = 2.92, p < .01; 
Opportunity condition versus denatured condi- 
tion, ¢= 2.16, p< .05), but they did not 
differ reliably from each other. 
f In the denatured condition the observers’ 
- predictions of the ratings of naive subjects 
| resembled very closely their own ratings of 
the victim, The male ratings yielded a mean 
of —3.44, whereas their predicted mean was 
— 11. Female ratings had a mean of .72, 
| E the mean of the predicted ratings was 
mi 


OBSERVER's ÉVALUATION OF A VICTIM 


TABLE 1 


MEAN RATINGS OF VICTIM AND ANALYSIS OF 
Variance: STUDY I 


Condition 
Sex i —— 
| Midpoint | Denatured | Opportunity 
Male | -—15.55 —3.44 —102 
| 12 9 10 
Female -15755 | "2 -136 
n Lad emi 10 
Analysis of variance 
Source | 4f | MS | F 
Sex (S) 1 | 26.67 <1 
Condition (C) | 2 1153.20 471* 
sxc | 2 c. hy 105803 <i 
Error 55 | 248 | 
i 
*p«.02. 


It should be noted, also, that none of the 
observers volunteered to make themselves 
vulnerable to being selected to take the 
victim's place. 


Stupy II 


This study took place in the context of a 
regular class meeting. The main purpose was 
to determine if these students—both male 
and female of various ages, all of whom had 
some commitment to a helping profession— 
would react to the victim’s suffering in a 
similar or different manner than had the 
younger groups of undergraduates in the pre- 
vious studies. In addition, all of the subjects 
were exposed to the same experimental situ- 
ation at the same time. This procedure was 
employed to eliminate the effect of the experi- 
menter communicating his expectations to the 
subjects and also the possibility of subjects 
influencing one another to achieve relatively 


homogeneous responses. 
Method 


Subjects 


The subjects were 29 students from one advanced 
undergraduate-graduate class in sociclogy of health- 
related behavior. There were approximately half male 
and half female students. The ages, academic status, 
and majors of the students varied greatly, including 
undergraduates exrecting to enter medical school, or 
majoring in sociology, to mature adults returning to 
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obtain advanced degrees in pastoral counseling, 
vocational rehabilitation, and health education. The 
‘ingle main common denominator among them was 
^ concern with one of the helping professions. 


Procedure 


The experiment took place in the context of a 
guest lecture to a class of students. The experimenter 
announced, initially, that in preparation for his 
lecture on stress, he wanted the class to observe 
someone undergoing a stressful event. He then re- 
lated to them a rationale for the importance of their 
karning the cues for someone under stress. An 
assistant then distributed envelopes containing in- 
structions and scales for the observers to complete 
The packets were identical except for slightly dif- 
ferent instructions designed to create the differing 
experimental conditions, Of course, the packets were 
distributed on a more or less random-chance basis 
so that the subjects in the varying conditions were 
mixed throughout the audience, and the experimenter 
had no idea of the condition for any given subject. 

All of the instructions led the observers to believe 
they would see the victim in two sessions. The 
ratings occurred after the first. After reading the 
preliminary instructions, the students saw the same 
videotape employed in the earlier studies. They 
believed they were observing over closed-circuit 
television. 

Two of the conditions employed in this study were 
virtually identical with those of the previous study— 
a midpoint condition, in which the observers were 
naive, and a denatured condition, in which they were 
informed that the victim was acting. The third 
condition, reward, was similar to the midpoint with 
the additional statement that unknown to the victim, 
she would receive $30 at the end of the experiment. 
The measures were identical with those employed in 
the earlier studies, 


Results 


The ratings were similar in the denatured 
and reward conditions (see Table 2). The 
victim, however, was described more nega- 
tively in the midpoint condition (reward 
condition versus midpoint condition, £ = 2.16, 
p< .05; denatured condition versus mid- 
point condition, ¢ = 1.84, p < 10). 


STUDY III 


Probably the most provocative finding in 
the Lerner and Simmons (1966) study was 
the comparatively extreme rejection of the 
victim who agreed to undergo the shocks so 
the observers would receive credit for par- 
licipating in an experiment, The intent in 


Metyiw J. Leenen 


TABLE 2 


Mxaw Ratinos or Viens AND Axa 
VaniaNCE: SrUDY II 


Condition 

Midpoint Denatured 

—7.88 7.00 

(n = 9) in = 11) 

Analysis of variance 

Source dj M 
Condition 2 814.51 

Error 26 321.96 


creating this condition was that the victi 
be perceived as altruistically motivated. 
The severity of condemnation supposedly 
reflected the extent to which the observers 
were motivated to perceive that this appar 
ently martyred victim deserved her suffering 
after all. 

One alternative possibility is that the sub- 
ject’s status of “innocent observer” was 
altered by the martyr condition. Althougi 
the subjects were not consulted about their 
wishes in the matter, the victim was pres 
sured into and agreed to accept her suffering 
for the sake of the subjects. They may hav 
then come to feel guilty over their complicity 
in the injustice or angry over their enforced 
indebtedness to the victim. 

Another possibility is that the martyr 
scene, rather than creating the impression o! 
someone acting generously írom altruistic 
motives, portrayed the victim as someone 
who conforms easily to unreasonable demands 
of authority figures or is willing to suffer un- 
necessary humiliation and suffering for others" 
approval—what is commonly referred to as 
a “sap.” 

The strategy employed in this study was 
based upon two assumptions. If it is true 
that the martyr condition actually portra; 
the victim in a negative light, then observers 
who did not believe the victim was suffering 
or who believed the suffering victim would 
be appropriately compensated would still de 
scribe her negatively. In other words, the 
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wictim s» be perceived in a uniformly 
negative er, regardless of her fate. 

The s | assumption was that subjects 
who we: observing the victim in order 
to meet uirements of a course in intro- 
ductory logy would exhibit fewer, if 
any, fe of guilt or obligation toward 
the altri ictim. Therefore, to the extent 
that gu resentment are the effective 
determin Í the rejection of this victim, 
subjects a class that has no such re- 
quiremer uld not devalue the victim. 
Other o! rs may be implicated, but these 
subjects iot and therefore should exhibit 
no rejec 

Method 
Subjects 

The su! were 46 coeds recruited in the same 
manner idy I—as volunteers from various 
sections o class im introductory sociology. They 
participat: call groups in which all members 
were in ti 1e experimental condition, 

Procedu: 

When thc subjects appeared for the experiment, 
they were : the rationale for observing someone 
under stress. They were also led to believe that some 
of the observers in the room were from introductory 
sociology, while others were there meeting the 
requirement of a class in introductory psychology. 


The experimenter explained that they would observe 
from introductory psychology participate 
ment over closed-circuit television. 
2w a videotape. 

The tape employed in this experiment contained 
an initial sequence in which the “victim” entered the 
toom, received her initial instructions, and performed 
father well in a digit-span task. At the end of this 
Sequence, Dr. Stewart (the victim’s experimenter) 
informed the model that she was running negative 
reinforcement conditions that day for the second 
task. Upon receiving this information, the victim 
explained that she was terrified of being shocked and 
Tesisted going on with the rest of the experiment. 
Dr. Stewart then urged her to continue for the 
sake of the observers, who would not be able to 
obtain lab credit for participation in an experiment 
if she refused to do her part so they could observe 
her. After a few moments of persuasion based on 
the elicitation of altruistic motives, the victim agreed 

go on "if it is necessary for all of them to get 
credit” This scene, except for the use of a different 
Person as the victim, was virtually identical with 

* one employed by Lerner and Simmons (1966). 

After Watching this scene, the observers were 
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i 
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pendent variable. After completion of the 
the experiment was terminated, and, as 
studies, considerable time was spent working 
the feelings of the subjects and turning the 
into a period of instruction. The subjects 
experiment did not see the victim suffer, 
Each group of observers was awigned to 
four experimental conditions. In three of 
were led to believe the altruistic 
suffer severe electric shocks, However, in 
them, unknown to the victim, she would 
$30 at the end of the session, in another, 
and in the third, there would be no 
The subjects in a fourth condition 
were informed that the subjects were 
traying a victim and would not suffer 
The alternative hypothesis can be stated 
simply. If there is something inherently repugnant 
in the martyr's behavior, then she will be condemned 
uniformly across all conditions. On the other hand, 
if the rejection of the martyr is a function of the 
observer being implicated, then there should be no 
rejection from these sociology students in any of the 
conditions. However, if the evaluation of the martyr 
is a function of the degree of the undeserved suffer- 
ing, then we should find the most rejection in the 
condition where the martyr receives no compensa- 
tion, less in the condition where she will receive 
$10, and even less in the condition where she will 


not suffer at all or be paid $30. 


im 


H 
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RESULTS AND DISCUSSION 


Apparently, the observers in this study were 
affected by the various experimental condi- 
tions. Those who believed the victim would 
suffer without compensation described her 
as considerably less attractive than did the 
subjects who believed she would receive $30 
at the end of the session (see Table 3; 
no-reward condition versus 330 condition, 
t= 332, p< 001) or who believed that 
she was not actually going to suffer (no- 
reward condition versus denatured condition, 
t= 4.76, p < .001). The reactions of the ob- 
servers in the $10 condition resembled those 
exhibited by the subjects who believed the 
victim would receive no compensation. Al- 
though the mean rating in this $10 situation 
was slightly less than in the no-reward condi- 
tion, it was not significantly so. The $10 
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TABLE 3 
MEAN RATINGS OF MARTYR VICTIM AND ANALYSIS 
or Variance; Strupy III 


Condition 
Denatured No reward $10 $30 
I 

5.81 —26.58 —19.75 —1.27 

(n = 11) (n = 12) (n = 12) (n = 11) 
Analysis of variance 

Source | af MS F 
Condition 3 2661.65 9.01* 

Error 42 265.20 | 
*p< 001, 


condition did yield significantly lower ratings 
of the victim than either the denatured or $30 
conditions ($10 condition versus $30 condi- 
tion, £ = 2.71, p < .01; $10 condition versus 
denatured condition, t = 3.75, p< .001). 

It is also worth noting that the observers 
in the denatured condition were relatively 
unsuccessful in predicting the reactions that 
naive observers would have to the victim 
under conditions of no compensation. The 
mean predicted rating for naive subjects 
(—2.09) was lower than their own rating of 
5.81 but decidedly higher than that actually 
found in the no-compensation condition 
(—26.58). 

Taken together, these findings appear to 
offer clear support for the hypothesis that 
the observer's reaction to the altruistically 
motivated victim is affected by the extent 
of her undeserved suffering. The alternative 
hypotheses—that the condemnation of this 
victim is the result of something inherently 
repugnant in her behavior or is the result 
of the observer’s guilt and resentment—do 


not seem to fit the reactions of the observers 
in this study. 


SUMMARY AND CONCLUSIONS 


These studies confirm the earlier finding 
that the victim of undeserved suffering runs 


the clear risk of being condemned by those 
who witnesg his fate. In general, the observers 


in these studies thought considerably less of 


+ 
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the person portrayed on the videotape if they 
believed her fate was one of undeserved suf- 
fering than when they understood she vould 
receive a considerable compensation 

pain, or she was acting and not really 


her 


ing at all. This tendency to condemn the 
victim held true if she was altruistically 
motivated or merely “innocently” pursuing 


her normal routine.” 

Of more importance than the general fird- 
ing, which is probably part of our common 
wisdom, is the attempt to understand ‘ts 
psychological origins. Why do people tend to 
condemn victims of undeserved sulferi::? 
The evidence from the studies reported bere 
seems to be inconsistent with two of the 
plausible explanations offered in the in 
duction to this study. All three of the studies 
contained a condition in which the observers 
were informed that the victim was not actu- 
ally suffering, and they were informed to 
react to the person portrayed on the video- 1 
tape. These observers consistently viewed that 
person as considerably more attractive than 
similar observers who believed she was a 
victim. Also, those informed subjects wcre 
unable to predict the more negative evalua- 
tions of naive observers. These findings offer 


2A question has been raised concerning which of 
the specific attributes among the 15 bipolar adjec- 
tives is more or less affected by the various expcri- 
mental conditions, The results of an earlier analysis. 
of the data from Lerner and Simmons (1966) m 
vealed a relatively “nonsensical” pattern of affected 
attributes. For example, the victim in a midpoint 
(extended suffering) condition was seen as signifi- © 
cantly more nervous, impatient, selfish, cold, and 
insincere than the same victim who was to be com- 
pensated. Similarly, the martyr victim's lower scores 
were mainly caused by her being seen as relatively 
unreasonable, rigid, rude, selfish, and insincere, It 
would have been most unlikely to expect the “altru- 
istically” motivated martyr to be condemned for be- 
ing rigid, selfish, and unreasonable: especially since 
her suffering stemmed, apparently, from her allow- | 
ing herself to be persuaded to suffer for the sake of 
others. Because of this lack of a meaningful pattern, 
no additional analyses of particular attribute differ- 
ences has been done subsequent to these intriguing 
but relatively mystifying findings, Our best hunch ¢ 
has been that the specific attributes selected are not 
particularly related to those processes that most 
interest us, but are probably relatively unique to 
the particular stimulus situation. 


; 

good evidence that the victim’s fate rather 
than her behavior in the situation elicits the 
observer’s condemnation, 


Study 1!! dealt most directly with the 
“guilt” explanation, The volunteer subjects 
in this study were not the implicit benefi- 


ciaries of the martyr’s largess, and yet they 


condemned when they believed she would 


not be itly compensated. Unfortu- 
nately, the design of this study does not allow 
a comparison of their reactions with those of 
observers : directly implicated, as in the 
initial study. It is possible, also, that the 
observers in this study, though not impli- 
cated directly by the martyr’s beneficence, 
may have fel! some guilt for not intervening 
on her behali 

The results of Studies I and II bear most 
directly o: the methodological questions 
raised in introduction. A comparison of 
the ratings given the victim by the subjects 
in Study I) with those in Study I suggests 
some inte ig similarities and differences. 
Although differences between the ratings 
given by the subject in the midpoint condi- 
tion versus the denatured condition are virtu- 
ally identical for the two studies—approxi- 


mately 15 points—the victim is seen as some- 


what more attractive by the subjects in 
Study II, This was not predicted, but would 
be consistent with the notion that people 
Who are oriented to helping others (Study II) 


would tend to see victims in a more positive 
light. Nevertheless, this generally positive 
Orientation to unfortunate people did not 
teduce the tendency found in all the studies 
to devalue someone as a function of their 
suffering, 

The results of Study II also tend to dis- 
confirm the hypothesis that subtle cues from 
the experimenter or other subjects can ac- 
Count for the observed condemnation of the 
victim, The experimenter in this study had 
virtually no contact with the subjects and 
did not know which subject in the classroom 
Was in a given condition. It is extremely un- 
likely then that he could have induced the 
Subjects to respond as he hoped they would. 
1 The hypothesis concerning the observers' 
influence on one another and the resultant 
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tendency to give a common response can be 
tested by comparing the within-condition 
variance in Study II with those studies where 
the subjects were run in experimentally 
homogeneous grouping. The within-condition 
variance in Study II, 321.96, is comparable 
to that found in the initial Lerner and 
Simmons study, 344.9. This is especially note- 
worthy given the relative heterogeneity of 
the subject characteristics in Study II. The 
within-condition variance of these two studies, 
though, is somewhat larger than in Study I, 
247.72, and in a similar unpublished study? 
using this same measure, 262.9. 

Apparently, whatever subtle communica- 
tion occurs among the subjects and the ex- 
perimenter in this situation does not account 
for the main findings of this series of studies, 
nor does it appear to introduce an increased 
homogeneity of responses when a collection 
of subjects are given the same experimental 
instructions. 

It seems safe to assert at this point that 
the devaluing of the victims employed in 
these studies was probably not the result of 
something inherently repugnant in the be- 
havior of the innocent or altruistically moti- 
vated victim, or of the experimenter's silent 
signals to the observers or theirs to each 
other, or of something unique to female 
undergraduates from “fundamentalist” Ken- 
tucky. It also appears that one does not 
have to be the beneficiary of a martyr's 
largess to end up condemning her. The most 
consistently reliable determinant of the reac- 


3 Caroline W. Simons, "The effects of deception 
manipulations within an experiment on reactions to 
victims of misfortune.” Unpublished honors thesis, 
University of Pennsylvania, 1968. (Done under the 
direction of Jane Piliavin. Simons’ experiment in- 
volved two conditions of degree of suffering—end 
point versus midpoint. In addition, her subjects were 
run under four different conditions of awareness of 
the experimental situation, Her main findings were 
similar to those reported here. The naive observers 
had significantly lower ratings in the midpoint than 
in the end-point condition. These effects did not 
occur when the observers were aware that the victim 
was not suffering. It is also worth noting that Si- 
mons’ procedure was similar to that employed in 
Study II—written rather than verbally administered 
instructions administered to experimentally hetero- 
geneous groups of subjects 
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tions of these observers—powerless as they 
were to help the victim—was the degree of 
injustice in her fate: the less deserved or 
compensated her suffering, the greater the 
likelihood that the victim would be devalued. 
The next steps should be more direct tests 
of the determinants of this belief and the 
association of condemnation and help with 
varying degrees of “need to believe in a just 
world.” 

In addition to the main results of this 
study, there are some intriguing findings that 
followed further analyses of the data. The 
main dependent variable in these and a num- 
ber of earlier experiments is based on the 
subtraction of the observer’s rating of the 
victim from a standard stimulus, usually an 
average college student. Initially, it was 
assumed that these two measures would be 
positively correlated, and, therefore, the dif- 
ference score would reduce that part of the 
total variance due to individual differences 
in responding to the bipolar scales. Since this 
measure has proved fruitful over a series 
of studies, there has been little reason 
to question it, and continuing it provided 
comparability of data across experiments. 

At the suggestion of the editor, John T. 
Lanzetta, however, the ratings given the 
average student by the subjects in the vari- 
ous experimental conditions were examined 
for Experiments I and III (these data were 
relatively inaccessible for Experiment II). 
This analysis yielded some surprising results. 
To begin with, it became clear that the mean 
ratings of the average student do vary across 
experimental conditions from a low of 89.44 
to a high of 105.05. Second, if one looks 
at the ratings of the victim only and ignores 
the average student rating, the results change 
dramatically for the male subjects in Study I 
but hardly at all for the female subjects in 
Studies I and III. The male subjects re- 
spond almost uniformly to the victim with 
a mean of about 86. The females, however, 
devalue the innocent victim in the midpoint 
and can-volunteer conditions, and the martyr 
is devalued significantly in the $10 and 
no-reward conditions, 

The main conclusion up to this point would 
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m 
be that it is relatively unimportant whether 
one uses absolute or difference scores with 
female subjects. But what about the males? 


If one looks at the difference scores, they 
appear to be virtually identical with females 


in condemning the innocent victim; however, 
an opposite inference would derive from ‘ne 
absolute scores. Apparently, the males, un xe 


the females, did not condemn the innocent 
victim but instead merely changed their nər- | 
ception of people in general (average s'u- 
dent) as a function of the experime: al 
conditions. For the males, the average pc: on 


is seen as more attractive after viewing the 
suffering of an innocent victim whom «3e 
cannot help. 


To explore further the implications of ti-se 
sex differences, the results of an additioyal 
experiment were examined. In that st Jy, . 
male subjects reacted to another male zer 
they caused the other person or the other 
person caused them to suffer under “justifi- 
able" or “unjustifiable” circumstances. “he 
results of that study, using difference scores, 
followed the hypotheses closely: causing 
another person to suffer or being caused to 
suffer by another person under justifiable 
circumstances led to more positive evaluation 
of the other person than when the suffering 
was caused for less “good” reasons. 

For the purposes of this discussion, how- 
ever, the main finding is that if one looks 
only at the absolute ratings of the other 
person in this study and ignores the stan- 
dard, all of the differences across conditions 
are eliminated. The significant effects are 
all due to systematic changes in the ratings 
of the standard—the average person. This is 
exactly what was found in Study I when the 
males were reacting to an innocent female 
victim in an entirely different experimental 
paradigm. 

What then can be concluded, even tenta- 
tively, at this point? Certainly, there seems 
to be a difference in the way men and women 
react to the awareness of suffering in others. 
However, is the difference one of men altering 
the view of society—people in general— 
while women change their perception of the 
victim? Even if this is so, some of the conse- 
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he victim are the same—the 
m is seen by both men and 
ively less desirable than most 
native possibility is that the 
likely to resist varying their 
{ the other person—victim or 
they are no less motivated to 
perceptions than the females. By 
male subjects with the oppor- 
e the average person, they are 
complish the same end psycho- 
less direct consequences for 
te interaction with the other 
:im or nonvictim). 
xpsnation implies similar processes 
female observers, with the 
Difference that men have been 
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taught—forced—to have a greater invest- 
ment in their immediate surroundings. Of 
course, this is pure speculation at this point, 
and skirts the fringes of male chauvinism. 
However, what is not speculation is that the 
male and female reactions are different, while 
they seemingly end up at the same relative 
valuation of a victim. 
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SUBJECT COMMITMENT AND DEMAND CHARACTERISTICS 
IN ATTITUDE CHANGE' 


ROBERT E. LANA * axo ROBERT H. MENAPACE 


Temple University 


Attitude and galvanic skin response (GSR) measurements were taken on 56 
hospital patients. In a counterbalanced 2 X 2 design, some subjects were given 
information as to the experimenters' expectation that they would change their 
attitudes as a result of their 4-6 weeks in the hospital, and others were not 
given such information. Some subjects were required to answer pretest ques- 
tionnaires, and others were not. Posttest questionnaires and GSR readings were 
collected from all subjects after the 4-6-week rehabilitation experience. The 
attitude change data were best predicted by a compound commitment and 
demand characteristics hypothesis. GSR changed significantly in all main effect 


treatment conditions. 


Pauling and Lana (1969) concluded that 
subjects responding to an attitude pretest 
make a commitment to their initial attitude 
statement. This is evidenced by the fact that 
they were less influenced by a subsequent 
two-sided communication than were subjects 
who were not pretested. Other studies (Lana, 
1964, 1966, 1969: Lana & Rosnow, 1963) 
also tend to support the notion that a pretest 
can inhibit attitude change. 

However, it is also possible to conceive of 
a pretest as a situation which may inform 
the subject of the intent of the experimenter. 
When the subject is examined via pretest, 
later receives a communication, and is then 
posttested, he probably concludes that the 
experimenter is looking for change. If he is 
sensitive to the presumed demand character- 
istics (Orne, 1969) of the study, he will 
change in one direction or the other. This is 
precisely what does not happen. 

When the subject Signs his name and 
records his opinion as a matter of public 
record, he tends to maintain this opinion in 
the face of later attempts to change it. The 
demand characteristics argument, on the 
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other hand, predicts the opposite 
Pretest sensitization and the demand charac- 
teristics of the pretest-treatment—pos'test- 
attitude-change design should influence the 
subject in opposite directions—demand char- 
acteristics toward change, and commi(:nent 
toward immobility. 

Because one of the major difficulties of 
using an attitudinal pretest is that it often 
causes artifacts of commitment and demand 
characteristics to occur in an experimental 
situation, it is important to investigate some- 
what less obtrusive techniques of assessing 
its behavioral concomitants, 

A possible way around both the commit- 
ment and demand artifacts would be to use 
a measure of initial attitude that is not 
under the direct control of the subject. For 
example, measuring galvanic skin response 
(GSR) before and after the subject receives 
à persuasive communication might yield in- 
formation about differentially sensitive atti- 
tude areas, even though the GSR would 
obviously not indicate attitudinal direction. 
One purpose of this study was to explore 
the use of GSR measurement as an indicator 
of attitude intensity with a view to surmount- 
ing the pretest-sensitization problem, Kaplan, 
Burch, Bloom, and Edelberg (1963) re- 
ported that interacting persons increased 
GSR responses during verbal interchange. 
Prejudiced persons show high GSR rates to 
various ethnic stimuli (Cooper, 1959a, 1959b; 
Cooper & Siegel, 1956; Parier & Lott, 
1967; Vidulich & Krevanick, 1966: Westie 
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& DeFleur 59). Other studies indicate 
GSR res! to personality test items 
(Willis, ! statements about the church 
(Dickson Ginnies, 1966), extreme views 
in general , Cardoret, Hughes, & Abbey, 
1965), an | neurotics as compared with 
schizophr: (Ward & Carlson, 1964). 

Since t studies indicate that GSR 
change is elated with extreme attitudes 
and with al states which indicate emo- 
tional att ent of the subject to this en- 
vironment is possible that GSR changes 
also correlate differentially with the principal 
conditions his study. That is, a commit- 
ment via pretest may be accompanied by an 
emotional ize as measured by GSR which 
does not ny a demand characteristic 
operating ı result of informing the subject 
of the intest of the experimenter. Of course, 
it is also possible that GSR changes may be 
greatest wil» demand characteristics. 
Hypothese 

la. It was predicted that a significant 
interaction effect would occur between pre- 
testing and information about the intent 
of the ex nenter. According to the com- 
mitment hypothesis: when no pretest and no 
information is given (AgBy), the subject is 
not aware of the intent of the experimenter 
and, therefore, there is no demand made 


upon him to change his attitude. Also, he 
has not made a commitment to an attitude 
Which would inhibit change. Therefore, his 
attitude should change to a greater degree 
than a subject who is pretested and informed 
as to the intent of the experimenter (A1Bi). 
In the pretested and informed condition, 
the subject is committed to his prior atti- 
tude, and is thus particularly unwilling to 
Change when he knows that this is the 
€xperimenter’s expectation. 

lb. Conversely, the demand hypothesis 
predicts that the information concerning the 
experimenter's intent and taking a pretest 
Will yield greater change than in the no- 
Pretest-no-information condition. 

lc. In the remaining two experimental 
conditions, no differential predictions from 
the commitment and demand hypotheses can 
be made, However, information should facili- 
late change in the information-no-pretest 
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TABLE 1 
EXPERIMENTAL DESIGN 
Condition | Bi—Pretest pem 
A,—Information Ai Bi ABe 
Ar—No information AjBi ABs 


condition (A1Bs), because demand character- 
istics increase change and there is no com- 
mitment effect of the pretest. In the no- 
information-pretest condition (A+B), there 
should be little change, since demand is mini- 
mal and there is commitment via pretest. 
A summary of the experimental design is 
presented in Table 1. 

2. It was predicted that there would be 
different GSR change scores among the treat- 
ment groups because the emotional compo- 
nents accompanying demand characteristics 
are likely to be different from those accom- 
panying commitment. 


METHOD 
Subjects 


Subjects were 56 patients at the Moss Rehabilita- 
tion Hospital in Philadelphia, ranging in age from 
29 to 81 (X = 61). Twenty-eight were inpatients 
who had suffered from cerebrovascular accidents of 
one kind or another, such as a stroke or a fall. 
The remaining 28 subjects were outpatients who 
were suffering from pulmonary ailments. GSR and 
attitude measurement of the pulmonary and cerebro- 
vascular accident subjects were taken on about the 
same days. Educational and economic backgrounds 
were heterogeneous and randomly distributed across 
treatment comparisons. 


Procedure 


All of the procedures and equipment described 
below were pilot tested on some 70 physical rehabili- 
tation patients at the same hospital. Each subject 
was taken to a room where silver-silver chloride 
GSR electrodes were attached to the index and 
middle fingers of the subject's nonpreferred hand. 
The recording equipment was located in an adjoin- 
ing room out of the subject's view. The GSR 
recorder was a Sanborn 320DC, two-channel re- 
corder, Each subject was presented with 21 Likert- 
type statements, 1 at a time, printed on 3X5 
inch index cards. The statements included opinions 
of the hospital conditions, the patient's perception 
of the outside (nonhospital) world, and the pa- 
tient's self-perceptions, especially regarding charac- 
teristics of his disabled status. Each statement had 
five alternative responses. The first two questions 
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TABLE 2 


Summary Tante rog Tuxkg-WAY. ANALYSIS OF 
VaxiaNCE Or Atrrrupe CHANGE Scores 


Source ss 4 | us | F 
Information (A) 24.76 1 | 24.76 | 247* 
Pretest (B) AN LP | 05 «1 
Diagnostic (C) 15.52 1 15.52 | 1.55* 
AXB 57.53 | 1| 5753 | 575% 
AXC 26.88 | 1 | 2688 | 2.68* 
BXC 1452 | 1 | 14.52 | 1.45* 
AXBXC 18 | 1 48 | «i 

Error 479.33 | 48 | 999 
"p> .10, 
> € 05, 


were unrelated to any of the above topics, and were 
merely used to give the subject an opportunity to 
settle down in the novel testing situation. Subjects 
from both patient groups were randomly assigned 
to each of the four cells in the 2 X 2 factorial design. 


Experimental Variables 


In the demand condition (A), the subject was 
informed that the experimenter expected his opinions 
to change during the course of his hospital stay and 
that he would be tested again later on to determine 
whether or not his opinion had Changed. During 
posttesting, the subject was again reminded of the 
experimenter's earlier expectation. In the no-demand 
condition (A;), the subject was told nothing of the 
experimenter's expectation either in the pretesting or 
posttesting sessions. 

In the commitment condition (Bi), the subject 
registered his opinions by answering each attitude 
statement aloud to the experimenter who, in full 
view of the subject, recorded his answers on a sheet 
of paper. In the no-commitment condition (Bs), the 
questions were presented and read silently by the 
subject, but answers were not given to the experi- 
menter. The subject was given the ruse that it was 
important for him to answer the questions silently 
to himself as part of the procedure of testing the 
efficiency of the GSR equipment. Stress was placed 
upon the experimenter’s interest in testing bodily 
functions and that the questions simply allowed 
the subject to “concentrate better.” After all of the 
questions had been presented, the GSR electrodes 
were removed and the subject returned to his room, 
The ensuing 4-6 weeks, during which the patient 
underwent physical and psychological (if necessary ) 
rehabilitative therapy, were conceived to be the 
persuasive situation. Part of a rehabilitation pro- 
gram is directed toward convincing the patient that 
rehabilitation will most likely be successful, and that 
the patient will benefit greatly from a positive turn 
of mind toward his hospital stay and toward the 
hospital personnel. This Provides a more natural 
persuasive context, as compared with the usual 
experimental setting. This, coupled with the sub. 
jects normal expectation of being tested within 
the hospital context, presumably reduced the sub- 
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jects suspicion of the experimenter’s i | the 
no-information condition. 

At the end of the 4- or 6-week period bject 
was retested with the identical question 1 in 


the pretest. All subjects gave their opini ally 


while the experimenter recorded them. 1 ject 
in the information treatment was remi: that 
the experimenter thought he might c his 
opinions. GSR measurement was again tal ing 
the posttest. 
RESULTS 

Scoring 

For each attitude item, change scor re 
calculated by subtracting the pretest score 
from the posttest. The absolute value the 


change score was the dependent measure. 
Pretest scores were estimated for the non- 
pretested subjects, from average scores of 
all comparable pretested subjects (cf. Lana, 
1969). 

GSR scores were determined for each ‘em 
by dividing the deviations from base ‘ine 
conductance during a question period by 
their deviations from base-line conduci: ce 
during a rest period (the period shortly after 
a question and just before the next, which 
was used to allow GSR to return to base-line 
conductance level). 


Attitude Change 


Analysis of variance indicated that there 
were no significant pretest differences in atti- 
tude. There were no differences in attitude 
change for the hospital-, self-, or outside- 
oriented questions. The three-way analysis of 
variance presented in Table 2 shows only an 
Information X Pretest interaction. 


TABLE 3 


SUMMARY TABLE FOR THREE-WAY ANALYSIS OF 
Variance or GSR CHANGE SCORES 


Source SS df MS F 
Information (A) 108.25 1 | 10825 | 3.30** 
Pretest (B) 128.47 1415128:47.|13,02*** 
Diagnostic (C) 134.91 1| 13401 | 4.12**** 
AXB 48.43 l| 4843 | 148* 
AXC 21.55 1| 21.55 <i 
BXC .01 $ 01 <i 
AXBXC 188.56 1 | 188.56 | 5.75**** 

Error 1572.81 | 48 | 32.77 
* 
£5 200 

wD z 06. 


HEH b < O5. 


is analysis of this interaction 
e information-pretest (A;B;) 
yn—pretest (A3B,) cells are 
ıt are significantly different 


y analysis of variance of the 
‘ich is presented in Table 3, 
ffects for pretesting ($ < .06) 
gory (cerebrovascular acci- 
. p < .05) and a weak effect 
| (p < .10). The triple-order 
s also significant. Means and 


| indicates n 
diagnostic 
‘dent-pulm 
- for inform 
interaction 


standard : tions of the treatment groups 
- are presen i Table 4. 
DISCUSSION 


la and 1b, predicting a signifi- 
‘ion X Pretesting interaction, 
supported. The information- 
ip (A1B1) changed the least, 
ced, but the no-information— 
oup changed the most, which 
ted. Apparently, a subject re- 
he opposite direction to the 
demand characteristic provided by the infor- 
mation gives only when he is pretested. The 
pretest apparently served to set the subject 
for possibie change when he was not told 
of the expectations of the experimenter. 


itself tends to inhibit change, but all subjects 
Lin those studies were college students. Con- 
-teivably, the heterogeneous subjects of this 
— study were not as aware of the implications 
"of taking a pretest (i.e. that they were likely 
_ to be posttested) as are college students, so 
| they responded largely to the explicit demand 
characteristics of the study, It seems clear 
- that these results suggest that a pretest is 
perceived very differently by different 
M Subject populations. 
Hypothesis 1c, predicting differences be- 
- tween the information-no-pretest condition 
— (ABs) and the no-information-pretest condi- 
tion (A;B;) was not supported. This result 
© Must be interpreted within the context of 
ypotheses 1a and 1b, and simply indicates 
t neither the obvious demand character- 
cs of the study or the commitment of the 
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TABLE 4 


TREATMENT MEANS AND STANDARD DEVIATIONS 
ror ATTITUDE AND GSR Cmawor Scores 


| | Bi—No pretest 


| Bi—Pretest 

Condition Fi 

Atti- - 
due | CSR | dude | CSR 
Ar— Information | C 
X | 3.29 | 820 | 525 | 13.09 
SD | 134 | 440 | 1,76 6.75 

A,—No information | 

ï 6.64 | 7.28 | 4.56 | 845 
Sp 521 | 413 | 327 | 8.10 


subject operate in isolation. 

Hypothesis 2 was supported. Since it was 
of an exploratory nature, stating merely that 
there would be differences among the various 
treatment conditions in GSR change scores, 
we may now describe and try to interpret 
the results. 

The information-no-information difference, 
although weak (p < .10), indicates that the 
expectations of the experimenter produced an 
increase in emotional reaction from pretest 
to posttest in the subject. This may indicate 
that the subject resented the pressure placed 
upon him by the expectations of the experi- 
menter. The main effect of pretesting indi- 
cates that nonpretested subjects changed 
more on GSR than pretested subjects. Since 
initial commitment was less for nonpretested 
subjects, there was less emotional reaction 
during the pretest than during the posttest, 
and hence there was greater change than in 
pretested subjects whose emotional reaction 
was the same during pretesting and post- 
testing. 

The main effect differences between pulmo- 
nary and cardiovascular accident patients are 
not surprising, since pulmonaries were out- 
patients and cardiovascular accident patients, 
inpatients. Also, the ailments are sufficiently 
different from one another to create different 
hospital experiences for both groups, and thus 
different sensitivities to the conditions of this 
experiment. 

The significant interaction poses a problem. 
No convincing explanation of a triple inter- 
action can be made, and we must simply say 
that it indicates possible nonlinear relation- 
ships among the various levels of the study 
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on GSR. Simple-effects analysis shows that 
the difference between cardiovascular accident 
and pulmonary cells is significant at all levels 
($ < 05). 


CONCLUSIONS 


The following conclusions summarize the 
results of this study: (a) The commitment 
and demand hypotheses can be combined to 
give more successful predictions of attitude 
change than either hypothesis alone, under 
conditions where information regarding the 
intent of the experimenter and pretesting 
interact; (^) GSR can be used successfully 
to discriminate areas of attitudinal sensitivity 
without the subject being directly aware of 
such an assessment as he may be when a 
pretest is used to assess initial attitude. 
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One of the clearest examples of behavior 
control is found in the police interrogation 
m, where a suspect can often be induced 
confess to a crime which he may or may 
‘not have committed. The magnitude of this 
| control becomes particularly impressive when 
We realize that the consequences of such 
jehavior are usually imprisonment or even 
"death. The manner in which confessions are 
obtained has important legal implications, 
since the presence of psychological coercion 
could influence their admissibility as trial evi- 
“dence. Zimbardo’s (1967) analysis of the vari- 
Ous psychological techniques used to elicit 
“confessions highlights the problems such meth- 
— ods pose for our system of justice. 

In spite of the importance of this phe- 
“nomenon, only one experimental study (Bem, 
1966) has focused on the cognitive process 
y that may be underlying it. Bem found that 
o under certain conditions, subjects come to 
believe in the false confessions that they 
f have made. According to his self-attribution 
- fheory, an individual bases his subsequent 
attitudes and beliefs on the behavior that he 
- has observed himself performing. Thus, if 
p sees himself making statements under 
circumstances previously associated with tell- 
ing the truth, he will believe that the state- 


lThis research was supported by Office of Naval 
rch Contract N000-14-67-A-0112-0041 to Philip 
Zimbardo, who also provided the laboratory 
ties and valuable advice. Thanks are extended 
to Robert Matschullat, John Weeks, Edward Oliver, 
i En Jan Wattles for their help. 

i Requests for reprints should be sent to the au- 
Ond is now at the Department of Psychology, 
E rsity of California, Berkeley, California 94720. 
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€ ist, the lie stimulus was shown to 
evidenced by better recall, greater confidence, 
different pattern of physiological responsivity. 


THE “TRUTH” ABOUT FALSE CONFESSIONS’ 


CHRISTINA MASLACH * 


Stanford University 


One of the few experimental studies to focus upon variables which might play 
a role in police interrogation is Bem's “false confession" 
that subjects come to believe that their false statements are true when emitted 
in the presence of a discriminative truth stimulus. In an attempted replication, 
resent study found evidence to support an alternative explanation of this 
gz, based upon decreased vigilance induced by the truth stimulus. In 
promote more cautious responding as 


research. He showed 


slower reaction time, and a 


ments are true, (even if they are, in fact, 
false). Both the practical and theoretical 
implications of this phenomenon make it 
deserving of further study. Because Bem’s 
results were based on a rather small sample, 
his experiment should first be replicated with 
a larger population, in order to establish the 
reliability of the phenomenon, and then be 
extended further. The addition of new vari- 
ables and dependent measures to Bem’s basic 
paradigm could perhaps elucidate the dy- 
namics of this hypothesized self-persuasion. 
The present study is an attempt to provide 
such a replication of Bem’s original findings 
and to begin to delineate the underlying 
process. 

In this experiment, each subject first per- 
formed a word task in which he was asked 
to cross out a specific set of words but not 
others. He was then trained to make true 
statements in the presence of one colored 
light (the “truth light”) and to make false 
statements in the presence of a second light 
(the “lie light"). Finally, he was required 
to make statements about words which he 
previously did or did not cross out. Half of 
these “confessions” were false and half were 
true, and each of them was made in the 
presence of one of the two lights. The sub- 
ject was then asked to recall whether he 
had or had not crossed out the word, and this 
recall was checked against his actual per- 
formance on the word task. Following from 
the pattern of Bem’s (1966) results, the pre- 
diction was that there would be an inter- 
action between the lights and the confessions. 
False confessions made in the presence of 
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the truth light should produce more recall 
errors than false confessions made when the 
lie light is on. On the other hand, true 
confessions emitted in the presence of the lie 
light should produce more recall errors than 
true confessions made in the presence of the 
truth light. 


METHOD * 
Subjects 


Fifty-five Stanford college students (36 males and 
19 females) were asked to participate in a study on 
lie detection, Halí were recruited from campus 
dormitories and were paid for their participation, 
while the others took part in the study to satisfy 
a requirement of their introductory psychology 
course. 


Procedure 


The experimental paradigm consisted of four 
successive phases: (a) Phase I—The subject engaged 
in some activity which could be later used as the 
content of his confessions; (b) Phase II—the experi- 
menter obtained information from the subject which 
would be used in the training procedure of the next 
phase; (c) Phase III—two discriminative stimuli 
were established for truth telling and lying; (d) 
Phase IV—these stimuli were paired with the 
confessions that the subject was required to make 
about his previous activity in Phase I. The major 
dependent measures were the subject's recall of what 
he actually did in Phase I, as well as his confidence 
in the accuracy of his recall. 

In the first phase of the experiment, the subject 
was given a list of common nouns and an alpha- 
betical guide which contained 50 oí these words. 
He was then asked to cross out each word on the 
lit that also appeared in the alphabetical guide. 
In the second part of the study, the subject com- 
pleted a 50-item self-information form, which asked 
such questions as “What is your favorite music?” 
and “What did you eat for dinner last night?” 

For the remainder of the study, the experimenter 
communicated with the subject (who was in a 
soundproof chamber) via an intercom, The follow- 
ing procedure was then used in Phase IIT to estab- 
lish two colored lights as the discriminative stimuii 
for truth telling and lying, The experimenter asked 
questions, one at a time, from the subject’s self- 
information form. After each question, one of two 
colored lights was illuminated in the subject’s room 
(according to the “lie detection” cover story, the 
lights were connected to “voice recording equip- 
ment”). If the red light came on, the subject had 
to answer with a truthful statement; if the white 


3A more detailed procedural description of this 
experiment, as well as the stimulus materials, can 


be found in the laboratory manual by Lane and Bem 
(1965). 


CHRISTINA MASLACH 


light came on, he had to give an u 
to the question (the colors were reverse 


iswer 


ilf. of 


the subjects). Half of the 50 question sired 
with the truth light and the remaind the 
lie light. 

The final phase of the study tested t t of 
this discrimination training on the su ecall 


performance, The subject was required 40 
statements about the words he did or d ross 
out in the Phase I word task. Halí oí ite- 


ments were truc ones, and halí wer For 


example, the statement “I did cross out ord 
DOG" was a true one, since the subject ! ally 
crossed out this word on the previou ask. 
In contrast, ^I did cross out the word APPI was 


an untrue statement because the subjec not 
crossed out this word earlier. While the s vas 
making each of these forced response 


fes- 


ur ted. 


sions, one of the two colored lights was 

There were thus four experimental condition 10 
words each. Two of the conditions pro ia 
conflict between the light and the confessio: alse 
confession-truth light and true confession-lic ght. 
In the other two conditions, there was no such 
conflict: false confession-lie light and true c fes- 
sion-truth light. This within-subjects design «/) wed 
each subject to serve as his own control iter 
making each confession, the subject indicated 
whether he thought he actually had or hac not 
crossed out the word. He also indicated his confi- 
dence in the accuracy of his recall on a 5-poin: scale 
(ranging from 5=“absolutely sure" to 1! -—— not 


sure at all"). At the end of the experiment, the «ub- 
ject was questioned about his reactions to the lichts 
and the confessions, and was then debriefed. 

Thus, the present study utilized a 2 X 2 design 
which each subject appeared in each of the 
cells. These four groups were an exact replicati 
of four of the five conditions in the Bem ( 
study. Bem's fifth condition (a control group in 
which there were no lights or confessions) was not 
included in the present experiment, since only the 
directional hypotheses were being tested. 


RESULTS * 


A preliminary analysis of the data showed 
that Bem's predicted interaction was not 
occurring. Rather, the subjects were more 
accurate in their recall under both of the 
lie-light conditions than under the truth- 
light conditions. This finding suggested that 
the lie light may have acquired the properties 
of a "vigilance" cue, signaling the subject to 


*Because of the within-subjects design of the 
present study, £ tests were used to analyze the data, 
rather than an analysis of variance. Difference scores 
were obtained for all subjects, and the mean dif- 
ference score between two conditions was tested 
against zero, All reported £ tests are two-tailed (as 
opposed to the one-tailed tests used by Bem). 


nd think carefully about his 
ie property could have re- 
training procedure in Phase 
fairly easy for the subject 
ı true answer to a question, 
harder and make a “double 
ler to lie. That is, he has to 
true answer and then think 
rom it. Such responses, which 
nplex reasoning, have been 
(1969) to be associated with 
: time. Therefore, to test this 
alternative n of vigilance, the subject's 
response ti: c., the interval between the 
confession e recall response) was mea- 
sured, the | ‘tion being that it would be 
longer in both lie-light conditions. 


response” i 
first think 
of one diffe 
involve mo 
shown by ( 
a longer r« 


Tn addit; he subject’s physiological re- 
Ssponsivity it rate and galvanic skin re- 
sponse—G‘ was recorded on an Offner 
Type R dy» <:aph. It was predicted that the 
greater vivilance induced by the lie light 
would be re‘iocted in a greater probability of 
GSR responses. Following from the Lacey, 
Kagan, Lac y, and Moss (1963) findings of 
heart deceleration with sustained attentive- 
ness, it was also predicted that heart rate 
would be lower for the lie-light conditions. 
The use of recording electrodes did not ap- 
pear to disrupt the experiment in any way; 


in fact, it actually enhanced its validity by 
Making the cover story of lie detection even 
More plausible. 

Overall, the data for 55 subjects strongly 
Support the vigilance hypothesis (main effect 
Of lights) and definitely do not confirm Bem’s 
theory of self-observation and persuasion 


TABLE 1 
Mean NUMBER or RECALL Errors FOR 10 TRIALS 


n 


Confession 
Stimulus light epa 
E True False 
Truth — i 
Present study 3.04 3.76 
Bem (1966) 2.36 3.82 
Lie | 
Present study 2.49 2.91 
Bem (1966) 3.82 1.82 
iN —— — 


N 


Mute —Lie versus truth light (¢ = 3,68, df = 54, p < .00D 


“versus false confession (£ = 1.92, df = 54.2 <-10)- 
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TABLE 2 
MEAN SCORES COMBINING ACCURACY AND CONFIDENCE 


Confession 
Stimulus light — — 
True | False 
Truth 6.90 6.45 
Lie 7.27 7.07 


I 


Note,—Range: 1-10. Lie versus truth light (f = 4.62, df = 
s, p s 001); true versus false confession (f = 1.82, df = $54, 
f <.10). 


(interaction of lights and confessions). As 
shown in Table 1, the subjects made fewer 
errors in the lie-light conditions than in the 
truth-light conditions. This comparison is sig- 
nificant at the .001 level in the false-confes- 
sion condition, and thus replicates Bem's 
original finding for these two groups. How- 
ever, the same comparison in the true- 
confession condition (p < .02) is opposite in 
direction to Bem’s result. The ¢ test of Bem’s 
predicted interaction (the mean difierence 
score of the conflict minus the nonconflict 
conditions tested against zero) did not ap- 
proach significance (¢ = .88). The subjects’ 
confidence ratings generally reflect their recall 
performance; confidence was higher for the 
lie-light than for the truth-light conditions 
(t = 2.10, df = 54, p < .05). 

A better index of the subjects’ recall per- 
formance combines recall accuracy and confi- 
dence into a single score. An error that is 
made with a confidence rating of 5 is a 
“bigger” error than one made with a confi- 
dence rating of 1, for example, and should 
be weighted differently. Accordingly, each of 
the subjects’ responses was given a score 
from 1 to 10, where 1 = error with a confi- 
dence rating of 5, 2 = error with a confidence 
rating of 4, . . . 6 = correct response with 
a confidence rating of 1, . . . 10 = correct 
response with a confidence rating of 5. 
Table 2 presents the means of these combined 
scores for each of the four conditions, 

A more precise metric than combined scores 
is provided by the memory-operating curve 
(Atkinson, Bower, & Crothers, 1965). This 


5 There was no difference between the experimental 
results for males and females, except that the 
females were slightly more variable. Similarly, there 
were no differences between the responses of paid 
and volunteer subjects. 
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Curve takes into account not only the size 
(ie, weighting) of the response, but the 
type of correct or incorrect response that is 
made. In other words, there are two types 
of errors (saying "yes" when the true answer 
is "no," or “no” when the true answer is 
"yes") and, similarly, two types of correct 
responses. To determine the shape of the 
memory-operating curve, a 10-point scale of 
the subjects’ possible responses, ranging from 
“no” with confidence of 5 to “yes” with con- 
fidence of 5, is used. The frequency of sub- 
jects’ responses at each of these scale posi- 
tions is recorded, thus forming a response 
frequency distribution. One such distribut'on 
is calculated for the test items which had 
à correct answer of “yes,” and another for 
the items which were correctly answered by 
“no.” These two distributions are converted 
to cumulative probability distributions which 
then form the axes of the memory-operating 
curve, Two such curves were plotted for the 
present study—one for the lie-light conditions 
and one for the truth-light conditions. 

The shape of the curves describes how well 
the subjects remembered what words they did 
or did not cross out in Phase I. If their 
recall had been no better than chance, the 
resulting curve would be the diagonal shown 
in Figure 1. That is, the subjects would be 
just as likely to give a particular response 
(e.g, “no” with confidence of 4) to a yes 
item as to a no item. However, if ther recall 


P (RATING OF "NO" ITEM 2 X) 


9 LIE LIGHT 


2 O—O TRUTH LIGHT 
CN:553 


o 2 4 6 8 LO 
P(RATING OF "YES" ITEM 2 X) 


Fic. 1, Memory-operating curve. 
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was absolutely perfect (i.e., they alw 
the correct answer with confidence 
curve would be a right angle along 


gave 
, the 


»per 
lefthand corner of the graph. Thus, etter 
the subjects! memory, the furt! away 
the curve is from the chance pe: ance 
diagonal and the closer it is to the rfect 
performance right angle. Figure 1 ents 
the memory-operating curves for the ‘sent 
study and shows graphically that ects 
had better memory in the lie-light c ions 
than in the truth-light conditions. The mean 
difference between the lie-light and the ‘ruth- 
light curves is significantly different fr zero 
(t = 4.67, df = 54, p < .001).5 

On the measures of response timc and 
physiological responsivity, the mean c ffer- 
ences were in the predicted direction bu: fell 
short of significance. Thus, the lie-light condi- 
tions showed a slightly longer response ‘ime, 


lower heart rate, and larger number of CSRs. 
In the truth-light conditions, longer reaction 
times were associated with less accurate re- 
call performance (r = —.212, df= 154, p< 
.008),* as though the subjects were taking a 
long time to think only on those items where 
they did not know the answer. In contrast, 
the subjects in the lie-light conditions took 
more time for all items, regardless of how 
accurate they were in their recall, Further- 
more, longer reaction times in the lie-light 
conditions were correlated with decreases in 
heart rate (r = —.264, df = 188, p < .001), 
thus providing some support for the prediction 
of heart deceleration with sustained attentive- 
ness. For truth-light items, however, longer 
reaction times were associated with increases 
in heart rate (r = .200, df = 188, p < .006), 
which would suggest that the subjects in 
these conditions were more anxious or upset 


5 The statistical analysis used for comparing these 
two curves was suggested by Gordon H. Bower. 
Each subjects response frequencies were multiplied 
by their corresponding confidence levels (ranging 
from 5 to 1 for correct answers and from —5 to —1 
for incorrect ones) and then added tozether to yield 
a total score for the curve. The difference between 
the scores for the lie-light curve and the truth-light 
curve was then tested against zero. 

* The degrees of freedom for all reported correla- 
tions are based upon 20 items for each of the 
subjects for whom there is complete data on reaction 
time, heart rate, and GSR. 


ot know the answer and had 
o try to think of it. In addi- 
tude was highly correlated 
for truth-light conditions 
159, p < 001). This was sig- 
05) less true of items with 
- 168, df = 160, p < .05). 


n they di 


DISCUSSION 


of this study suggest that the 
lie light causes the subjects 
ind to be more cautious, with 
t they make fewer errors and 
ident in their recall accuracy. 
| evidence from the measures 
e and physiological respons:v- 
impressive, since this is a 
: design; the people who re- 
in one way to the lie light are the same 
who show a different pattern for truth- 
rhe lie light appears to be a 
eralized state of vigilance and 
ss, and thus has a different cogni- 
significance for the subject than the 
light. Further support for this hypothe- 
‘comes from the subjects who voluntarily 
exorted that they became more anxious and 
tive when the lie light was present, but 
axed and did not try so hard when the 
friendly” truth light came on. 
An analyss of each subject's pattern of 
ponding reveals that the majority (5176) 
of the subjects showed the vigilance main 
, while 20% of them showed Bem's pre- 
dicted interaction. Most of the remaining 
b'ects showed response patterns that were 
‘reverse of these two hypotheses. E*ghteen 
cent had interaction patterns that were 
posite of Bem's predicted interact'on, 
y only 7% showed a reversed main effect 
that was contrary to the vigilance hypothesis. 
The remaining 4% of the subjects showed 
ho effect of the discriminative light training 
On their recall. If we assume that each of 
patterns has an equal probability of 
rring, then such results are significantly 
erent from chance (x? = 42.64, df=4, 
<.001). This finding not only supports 
e vigilance interpretation, but suggests that 
me uncontrolled situational variable might 
Ve been responsible for the different set of 
sults in Bem's original study. Perhaps the 
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critical difference was that the lights served 
primarily as motivating stimuli in the present 
study and as discriminative stimuli in Bem’s 
experiment. If subjects were not highly moti- 
vated to do their best and be accurate, then 
the lie light would be a cue to be attentive 
and work harder. This general motivating 
property of the lie light might then over- 
shadow any response to it as an informa- 
tional cue. However, if the subjects were 
motivated by experimental instructions to be 
highly accurate and attentive for the entire 
recall test, then the lights would not fulfill 
any motivating function and would serve only 
as a discriminative cue for truths and lies. 
It is possble that in the latter situat'on 
Bem's self-persuasion effect would emerge 
(D. Bem, personal communication, August 4, 
1970). 

Although Bem speaks oí inducing belief in 
false confessions, this phenomenon is not 
actually demonstrated by the results of either 
his original study or the present one. If a 
person had actually come to believe h's false 
statements, his confidence in the accuracy of 
his recall should be about as high as the 
confidence in his true statements. However, 
the experimental findings show that confi- 
dence ratings drop as the subject makes more 
errors. In other words, he is, with good 
reason, becoming more unsure of his recall 
accuracy. Such confusion and lack of confi- 
dence may set the stage for belief in false 
confessions (by making the person more vul- 
nerable to subsequent interrogation tactics), 
but they are definitely not evidence of the 
belief itself. Making a suspect confused and 
unsure of himself is the goal of some poce 
interrogation techniques (Zimbardo, 1967), 
and its effectiveness in obtaining a confess'on 
can perhaps best be illustrated by an actual 
case. In New Jersey, a young man named 
William Doss was convicted of first-degree 
murder, in spite of contradictory evidence, 
solely on the basis of an alleged confession 
(unsigned) that he made during pol'ce ques- 
tioning. When asked about this confess'on, 
Doss said, “They were asking so many ques- 
tions, I don't know when they started or 
when they stopped or anything. Evervthing 
is so mixed up, T don't know what I told 
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them [Asbury Park Evening Press, November 
12, 1966].” 

Future research could better establish the 
possible link between confusion and suscepti- 
bility to attitude change. The findings of the 
present study suggest that subjects would be 
most likely to change their attitudes after 
being exposed to the  truth-light-false- 
confession condition, but would be most re- 
sistant to attitude change techniques in the 
lie-light-true-confession condition. In con- 
trast, Bem’s model and experimental findings 
would predict that both of these conflict con- 
ditions would be equally susceptible to atti- 
tude change techniques, and more so than 
either of the nonconflict groups. 

It is somewhat difficult to extrapolate from 
the two present studies to the real world, 
because the experimental situation was a 
rather artificial and uninvolving one. Further 
experiments on false confessions should use 
different paradigms which build in a greater 
degree of mundane realism, and in which the 
act of confession has greater consequences 
for the subject. While the false confession 
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of an experimental subject does no 
than lower his test score, the false 
of the suspect in the squad room 
case of William Doss and peri 
others, can result in the loss of hi 
or life. 
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; OF TEST ANXIETY ON ACQUISITION, RETENTION, 
ND GENERALIZATION OF A COMPLEX VERBAL 
TASK IN A CLASSROOM SITUATION* 
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zh- and low-test-anxious subjects learned material drawn from Hullian 
irning theory. Half of the subjects were given an acquisition test trial after 
h of the five acquistion study trials, and the other half were administered 
acquisition test trial only after five acquisition study trials had been com- 
ted. Half of the subjects in each of these test trial conditions were asked 
report rehearsals and readings on each information item. The giving of 
í-reports did not significantly affect performance in acquisition, nor was 


While test anxiety is recognized to be a 
major probiem of college students, few sys- 
tematic s: have been conducted to ex- 
amine the cect of anxiety on learning in a 
Classroom situation. Rather, work has been 
performed he laboratory which is at best 
only a poor analogue to the actual academic 
setting, both in terms of situations involved 
and materis!s utilized. If research findings 


- from the isboratory are to be utilized in 
Work with college students, then it behooves 
the researcher to extend his work to more 
‘Classroom-like situations. 

Katahn and Branham (1968) developed a 
task which consisted of material similar to 
that found in psychology textbooks. These 
authors presented difficult material drawn 
- from Hull’s postulate system to high- and 

low-anxious subjects. The subjects were tested 
x by two sets of questions: one (acquisition) 

Was worded exactly as the original informa- 
tion items; another (generalization) was 
Worded differently but required the same re- 
Sense. Katahn and Branham found that low- 
anxious subjects tended to be superior on 

*The current study was presented by the first 


author to the Department of Psychology, Vanderbilt 


ony, as partial fulfillment of the requirements 
A AM master's degree, 

Requests for reprints should be sent to Martin 
tahn, Department of Psychology, Vanderbilt Uni- 


T Versity, Nashville, Tennessee 37203. 


re a significant Test Anxiety X Report interaction. There were significant 
‘ferences in performance due to test anxiety for the group tested after each 
juisition study trial but not for the group tested after five study trials. 
\trolling for original learning, there were significant differences due to test 
nxiety on both the retention and generalization tests. 


the acquisition task, and were significantly 
superior on the generalization task. 

The present study modifies the Katahn and 
Branham design by performing the task in an 
actual classroom situation and by the adding 
of a retention condition. The study reported 
here is based upon the hypothesis that task- 
irrelevant responses associated with test anx- 
iety interfere with the acquisition of a cor- 
rect response. Not only have high-anxious 
subjects generally been inferior in perform- 
ance to low-anxious subjects, but they have 
also been characterized by more rumination, 
self-deprecation, and negative affect (Feather, 
1965; Mandler & Watson, 1966; Neale & 
Katahn, 1968; Snyder & Katahn, 1970b). 
The work of Katahn, Stenger, and Cherry 
(1966), in which test-anxious college stu- 
dents reported an inability to organize while 
studying, and a lack of rehearsal while read- 
ing assignments, also lent support to this 
hypothesis. 

Expanding upon Cattell and Scheier’s 
(1961) theoretical distinction between state 
and trait anxiety, Spielberger, Gorsuch, and 
Lushene (1969) developed the State-Trait 
Anxiety Inventory (STAI). State anxiety is 
conceptualized as a temporary and transitory 
emotional condition, and in the STAT, state 
anxiety is operationalized by instructing the 
subject to respond to 20 questionnaire items 
with his feelings “right now,” Trait anxiety 
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is conceptualized as a fairly stable predispo- 
sition to react with anxiety responses in a 
variety of situations, and in the STAI, trait 
anxiety is operationalized by instructing the 
subject to respond to the 20 questionnaire 
items with his feelings “generally.” In their 
test manual, Spielberger et al. (1969) noted 
that other instructions are possible in addi- 
tion to the state and trait instructions. Fol- 
lowing this suggestion, Snyder and Katahn 
(1970a) modified the STAI instructions in 
order to estimate test anxiety before a class- 
room test. Under these instructions, the sub- 
ject imagines that he is about to take a 
classroom examination, and responds to the 
20 STAI questionnaire items accordingly. 
This modified version of the STAI has been 
used to estimate test anxiety in several studies 
in our laboratory (Snyder & Katahn, 1970a, 
1970b; Snyder & Ray, 1971). This modi- 
fied version of the STAI is best conceptual- 
ized as a trait disposition to react with anx- 
iety in the test-taking situation. Using the 
same task employed in the current study, 
Snyder and Katahn (1970b) have demon- 
strated that this trait disposition to react 
with test anxiety is accurately reflected by 
ongoing state anxiety in the test-taking situa- 
tion. In this study, then, the term “anxiety” 
refers to the trait of test anxiety as measured 
by the modified version of the STAI. 

This investigation also sought to determine, 
through self-reported rehearsals and readings 
during acquisition trials, the relationship of 
reading and rehearsal to performance on this 
task in particular, and to academic achieve- 
ment in general. Overall, then, the major 
purposes of this study were to examine the 
effects of test anxiety on acquisition, reten- 
tion, and generalization of complex verbal 
material in an actual classroom situation, and 
to examine the relationships between self- 
reported rehearsals and readings of the in- 
formation items, to performance on the task 
and academic achievement, 


METHOD 


Subjects 


As part of course requirements, 122 males enrolled 
in introductory psychology at Vanderbilt University 
served as subjects. Subjects were divided into high-, 
medium-, and low-test-anxiety groups according to 
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their scores on a modified version of t STAI 
(Spielberger et al, 1969), designed to esti feel- 
ings before a classroom test. The usual 20 s oÍ 
the STAI were utilized (ex, “I feel calm,” `I feel 
worried,” “I feel relaxed”), but the instruct were 
changed so that the subject was told to r 1 to 
the items in terms of his feeling as he “w bout 
to take an examination.” Using this modifi TAI, 
high anxiety was defined as the upper 25th | tile 


(n—29, X —93.14, SD = 10.76), and low 
the lower 25th percentile (n = 30, X = 54 
7.88), of all males enrolled in introductory 
ogy (m=292). The medium-range anxiety ects 
were used only in the correlations in c: to 
represent all anxiety levels. 


Materials 2 
4 è 
The learning task consisted of a set of 16 in{orma- 
tion items based on Hullian learning theory and two 
sets of 14 questions. One example of an item is as 
follows: “The reaction threshold (S*R) is the lowest 
value of the reaction potential that will. ally 


produce a response.” For testing in acquisitio: and 
retention, the words “reaction threshold” were not 
included and had to be filled in by the subject. In 

ization L the question was as follows: 
educed to zero, any response 


occur.” In the above example, the subject had to 


booklet 
form. 


Conditions 


Half of the subjects during acquisition were tested 
after each of five acquisition study trials, and the 
other half were tested once after all five study trials 
were presented. Both groups were tested for reten- 
tion and generalization after 48 hours. Within cach 
group, those tested five times and those tested once, 
half were asked during acquisition to check cach 
time that they read an information statement, and 
to check each time that they rehearsed an informa- 
tion item. The other half in each group was not 
asked about reading and rehearsal. All subjects were 
naive, in that none of the information included 
in the task had been dealt with in introductory — 
psychology. 


Procedure 


The task was administered during an actual class 
period, in which half of the class met in one lecture 
hall, and half in another. The retention and generali- 
zation tests were administered during class 2 days 
later in the regular lecture hall. During acquisition, 
each subject was given a booklet which consisted of 
the material for five acquisition trials. Instructions 
were printed on the front of the booklets, and the 


test items for the acquisition tests were counter- 
balanced. 


A general instructions was given before 
the administration of the task in which the subjects 


were told by t nstructor that this was a study 
dealing with : rning of psychological material. 
Subjects wer jd that the information they 
were to lear: information which would be 
presented in r in the semester and that it 
was to their adventage to learn the material at this 
time, since it t of the course work. In addi- 
tion, the subje ere told that the booklets were 
different in f presentation and instructions 
and they wer >) be alarmed if the person next 
to them appeared to be doing a different task. The 
booklets wer: passed out, and the subjects were 
asked to read structions on the front of their 
booklets. 

For the gro hat reported rehearsals and read- 
ings, the inst: ns were as follows: 

You are to learn the following information as 
you would ı examination in psychology. You 
will be teste this information. The informa- 
tion has bec ken up into sentences. Each time 
you read a sentence, you are to place a check 
in the space vided on the right-hand side of 
the page. If vou read any sentence more than 
once, or come back to that sentence for a reread- 


would place a check for each time 
sentence. Each time that you 
, that is, practice the sentence 
the sentence, you are to place a 
it of the sentence. You will have 
only a limite? amount of time in which to learn 
the material, «c try to work as quickly as you can. 
DO NOT TURN THE PAGE UNTIL YOU ARE 
TOLD TO DO SO. YOU MAY NOT RETURN 
TO A PAGE ONCE TIME HAS BEEN CALLED. 


The instructions for the groups that did not report 
readings and rehearsals were as follows: 


You are to learn the following information as 
You would for an examination in psychology. You 
will be tested on this information. The informa- 
tion has been broken up into sentences, You will 
have only a limited amount of time in which to 
learn the material, so try to work as quickly as 
you can, DO NOT TURN THE PAGE UNTIL 
YOU ARE TOLD TO DO SO. YOU MAY NOT 
RETURN TO A PAGE ONCE TIME HAS 
BEEN CALLED. 


, The high-test-anxious and low-test-anxious sub- 
Jects were divided into four groups, with Groups 1 
d ? meeting together, and Groups 3 and 4 meeting 
gether in another lecture hall. These groups were 
as follows: 
Nate 1: Self-report, interspersed acquisition test 
ten elects in this condition studied the informa- 
Teh ‘or 24 minutes. They were asked to record 
Casals and readings, At the end of 24 minutes, 
the. Were told to turn the page and to complete 
m panion test trial for which they were given 
fon utes: In all this group completed five acquisi- 
n study-test trials, 
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Group 2: No self-report, interspersed test trials. 
Subjects in this condition learned the material with- 
out indicating whether they had rehearsed or read 
an item. There were no blanks on the information 
page for that purpose. They were tested after each 
study trial as was Group 1. Both Groups 1 and 2 
performed the task simultaneously with different 
instructions. 

Group 3: Self-report, no interspersed test trials. 
Subjects in this condition learned the material but 
did not receive an acquisition test trial until they 
had completed five study trials. Like Group 1, they 
were asked to indicate rehearsals and readings, 

Group 4: No self-report, no interspersed test trials. 
Subjects in this condition learned the material for 
five study trials, but were neither asked to report 
rehearsals and readings nor were they tested after 
each study trial, Groups 3 and 4 performed the task 
simultaneously with different instructions. 

All four groups were brought together 48 hours 
later for retention and generalization tests during the 
regular class period, The instructor told the class 
that this was a “pop quiz” to determine how much 
of the information they had learned. The retention 
and generalization tests were administered together 
in the pop quiz booklet. Five minutes were allotted 
for the completion of both tests. 


RESULTS 
Acquisition 
A three-way analysis of variance was per- 
formed, considering test anxiety, report, and 
test trials as between-subject variables. In 


order to keep the amount of time each sub- 
ject was exposed to the material constant 


10 


ob Aà Test Trial After Each Study Trial 
O——0o Single Test Trial At Completion 


Of 5 Study Trials 
o 


Low High 
TEST ANXIETY 


Fic. 1. Mean number of information items correct 
for high- and low-anxiety groups under interspersed 
and single-test-trial conditions, 


NUMBER OF INFORMATION CORRECT 
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(124 minutes), the dependent variable for 
this analysis was the number of items correct 
on Trial 3 for the group that had five inter- 
spersed test trials, and the number correct 
on the single aquisition test trial for the 
one-test-trial groups. 

The only variable in acquisition involving 
test anxiety to reach statistical significance 
was the Test Anxiety X Test Trials interac- 
tion (F = 4.37, p < .05). As may be noted 
in Figure 1, the interaction appears to have 
resulted from differences between test-anxiety 
groups in the five-test-trial condition (8.50 
versus 5.93; t= 2.59, p < 02). High- and 
low-anxious groups in the single-test-trial con- 
dition had very similar scores (3.93 versus 
4.00, ns). 

There was a significant main effect due to 
test trials (F = 25.07, p < .001). The per- 
formance of the five-test-trial groups on 
Trial 3 was superior to that of the one-test- 
trial groups on the single acquisition test trial 
with the amount of exposure to the material 
being equal (X — 7.23 versus 3.66). For the 
test-trials groups, test anxiety correlated 
—.23 with performance on Trial 3, while for 
the no-test-trials groups, test anxiety cor- 
related .15 with performance on the single 
acquisition test trial. The performance of the 
groups that were asked to report rehearsals 
and readings during the acquisition study 
trials did not differ significantly from those 
groups that did not report (X — 5.79 versus 
5.26). 


Retention and Generalization 


In order to determine whether effects of 
the test anxiety interfered with retention and 
generalization over and above that observed 
in acquisition, two analyses of covariance 
were performed. Considering the subject's 
score on the last trial in acquisition as the 
covariate and the retention and generaliza- 
tion scores as the variates, significant main 
effects for test anxiety were obtained (m 
3.91, p< .04; F= 10.26, p< 01, respec- 
tively). For the test-trials group, test anxiety 
correlated —.23 with retention and —.18 with 
generalization performance, while for the no- 
test-trials group, test anxiety correlated —.10 
with retention and —,23 with generalization 
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TABLE 1 
Mean NUMBER CORRECT AND STANDARD Í ATIONS 
FOR HiGH- AND Low-TEsT-ANXIOUS SUF 5 ON 


THE ACQUISITION, RETENTION, ^ 
GENERALIZATION TESTS 


Condition | Acquisition | Retention Y 

Low anxiety 

X 7.87 8.13 73 

SD 4.67 444 62 
High anxiety 

2 6.69 6.10 04 

SD | 3.99 3.99 03 

| 

Note.—Acquisition data from last test trial test- 
trials group and the single test trial for the -trials 
group. 
performance. On both the retention a» the 
generalization tests, the low-anxious subjects 


(X = 7.63 and 6.62, respectively) were supe 
rior to the high-anxious subjects (X 
and 4.36, respectively), with the effects of 
acquisition statistically controlled. Table 1 


presents the unadjusted means and sis «dard 
deviations of the number correct for high- 
and low-anxious subjects on the acq: “ition, 


retention, and generalization tests. 

The above results suggest that the effects 
of test anxiety on generalization were greater 
than those on retention. In order to 
determine whether this did occur, another 
covariance analysis was performed, using 
generalization scores and controlling for 
retention. There was a significant anxiety 
main effect, with low-anxious subjects (X = 
6.09) being superior to the high-anxious 
subjects (X = 4.70; F = 6.70, p < 04)? 


Effects of Reading and Rehearsal Reports 


The self-report information was divided 
into three scores. The rehearsal score was 
composed of the number of times a subject 
reported that he rehearsed the material. The 
read score was composed of the number of 
information items that a subject read once 
on any given acquisition trial. The reread 
Score was the number of readings aíter the 
first on any trial. 

The first part of Table 2 presents correla- - 
tions between the rehearsal, read, and reread 


* These are adjusted treatment means with the 
effect of the covariance on the variate statistically 
removed by a covariance analysis, 
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TABLE 2 
CORRELATIGNS OF NUMBER CORRECT BETWEEN PERFORMANCE AND 
EXPERIMENTAL VARIABLES AND CRITERION SCORES 
— : : 
; Ec. Acquisiti t | 
1 E E : (ast trial). | Retention | Generalization piece 
Rehears | | 
5 test T) .40* | 26 IX su | 10.6 
1 test r) 17 —.16 | .03 | 16.1 
Read | | 
5 test T) 24 42 44 DEC: 
1 test t rT) 31 .39* | 67** 51.2 
Reread | | : ve 
5 test FT) 37* A | 38* 11.2 
(dtest TT) 42 .02 | 06 144 
Semester point average* | 
5 test ti rT) 43** .36** 40" 1.64 
1 test i NTT) | 38** 30" ao 1.71 
Introduc hology | 
Examin: t res | 
5 test trials (TT) Ale Ei ,96** 167.8 
1 test NTT) .40** 36** 45** 169.5 
Note.—T1 t 1 o test trials. 
PER z i 
**5 <01 


scores for t 
anxiety, and | 
generalizatio: 
conditions. T 
readings, anc 


ubjects across all levels of 
acquisition, retention, and 
ores according to test-trial 
mean number of rehearsals, 
readings for each subject in 
the test trial: and no-test-trials conditions is 
also presente? in Table 2. In the test-trials 
condition, the correlations between rehearsals 
and reread and acquisition, retention, 
and generalization scores were stronger, 
Whereas in the no-test-trials condition the 
Correlations between readings and acquisition, 
retention, and generalization scores were 
Stronger, Similarly, from Table 3 it may be 
noted that in every case the correlations 


TABLE 3 


Co TION 
MESE eEONG BETWEEN NUMBER OF REHEARSALS, 
FADINGS, AND REREADINGS FOR TEST-TRIAL 
(TT) anp No-Test TRIAL (NTT) 


=e CONDITIONS p 
" ati Si m 
Source score and condition poet pow 

average average 

Rehearsal 

i test trials (TT) 27 .29 
Rene trial (NTT) 10 05 
5 test trials (TT) 13 16 
trial (NTT —. 
trading (NTT) | 09 07 
€st trials (TT) ET 29 
Hs trial (NTT) a [ 


between rehearsals, readings, and rereadings, 
and both semester and cumulative grade point 
averages for the self-report groups were higher 
in the test-trials condition than in the no- 
test-trials condition. 


Relationship of Task Variables to External 
Criteria of Learning 


To ascertain whether some of the same 
variables accounted for performance on the 
experimental and classroom work, the acqui- 
sition, retention, and generalization scores 
were correlated with the semester grade point 
average and total correct on all examinations 
in the psychology class in which the experi- 
ment was performed.* Product-moment cor- 
relations were computed separately for the 
test-trials and no-test-trials groups because 
of the significant differences obtained for this 
variable. For both the test-trials and no-test- 
trials groups, performance on the experi- 
mental tasks related significantly to actual 
classroom performance. Correlations for the 
test-trials groups were stronger than those for 
the no-test-trials groups. These correlations 
used subjects across all test anxiety levels 
and are presented in the second part of 

4Semester grades were computed to two decimal 
places on a 3-point scale; the psychology grade con- 
sisted of four quizzes and a final examination (300 
multiple-choice question total), 
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Table 2. Furthermore, correlations were per- 
formed in order to ascertain the relationship 
between the test anxiety measure employed 
in the current study and semester grade point 
averages and introductory psychology scores. 
For the test-trials group, the corre'ations with 
subjects from all test-anxiety groups were as 
follows: test anxiety and semester grade point 
average, r= .06, ms; test anxiety and intro- 
ductory psychology scores, r = .03, ns. For 
the no-test-trials group, the correlations with 
subjects from all test anxiety groups were as 
follows: test anxiety and semester grade point 
average, r= .11, ms; test anxiety and intro- 
ductory psychology scores, r = .15, ns. There- 
fore, the test-anxiety measure was not sig- 
nificantly related to either of the two external 
criteria (semester grade point average and 
introductory psychology scores). 


Discussion 
Acquisition 


The Anxiety x Test Trial Conditions inter- 
action suggests a rationale which may ac- 
count for differences in results found in pre- 
vious studies. Most previous studies using 
paired-associate material, or problem-solving 
tasks, administered the materials in such a 
manner that the subjects were continually 
tested during acquisition, and thus there was 
a continuous flow of feedback to a subject 
concerning his performance. In the present 
study, groups that obtained feedback by 
means of interspersed test trials differed ac- 
cording to anxiety level. Low- and high- 
anxious subjects in the groups without inter- 
spersed test trials did not differ significantly 
in their acquisition performance. Thus, con- 
tinuous testing in acquisition may be the 
variable which elicits the task-irrelevant re- 
Sponses suggested by Mandler and Sarason 
(1952). That is, little or no test anxiety may 
be elicited in high-anxious subjects until they 
become aware of how they are to be tested 
and which items they have failed to learn. 
This is always consistent with Atkinson and 
Litwin’s (1960) formulation that high test- 
anxiety scores represent a disposition to be- 
come anxious about failure under achieve- 
ment stress. Using the same experimental task 
and continual testing during acquisition em- 
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ployed in the present study, Snyder an 
Katahn (1970b) found that the 

sition to react with test anxiety (a 

by the modified STAI) predicted 


testing (for the test-trials group:) evoked 
differential levels of ongoing state anxiety in 
persons who differed in the trait of ‘ost anx- 
iety. Therefore, the differential els of 
ongoing state anxiety, as predict y the 
trait of test anxiety measured by th ;dified 
STAI, were probably responsible in part for 
performance deficits in acquisition, vell as 
the retention and generalization períormance 
discussed below. 


Retention and Generalization 
main 
vriginal 
odds 
which 
of psy- 
aired- 


The findings of significant anxie 
effects in retention, with level o! 
learning and acquisition controlled, is : 
with earlier studies in this laborator: 
used meaningful material (definitions 
chological terms) presented in a 
associate format (Pagano, 1968; Pagano & 
Katahn, 1967). Both of these previous studies 
support Underwood’s (1964) contention 
retention effects can be accounted for 
marily by level of original learning." 
present study, however, does replicate the 
findings of Branham and Katahn (1969), in 
which the identical Hullian concept-formation 


task was used. Branham and Katahn found à 
significant anxiety main effect in retention 
controlling for level of acquisition, The con- 
sistency of Branham and Katahn's results 
with the present study suggests that some 
factor exists in this task which produces an 
anxiety effect in the recall of the material. 

Additional support for the hypothesis that 
anxiety can affect not only the learning but 


5Both Pagano and Katahn (1967) and Pagano 
(1968) used Underwood's (1964) single-entry pro- 
jection technique to control for original learning: 
while covariance was the technique used in the cur- 
rent study. Since the present results were consistent 
with those of Katahn and Branham, who used the 
Underwood technique, and since Pagano comnare 
the two techniques and concluded that the Under- 
wood technique did not offer any added precision. 
the use of covariance in the present study does not 
appear to be a critical factor. 


id 

t dispo- 

asured 

lifferential 

levels of ongoing state anxiety. Although on- 
going state anxiety was not meas in the 
current study, it is likely that the inuous 


| 


iso the re »f complex verbal material 
demons: by the analysis of covari- 
hee perform. : the generalization scores. 
significant main effect for anxiety was 
ined both when original learning in 


acquisition, | the level of retention, 
e control] two separate analyses of 
Generalizatio ores. 
The abo sults suggest that anxiety 
ak sparate and independent contribu- 
ınce in three conditions: 
i learning of the complex 
(b) the verbatim recall of that 
in the utilization of that 
ralized test questions. Theo- 
pears to be completely con- 
landler and Sarason’s (1952) 
ormulations. That is, when 
z occurred in acquisition, the 
low-anxious subjects was 
creater extent than that of 
subjects. When test-like feed- 
present in acquisition, there 
Were no significant anxiety differences. Simi- 
larly, Mandler and Sarason’s formulations 
for the retention and generaliza- 
which were presented as a 
"pop quiz. Anxiety appeared to make an inde- 
Pendent contribution to both measures to an 
extent which cannot be explained by the level 
of original learning. 

There are at least three interrelated reasons 
for the nonsignificant correlations obtained 
between the test-anxiety measure employed 
in the current study and the semester grade 
"Point averages and introductory psychology 
test scores. First, Vanderbilt students have 
Probably compensated for their high test anx- 
dety by means of superior study habits; 
‘Second, high aptitude tends to obscure anx- 
lety effects in situations which permit advance 
- Preparation and overlearning of the material 
- (Katahn, 1966); and, third, Vanderbilt stu- 
- dents typically know in advance the nature 
of the material, the setting, and the extended, 
- reasonably flexible time limits for their class- 


En analysis of variance with the SAT scores 

E B high- and low-anxious groups demonstrated 

i B ennt main eífects for anxiety. This rules 
— subj the rival hypothesis that the low-anxious 

5 Cid superior performance was a function of 
elligence alone. 


material in 
Tetically, thi 
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room tests, which in itselí may be anxiety 
reducing. In the experimental setting, how- 
ever, higher test anxiety may be achieved 
because the students are unfamiliar with the 
test materials, time limits, and setting, and 
thus could prevent complete learning. Over- 
all, then, the high aptitude, compensating 
study habits, and familiarity with the class- 
room test-taking situation militate against the 
debilitating test-anxiety effects in the stan- 
dard approach to the classroom testing of 
Vanderbilt students. 


Selj-Report Data 

Higher correlations were found between 
rehearsals and rereadings, and acquisition, 
retention, and generalization scores in the 
test-trials condition than in the no-test-trials 
condition. On the other hand, there were 
higher correlations between the number of 
readings and acquisition, retention, and gen- 
eralization scores for the no-test-trials sub- 
jects. A possib!e rationale for this is that 
subjects in the test-trials condition, knowing 
the nature of the test questions, would devote 
most of their time to the rehearsing and re- 
reading of unlearned items. It would not be 
advantageous for this group to read each 
information item on every study trial. The 
no-test-trials condition did not offer any feed- 
back as to the nature of the test items, and 
thus the best strategy was to read as many 
items as possible on each study trial. This 
reasoning would lead one to expect higher 
correlations of the read score with acquisition, 
retention, and generalization scores for the 
no-test-trials condition, and higher correla- 
tions between rehearsal and rereading scores, 
and acquisition, retention, and generalization 
scores in the test-trials condition. 


Relationship of This Task to Academic 
Achievement 

It has been argued elsewhere (Katahn & 
Branham, 1968; Pagano & Katahn, 1967) 
that an important prerequisite for generaliz- 
ing inferences from an experimental task to 
other learning situations is that performance 
in the experimental situation should have 
reasonable correlations with performance in 
the area of generalization. This prerequisite 
is supported in the present study by the 
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findings that similar correlations have been 
obtained between academic achievement and 
scholastic aptitude tests (Branham & Katahn, 
1969). 

It is interesting to note from the second 
part of Table 2 that the relationship between 
performance on the present task and academic 
achievement is strongest when test trials 
are included in acquisition. Similarly, from 
Table 3, it may be noted that for the groups 
which reported rehearsals, readings, and re- 
readings, there were higher correlations be- 
tween these measures and academic per- 
formance when test trials were included in 
acquisition. That is, subjects who have higher 
grade point averages report more readings, 
rereadings, and rehearsals when test trials are 
included in acquisition. This suggests that 
test trials elicit some of the same motivational 
factors and, perhaps, some of the same 
learning skills as are required in academic 
performance. 

The above results, combined with the find- 
ing that the inclusion of certain self-report 
measures does not affect performance, suggest 
two potentially profitable lines for future 
research examining the processes and variables 
involved in the learning and generalization of 
complex material. The first would be to de- 
velop additional self-report measures which 
do not affect performance, and the second 
would be to apply these both within the 
laboratory and outside, for example, in 
academic learning situations. 


‘Correlations ranged from 35 to 45, 
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ause of another’s behavior 

: himself (internal locus) or 
ces in which he is behaving 
as consequences for impres- 
vhich have been carefully 
cider (1958). In their extension 
:nd Davis (1965) also empha- 

le^ which Kelley (1967) has shown 

to be coordinated with the internal-external 
dimension. The major theme in the Jones and 
Davis analysis is that the stronger the external 
Pressures impinging upon an actor to behave 
ina given way, the less information his be- 
havior provides for an observer. The extent 
to which an actors behavior is seen as "free" 
from external pressures, such as high social 
desirability, appropriate role demands, or a 
high prior probability of enactment, the more 
Information the observer would have about 
the actor’s true disposition, Bem’s (1967) 
analysis of dissonance effects extends this view 
to the impressions an actor forms of himself. 
The actor is said to use his own behavior for 
inferring his attitudes in the same way an 
Outside observer would make such inferences. 
The freer the actor feels from such external 
Constraints as high reward or forced com- 
Dliance, the more his behavior represents his 
true dispositions. This view seems particularly 
Cogent when internal instigations, events 


Ascribing 
either to the 
to the circu: 
(external lox 
sion formatio 
described by 
of Heider, Jor 
sized variable 
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FFECTS OF BEING KIND OR HARSH TO 
ANOTHER ON LIKING? 


JOHN SCHOPLER ? anp JOHN S. COMPERE 
University of North Carolina 


pothesis that being kind to someone, in comparison to being harsh, would 
the self-perception of greater liking for the target of kindness was tested. 
four male subjects, serving in the role of an experimenter, administered a 
ne task to two accomplices. Through random designation, each subject acted 
toward one accomplice and harshly toward the other. Subjects then rated 
‘traction for each accomplice. The hypothesis was confirmed. Additional 
res suggest that the effect was not mediated by distorting actual task success 
ith the evaluative direction of the behavior or by the ease of enacting the 
ts. Order of presentation showed no significant effects, but one of the ac- 
ces was significantly better liked than the other. 


known only to the actor, for performing the 
behavior are weak. 

The present research was undertaken to test 
the hypothesis that an actor would use his 
own behavior toward another as the basis for 
inferring his attraction toward that other. It 
was specifically hypothesized that an individual 
would like another person more if he thought 
that he were being kind to him than if he 
thought he were being harsh to him. The 
test of the hypothesis occurred in a setting 
where internal causes for the behavior were 
held constant at a moderate level and were not 
manipulated. 

The general hypothesis has received some 
support from three previous studies which 
were derived from the dissonance view of 
counterattitudinal advocacy. Davis and Jones 
(1960) have shown that subjects induced to 
read very derogatory statements to a target 
person will change their ratings toward less 
attraction if they have had high choice and 
there is no possibility of future interaction. 
Glass (1964) demonstrated that under condi- 
tions of high choice and high self-esteem, sub- 
jects who were opposed to the use of electric 
shock in experiments, but who nonetheless ad- 
ministered shock to another subject, devalued 
the shocked person. Finally, Jecker and Landy 
(1969) induced subjects to return part of their 
experimental winnings to a mildly unattrac- 
tive experimenter or to the psychology 
secretary. Although the manipulation of 
amounts returned showed no effect, subjects 
who returned winnings to the experimenter 
liked him better. As the authors noted, the 
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meaning of this result is not clear because the 
experimenter made the request only in the 
condition where winnings were returned to 
him. This may have made him seem more 
human, thus his attractiveness. 
While these studies might be viewed in terms 
of self-attributions, they clearly lend only 
qualified support to our hypothesis. In addi- 
tion, their tie to dissonance theory required 
inducing subjects to behave in ways that were 
maximally discrepant from their private atti- 
tudes, for example, administration of shock by 
individuals opposed to the use of shock or 
doing a favor for a mildly unattractive experi- 
menter. Our theoretical view, on the other 
hand, permitted the use of a more moderate 
set of behaviors which were likely to be repre- 
sentative of those contained in any subject's 
repertoire. 
METHOD 


Subjects 


The specific procedure can best be understood first 
describing the general format. eterna des 
odes 3s experimenters for a learning task in which two 

subjects, actually accomplices, were to be given 
either kind or harsh verbal feedback for their perfor. 
mance on a cube-tappi 


pany the experimenter to the experimental room. It was 
that one of the peop! 
needed to perform a different task. It was pointed out 

“signed up" on a different sheet 
than the one used for the other two "subjects" because 
his task would. require less time. In addition, the im- 
portance of minimizing the likelihood that he would 
know either of the other two was emphasized. (None 
of the subjects knew the accomplices.) The subject 
was then told that the experimenter was interested in 
studying the effects of positive and negative verbal 
reinforcement when the source of the reinforcement was 
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à peer. He was, therefore, being asked 
to each of the other “subject 
them, he was to give positive feedba: 
compliment him when he gets a trial cor 
other one, he was to give negative feed! 
ing when he gets a trial wrong.” In ord 
sense of choice for the subjects, as we 
out anyone for whom the task would for 
aversive or unnatural, the subject was : 
he would agree to serve in this role. Or 
declined to continue with the experiment 
The subject was given an instruction s} 
the task. He was to tap out patterns on a 
blocks which the other "subject" was t 
mecessary material was in front of the 
table. It consisted of four 3-inch wooden cu i 
red, which were attached at equal intervals «cross the 
top surface of a piece of wood, which was 14 inches 


long and painted black. Two 8-inch sticks for tapping 
out patterns were placed on the table. The instruction 
sheet also contained the patterns for two pr trials 


and a final set of 10 patterns which constituted the 
learning task. The final patterns contained seven-digit 
sequences and were therefore fairly dificult 


After the subject had mastered the inst: ns, it 
was again indicated that one of the accompli: = would 
be assigned to the positive condition and to the 
negative condition at random. He was asked 1o be as 
convincing as possible in complimenting correct re- 


sponses in the kind condition and in criticiziny mistakes 
in the harsh condition. Some possible phrase: “or each 


condition were suggested, but he was t^! to do 
“anything which comes natural to you which will make 
him feel positive (negative) about this experience.” 
He was told that this was of special importance because 


the experimenter was predicting that “a negative ex 
perience with these blocks will tend to lower self- 
confidence and that a positive experience wil! tend to 
raise it” on a personality questionnaire to be given to the 
accomplices immediately following the reinforcement 
session. The tenor of the instructions was intended to 
induce the subject to view his behavior as helping or 
hurting the accomplices. To eliminate the possibility 
that the subject would plan to subsequently “take 
back" the behavior, he was told that he would have no 
opportunity for further interaction with the accomplices 
following the task because the experiment would not 
be completed for 3 weeks, At that time, the accomplices 
would return and be assessed for any lasting effects of 
the reinforcement experience on self-confidence. 

The subject was then given a chance to practice the 
task. After the first accomplice was brought into the 
room, the experimenter observed through a one-way 
mirror, along with the accomplice not being tested- 
This permitted two observers to make ratings of how 
convincing the subject was in his performance. 


Independent Variatles 


The first variable was type of reinforcement. The 
opportunities for being kind or harsh were equated 
use each accomplice memorized a prearrani 
Sequence of responses in which 5 of the 10 responses 


TABLE 1 
ACH ACCOMPLICE IN Eacu Re 
NDITION IN Eacu ORDER 


h order Harsh-kind order 
-—— | 
Harsh | Kind Harsh 
reinforce- | reinforce- | reinforce- 
ment | ment | ment 
73.0 | 752 | 633 
79.2 | 80.8 83.0 


der of the reinforcement was a 
Í the subjects gave kind rein- 
half gave it second. The third 
lices; each accomplice was alter- 
inforcement and order condition. 
hese factors produced a 2 X 2 X 2 
Lindquist (1953) Type IV design. 
re paid for participation in the 
ed to remain blind to the experi- 
nii! after the study was completed.* 
d to remain neutral to the sub- 
little as would be natural in the 


tes 


each subject had finished administering the 
he second time, the experimenter reentered the 
: subject to complete a rating scale 
in the order he had seen them. 
the subjects rate both accomplices 
ture might have served to weaken 
. It was deemed important enough, 
risk in order to present the ratings 
as a side issue of interest to the experimenter. 
Tating sheet contained 15 bipolar items. When 
ject had completed both ratings, the experi- 
» who had presumably taken the accomplices 
next task, gave each subject a questionnaire 
hing 3 items. These items asked the subject to 
how convincing he felt he had been for each rein- 
ent condition and to rate how much freedom he 
in choosing to administer the task. Each subject 
also requested to estimate the number of correct 
attained by each accomplice. An additional 
included for the last 13 subjects which asked 
to rate how much interest each accomplice had 
experiment. 


RESULTS 


ch response to one of the 15 bipolar items 
cored from 0 to 8. Higher scores indicate 
attraction. The scores for the 15 items 
‘Summed into a rating score for each 


ment for the accomplices was primarily pro- 
by a grant from the Smith Fund to the second 
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TABLE 2 
. SuMMARY TABLE FOR ANALYSIS OF VARIANCK 
! 
Source 4f l MS r 
Between Ss E | 
Order | 1 | 3% | 220 
Reinforcement X | 
Accomplice 1U4 498 59 
Reinforcement X | | 
Accomplice X Order 1 | 21 1.16 
Error (between) m 172.6 | 
Within Ss n 
Reinforcement | | 705 in 
Accomplice 1 [120 |845% 
Reinforcement X ] 
Accomplice X Orde: i % r1 
lice du y 
Error (within) | 20 1412 
Total «4 | 
* p «05. e ory sue FB Md 
“pco 


accomplice. Table 1 presents the means for 
these scores for each accomplice in each rein- 
forcement condition, in each of the orders used. 
The summary table for the analysis of variance 
is displayed in Table 2. 

The analysis revealed a significant main 
effect for reinforcement (F = 4.99, df = 1/20, 
p < 05)* The mean rating for the kind re- 
inforcement condition was 82.3, compared to 
a mean of 74.4 for the harsh condition. This 
difference is in the predicted direction. Another 
way of viewing these results is to note that of 
the 24 subjects, 17 gave higher ratings to the 
accomplice under kind reinforcement than to 
the one under harsh reinforcement (p < .07).° 

The other significant effect revealed by the 
analysis was a main effect for accomplices 
(F = 845, df = 1/20, p < 01). The mean 
rating for Accomplice 1 was 73.5, compared to a 
mean rating of 83.5 for Accomplice 2. This was 
obviously not predicted. The greater attrac- 
tiveness of Accomplice 2 seems to be related 
to the incidences of reversals in the comparison 
of each subject’s two ratings. Of the seven 
reversals noted above, five occurred when 
Accomplice 1 was given kind reinforcement 
and Accomplice 2, harsh. 


* All probability levels reported are based on two- 
tailed tests of significance. 

*'The exact probability of 17 or more subjects, out 
of 24, responding in the predicted direction, for Hy = .5, 
is .032 
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Accomplice 2 


Accomplice 1 


Positive Negative 


Fic. 1. Mean attractiveness ratings for each accomplice 
in each reinforcement condition, 


The reinforcement main effect and accomp- 
lice main effect are shown in Figure 1. It can 
be seen that the reinforcement effect appeared 
to work with both accomplices, although it 
tended to be considerably weaker for Accomp- 
lice 2 than for Accomplice 1. The difference 
was hot sufficient, however, to produce a 
significant interaction effect. Neither the main 
effect for order nor any of the interactions 
were significant. 

The postexperimental question asking how 
convincing the subjects felt they had been in 
administering each type of reinforcement was 
not related to any of the conditions. The 
average effectiveness ratings showed no signifi- 
cant differences for either type of reinforce- 
ment or accomplice. The observers’ ratings 
of the subjects’ effectiveness, unfortunately, 
showed too little interrater agreement to 
warrant any analysis. Finally, the assessment 
of whether the reinforcement effects might be 
mediated by a distortion of the positively rein- 
forced accomplice’s performance was not sup- 
ported. The mean estimates of number of 
correct trials for kind and harsh reinforcement 

were 5.1 and 4.8, respectively. These estimates 
approximated the actual number of correct 
trials and were not significantly different, 


DISCUSSION 


The major hypothesis of this stuc is con 
firmed. How one behaves toward a r will 
affect how much attraction will be fi toward 
that other. The subjects’ behavior: ied to 
provide the information cues th ected 
their feelings of attraction. This + was 
not mediated by distorting actual ta ccess 
in line with the evaluative directi f the 
behavior or by the ease of enacting kind 
acts. 

How one behaves toward anothe: yt the 
only determinant of liking, as was s! own by 
the main effect for accomplices. The differ- 
ential liking for the accomplices may be .imilar 
to a person's attitude prior to engaging in new 
behaviors, While disagreements exis about . 
the extent to which a subject’s prior «‘titude 
is salient in any particular experimen’ | con- 
text, (see, e.g, Bem & McConne! 1970; 
Jones, Linder, Kiesler, Zanna, & Breh . (968), 
it is clear that the effect of any new b« :avior 
will be lessened to the extent that ths prior 
relevant attitude is salient to the acto: if one 
assumes the higher attraction rats of 


Accomplice 2, in the present study, «re in- 


dicative of greater salience of the subjects’ 
prior evaluative attitudes, then it would follow 
that smaller effects would obtain, because the 
information provided by the behavior ^ad to 


counteract stronger initial impressions f med 
of Accomplice 2. It was because the experi- 
menter noticed apparent differences in the 
accomplices’ behavior in the waiting room, on 
à dimension akin to interest in the experiment, 
that a rating of this dimension was added for 
the last 13 subjects. Although there is no way 
to demonstrate conclusively that this initial 
interaction was the basis for the different 
attractiveness ratings, it is in keeping with such 
a view that 12 of the 13 subjects rated Accomp- 
lice 2 as having more interest. 

Because of the nature of the design, it is not 
possible to specify whether the effect depends 
upon kind behaviors enhancing attractiveness 
or harsh behaviors lowering attractiveness: 
We were, however, able to demonstrate differ- 
ences between the two. Although this demon- 
Stration was clear, the next step is to test the 
effects of each type of behavior against a con- 
trol condition, 
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TRIVIAL INCENTIVES, MARGINAL COMPREHENSION 


AND DUBIOUS GENERALIZATIONS FROM 
PRISONER'S DILEMMA STUDIES! 


ROBERT E. KNOX? ann RONALD L. DOUGLAS 


University of British Columbia 


Several Prisoner's Dilemma (PD) studies have now shown that incentive magnitude 
has little effect upon average cooperation levels. This finding has prompted some 
investigators to conclude that the typical use of trivial payoffs does not jeopardize 
the meaning of previous PD findings. The present experiment was undertaken to 
examine the tenability of this conclusion. Eighty male undergraduates served as 
subjects in a 10-trial PD game. Two levels of incentive, pennies and dollars, were 
crossed with two levels of instruction, traditional and comprehensive. Average 
cooperation levels did not differ across the four experimental conditions, High 
incentives, however, produced significantly greater interdyad variance than low 
incentives, The difference in variance between instruction conditions was not 
significant. Reliability coefficients for the task and measures of response latency 
Suggest that the increase in variance for high incentives was true variance, rather 
than error variance. Results suggest that generalization from PD studies using 


trivial incentives should be limited to behavior in parlor games. 


In the past decade, the Prisoner’s Dilemma 
(PD) and similar games have been widely 
used in the study of social relationships which 
involve conflicting interests or goals between 
two persons. A dominant characteristic of this 
research is the modest level of cooperative re- 
sponding displayed by subjects. Because the 
response combination traditionally labeled 
“mutual cooperation” can be viewed, ostens- 
ibly at least, as the reasonable solution to an 
iterated dilemma, these findings are of con- 
siderable interest. In this regard, both Gallo 
and McClintock (1965) and Messé and Sawyer 
(1966) have suggested that insufficient incen- 
tives and inadequate understanding of the 
implications of their choices may account for 
the self-defeating behavior of subjects in the 
PD game. These authors maintain that the 
small payoffs at stake in most PD studies 
may have no real value for the subjects, and, in 
fact, may even stimulate them to invent a new 
game, in which the object is to maximize the 
difference between own and other's payoffs. 

1 This study was supported by grants from the Com- 
mittee on Research and the Institute of Industrial 
Relations at the University of British Columbia. A 
paper based upon a portion of the results of this study 


was presented at the Western Psychological Association 
convention, San Diego, March 1968, 

? Requests for reprints should be sent to Robert E. 
Knox, Department of Psychology, University of 
British Columbia, Vancouver 8, British Columbia, 
Canada. 
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In addition, it is not certain that a subject 
in the PD game understands just how his pay- 
off depends upon his behavior and the behavior 
of the other person in the game. It is quite 
possible that the nature of this interdependence 
relationship is never fully grasped by some 
subjects, and may be understood by others 
only after actually playing a few trials of the 
game. In the latter instance, by the time a 
subject learns the implications of his choices, 
he may well be locked into a mutually competi- 
tive response pattern from which escape is- 
difficult (cf. Gallo & McClintock, 1965, p. 76). 
On the basis of such considerations, these 
authors contend that a level of cooperative 
responding higher than that traditionally ob- 
served in PD studies might be anticipated as 
a consequence of (a) increasing the size of 
monetary incentives and (b) providing instruc 
tions which facilitate a better understanding 
of the outcome contingencies. Assuming that 
conflict behavior can be meaningfully assessed. 
only when the subject cares about the possible 
outcomes of his behavior and understands the 
role which that behavior plays in determining | 
those outcomes, a finding of this nature, then, 
would raise serious questions concerning the — 
generalizability of previous PD findings. For 
this reason, the issues of incentive magnitude | 
and subject comprehension have been the 
focus of a fair amount of research interest. 


^ —9 
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In a numt f PD studies, subject compre- 
hension ha assessed using postsession 
questionnai interviews. Although Solo- 
mon (1960 ated that almost all of his 
subjects dí trated adequate comprehen- 
sion of the mechanics, Komorita (1965), 
Gallo (196¢ Kee, Knox, and Coughlan 
(1967) repo »nsiderable misunderstanding 
on the par the subjects concerning the 
implication cir choices. The results of two 
studies in v experimenters attempted to 
manipulate ct comprehension as an inde- 
pendent vai provide some equivocal sup- 
port for the thesis that cooperation will be 
facilitated nore extensive instructions. 
Using three |cvels of instructions, differing in 
the type of ;rmation and amount of pre- 
game pract given subjects, Wrightsman, 
Lucker, Bruininks, and O'Connor (1968) ob- 
served dif tial effects upon level of co- 
operation, but intermediate instructions, not 
the most extensive instructions, produced the 
most cooperation. Radlow (1965) reported 
more coopersiive responding with extensive 
as opposed to briefer instructions. However, 
his extensive instructions come close to sug- 
gesting a cooperative solution to the subjects. 

Incentiv gnitude has also been manipu- 
lated as an independent variable in several 
studies, Some of the findings appear to be con- 


sistent with the hypothesis that large monetary 
incentives and other meaningful payoffs will 
increase cooperation in game situations. For 
example, subjects who played a version of the 
Deutsch and Krauss (1960, 1962) trucking 
game for real money were observed to be con- 
siderably more cooperative than those sub- 
jects who played for imaginary money (Gallo, 
1966); and small but significant increments in 
Cooperative responding have been reported be- 
tween conditions which paid ié and 2¢ for 
€ach matrix point in a maximizing difference 
1961) (McClintock & McNeel, 1966a, 1966b, 

However, several investigators have drawn 
attention to certain structural aspects of the 
trucking and maximizing difference games 
which suggest that large incentives should in- 
Crease the salience of both own-gain and/or 
Joint-gain motives (Gumpert, Deutsch, & 
Epstein, 1969; McClintock & McNeel, 1966b; 
Oskamp & Kleinke, 1970). Since either one or 
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both of these motives can be served by the 
cooperative response, increasing the incentive 
magnitude can be expected to increase coopera- 
tion in these two games, By contrast, the 
motive structure of the PD is such that a co- 
operative response serves the joint maximizing 
motive, but not necessarily the motive to 
maximize own gain. 

To the extent that the motive to maximize 
own gain is made more salient by large in- 
centives, then, one would expect variations in 
incentive magnitude to affect behavior in the 
PD game in a manner different from behavior 
in the trucking and maximizing difference 
games. Specifically, large incentives should 
lead to greater cooperation in the latter games, 
but not necessarily in the PD game. With the 
exception of a report of small but significant 
increments in cooperative responding between 
imaginary- and real-money conditions (Rad- 
low, Weidner, & Hurst, 1968), researchers 
employing the PD and similar games have 
observed no facilitatory effects with large in- 
centives. Here, we refer to reports of no differ- 
ences in the average level of cooperative re- 
sponding between subjects who played a 
non-zero-sum game closely resembling the PD 
for imaginary money and those who played for 
real money (Willis & Joseph, 1959; Wrights- 
man, 1966); between subjects who played a 
PD game for imaginary money and those who 
played for highly valued examination points 
(Evans, 1964); and between subjects who 
played a PD game for pennies and those who 
played for dimes (Oskamp & Kleinke, 1970). 

Perhaps the most convincing evidence 
against the hypothesis that large monetary 
incentives will produce increased cooperation 
in a PD game comes from a recent study by 
Gumpert et al. (1969). These authors expressed 
concern about the validity of the commonly 
held assumption that subjects take the game 
seriously when confronted with low or imagi- 
nary monetary incentives. Specifically, they 
noted that previous PD research based upon 
this assumption would be seriously damaged if 
it were shown that low levels of cooperative re- 
sponding were merely an artifact of trivial pay- 
offs. In order to test this possibility, Gumpert 
et al. systematically varied incentive magni- 
tude in a 20-trial PD game. Not only did they 
fail to obtain differences in cooperative re- 
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sponding with widely differing incentive magni- 
tudes (pennies, nickles, dimes, and dollars), but 
they reported that their "real-dollars" condi- 
tion yielded significantly less cooperation than 
an “imaginary-dollars” condition. On the basis 
of these results, Gumpert et al. concluded that 
markedly increasing the payoff magnitude in 
a PD game does not lead to an increase in co- 
operation, and they expressed reassurance 
about the meaning of previous PD research 
utilizing small payoffs. Somewhat comparable 
results prompted Oskamp and Kleinke (1970) 
to issue a similar expression of confidence in 
the results of PD studies with low or imaginary 
rewards. 

These reassurances are unconvincing, how- 
ever, insofar as the authors’ conclusions are 
apparently based solely upon comparison 
among average levels of cooperation for the 
different payoff conditions. It is well known 
that some independent variables, while having 
no significant effect on central tendency, have 
large and psychologically meaningful effects 
upon variance. For example, stress has been 
found to influence the intersubject variance 
of certain performance scores, but not their 
means (Lazarus & Baker, 1956; Lazarus & 
Eriksen, 1952). Similarly, nontrivial payoffs 
may affect the intersubject variability in co- 
operation, but produce no demonstrable effect 
on the group averages. This possibility, about 
which other investigators have not expressed 
concern, also has potential implications for the 
generalizability of findings from the glut of PD 
studies that have utilized low or imaginary 
incentives. 

The present PD study was conducted to 
investigate the effects of two widely varying 
payoffs upon average cooperation levels and 
upon intersubject variability. In our high-pay- 
off condition, it was possible for a single pair 
of subjects to leave the laboratory after 40 
minutes with a $60 profit. Whatever meaning- 
ful effect these incentives were to have, how- 
ever, should be contingent upon subject com- 
prehension of the dilemma. Concern that PD 
instructions are frequently inadequate led to 
the inclusion of instruction as an independent 
variable with two levels, one clearly more 
comprehensive than traditional instructions, 
In addition to cooperative and competitive be- 
havior, the effects of the incentive and instruc- 
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tion variables upon response latency e also 
observed. 
METHOD 

Subjects were 80 male undergraduate v rs Te 
cruited from large introductory psychology . The 
experiment consisted of 10 consecutive tria he PD 
game, using the payoff matrix displayed i ire L 
A simple 2 X 2 layout incorporated 2 lev: incen- 
tives (low and high) and 2 levels of i -tions 
(traditional and comprehensive). In the lo ntive 
condition, numbers in the payoff matrix : nted 
pennies, and in the high-incentive condit: lars. 
The traditional instructions were adapted ír codel, 
Minas, Ratoosh, and Lipetz (1959) and Ra; t and 
Chammah (1965) to approximate the mod: vel of 
instructions employed in PD studies. Comp: -'.ensive 
instructions were our own creation, and in a series of 
pilot trials produced demonstrably superior compre- 
hension. These instructions were longer and more de- 
tailed than the traditional ones, and they strc--cd the 
importance of complete understanding by enco raging 
subjects to consider possible rationales for eac!: choice. 
Subjects in the comprehensive instruction c«-dition 
also completed an eight-item pregame quest -^naire. 
This questionnaire required them to review the payoff 
structure (e.g, “What is the most money (x other 
person could possibly earn in this situation? If he were 
to earn that much, how much would you ean '') and 
to anticipate the logic and social consequences c! alter- 
native strategies (e.g., “Assume that the other person 
understands the instructions, and he thinks that you 


are going to choose red. He then chooses red. What 
do you think his reasons might have been for choosing 
this way?”). 

Subjects assigned to the same experimental condition 
were scheduled in pairs and were seated on either side 
of an opaque screen. No communication was permitted 
between them, Each subject was provided a control 
panel which displayed the payoff matrix (Figure 1) and 
also the means for making his response, The “row” 
subject responded by moving a toggle switch up toward 
a yellow light (top row) or down toward a red light 
(bottom row). The “column” subject responded by 
moving his toggle switch left toward the yellow light 
(left column) or right toward the red light (right 


SUBJECT 2 


SUBJECT ] 


Fic. 1. Payoff matrix employed for all 
conditions in the study, 


' subjects had responded, an elec 
d lights within the cell deter- 
choices. 

r the traditional or the compre- 
hich described the PD matrix 
llar or penny payoffs. Following 
before commencing actual play, 
ditions were given a demonstra- 
during which the experimenter 
ice contingency and answered 
ay, the subjects with comprehen- 
rapleted the eight-item question- 
he subjects in the high-incentive 

-ed that they would, indeed, be 

nings. 
iè experimenter, play commenced 
0 trials. These trials were inter- 

ough to provide immediate cash 
nstruct the subjects to return the 
:els to the neutral position before 

as openly recorded on videotape 

s latencies were subsequently meas- 

+ elapsed between the return of the 
position after any Trial N and the 
m Trial N 4-1 constituted the 
latency. Finally, all subjects com- 


lined by the s 
Subjects rea 
Pensive instruc: 
lin terms of c 


instructi 
fire. In add 


At a signal 
and continue 


pleted a post imental questionnaire designed to 
assess task cor hension. 
RESULTS 
Over the 10 trial game, any pair of subjects 
might have made up to 20 cooperative choices 
- between them. However, as is customary in 


_ PD studies, verage number of cooperative 
responses fell far short of this maximum. The 
median number of cooperative responses was 
selected as the measure of central tendency 
most appropriate for these data because of the 
marked heterogeneity of variance across con- 
ditions and skewed distributions within con- 
‘ditions, The medians for all conditions are 
“Presented in Table 1. Median tests for main 
d effects confirmed what is apparent from inspec- 
tion: the difference between the two levels of 


TABLE 1 
Mepian NuMBER OF COOPERATIVE RESPONSES 
FOR Dyaps 


1 Instruction condition 


Incentive 
Condition a i e CE ER 
Traditional | Comprehensive Combined 

p—— 
Pennies 6.70 5.50 6.25 
pus 7.00 5.50 6.00 

ombined | 6.70 5.50 6.17 
—— = 
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TABLE 2 


INTERDYAD VARIANCE OF COOPERATIVE RESPONSES 


| 
| Instruction condition 
Incentive | 
condition | [T 
| Traditional | Comprehensive | Combined 
Pennies | 216 ua | 234 
Dollars 20.25 30.47 25.50 
Combined 12.53 22.85 17.89 


incentive and the difference between the two 
levels of instructions were not significant.* 

While neither incentive nor level of instruc- 
tion affected central tendency, differences in 
variance across conditions were striking. These 
variances, shown in Table 2, range from 2.16 
in the low-incentive-traditional-instruction 
condition to 30.47 in the high-incentive- 
comprehensive-instruction condition. Com- 
bined over the instruction conditions, the 
variance was 25.50 for the high incentive and 
7.34 for the low incentive. The F produced by 
the ratio of these two variances was significant 
at the .02 level (F= 3.47, df = 19/19). 
Combined over incentive, the variances for 
traditional and comprehensive instructions 
were 12.53 and 22.85, respectively. The F ratio 
for these two variances approached, but did 
not reach, conventional significance levels 
(F = 1,82, df = 19/19). 

In order to assess whether the observed in- 
crements in variance, particularly when in- 
centives were high, were due to greater true 
score variation or merely due to greater error 
variation in the distribution of cooperation 
scores, split-half reliability coefficients were 
computed across dyads in each incentive level 
and in each instruction level. True variance 
would contribute to higher coefficients and 
more reliable differences among dyads; error 
variance would not. Two split-half methods 
were employed: (1) the number of cooperative 
responses made by a dyad on the odd-num- 
bered trials was correlated with the number 
on the even-numbered trials; and (2) the 
number of cooperative responses made by a 
dyad on the first five trials was correlated with 
the number on the second five trials, These 


? An analysis of variance for the means also indicated 
no significant main effects, nor was the interaction 
significant. 
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(Pearson) coefficients are presented in Table 
3. In the high-incentive condition, reliability 
was respectably high (7 = .86 by Method 1 
and 81 by Method 2). However, when incen- 
tives were low, reliability was significantly 
lower (r= 51 by Method 1 and .16 by 
Method 2). Reliability for comprehensive in- 
structions (r = SO by Method 1 and .71 by 
Method 2) was slightly higher than for tradi- 
tional instructions (r « .72 by Method 1 and 
-50 by Method 2). Differences between instruc- 
tion conditions for coefficients calculated by 
the same method are not statistically 


significant. 

It is also interesting to contrast reliability 
estimates for the low-incentive-traditional- 
instruction condition, the group in this study 
that best approximates modal con- 
ditions in PD studies, with the high-incentive— 

i condition. These 
values are also shown in Table 3. It can be 
seen that low incentives together with tradi- 
tional instructions resulted in low, even nega- 
tive (!), reliability (r = — .06 by Method 1 
and — .35 by Method 2), while the high in- 
centive and comprehensive instructions pro- 
duce a reliable distribution of cooperation 
scores (r = .87 by Method 1 and .85 by 
Method 2). 

Average latencies over the last 
nine trials were obtained for each subject. The 
medians of these average response latencies 
are reported in Table 4. In the high-incentive 
condition, the subjects required, on the aver- 


TABLE 3 


Seur-Hawy RELIAmLITY COEFFICIENTS COMPUTED 
FOR COOPERATIVE RESPONSES Across Dyaps 

n a 

Odd trials | First 5 trials 
vs. 


va, 
second S trials 
(Method 2) 


even trials 
(Method 1) 


20 3 AG ** 
| 20 81 
Traditional 20 50 
Comprehensive 20 4n 


* Note.—Differences were evaluated by the method described 
in Hays, 1963, p. 532. Asterisks refer to 
difference between the two etic ta a Cd oe 
p «.05. 


p> <.01, 


p * 
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TABLE 4 
Meptax Larewcues (rw Secoxos) Avis Ovis 
run Last Nixe Trias ror Ixptrvipva: rcrs 
| Instruction condi: 
Incentive R2 jm ico 
condition 
puem Comprehensive ined 
Pennies 3 4.73 ső 
Dollars 6.39 8.73 2 
Combined 478 5.73 1 
age, 7.22 seconds to respond. In the ‘ow-in- 
centive condition, the subjects required n the 
average, only 4.56 seconds. A me test 
showed this difference to be significan’ at the 
.01 level (X* = 7.24, df = 1). Latency differ- 
were 
mificant (X 
inter- 
varia- 
iance, 
m of 
hat in vari- 
ability in cooperative responses accompanies 
increased latency across all four cells. Con- 
ditions that produced the shortest latency 
also produced the lowest variance; conditions 


that produced the longest latency (presumed 
to reflect greatest predecisional deliberation) 
produced the highest variance. 

The postexperimental questionnaires were 
evaluated under blind control conditions by @ 
person not previously associated with the 
study. This analysis revealed that while about 
15% did not seem to grasp the nature of the 
“dilemma,” nearly all subjects had sufficient 
understanding to cope meaningfully with the 
task. Differences in judged understanding be- 
tween groups having traditional or comprehen- 
sive instructions were in the predicted direc- 
tion, but were not significant. Failure of the 
instruction variable to produce statistically 
significant differences on the dependent mea- 
sures is consistent with the apparent weakness 
of this manipulation. 


DISCUSSION 


Our results, like the results of Gumpert et al. 
(1969), indicate that marked increases in pay- 
off magnitude in the PD game do not lead to 


© cooperation. However, the 
ces in variance obtained here 
concerns expressed by Gallo 
t the possible limitations of 
ure justified. Subjects do 
in the PD game when moti- 
ful incentives; some become 
and others less cooperative, 
nces among subjects have 
reliable. On the other hand, 
e low, subjects behave fairly 
be PD just as players do in 
lonopoly—they rather uni- 


f these findings, we see little 
accepting the PD in the 
igful incentives as an analogue 
erpersonal conflicts. The PD 
w major conflicts such as labor- 
gaining and international 
ms inappropriate—indeed ludi- 
tors employing this paradigm 
ives, particularly if precautions 
ensure understanding, should 
e imitations with respect to infer- 
€nces about individual differences and limit 
generalization to parlor games for which the 
PD may have some validity as an analogue. 
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ABILITY ATTRIBUTION AND IMPRESSION FORMAT! 


IN A STRATEGIC GAME: 
A LIMITING CASE OF THE PRIMACY EFFECT! 


ROBERT G. JONES? 


University of Virginia 


AND JAMES B. WELSH 


Harvard University 


Using closed-circuit television recordings of two students playing a game, an ex- 
periment was performed to test the hypothesis that ability attribution under con- 
ditions of strategic play would be governed by a recency effect. The hypothesis 
stands in contrast to most previous research which has obtained a primacy effect. 
In general, the hypothesis was confirmed, although an anomaly in the recalled trend 
of the game suggests that primacy may have been more involved than anticipated. 
An explanation of the results is offered in terms of a model of inverse probability 
affecting assimilation and contrast processes. In addition, it was found as predicted 


that the performer's style of play was favorably or unfavorably, interpreted as a 
Satan 


function of the apparent success of his behavior, 


Research in person perception has recently 
been stimulated by the theoretical examina- 
tions of Jones and Davis (1965) and Kelley 
(1967) of the processes and conditions under 
which attribution of enduring personal charac- 
teristics are made. While Jones and Davis 
focused on the conditions under which *'cor- 
respondent inferences” are made, that is, 
judgments of manifest behavior which are 
held to be more informative of enduring dis- 
positions and intentions, Kelley has pursued 
an analogy from the analysis of variance to 
propose a program by which attributions are 
established. Another significant examination 
of the process of person perception is given by 
deCharms (1968), who particularly emphasized 
the uniqueness of person attribution from ob- 
ject perception. Following the lead of Heider 
(1958), he stressed the essential inconstancy 
of the person over time which introduces a 
new order of psychological processes to be 
identified. 

At least in American culture, one of the 
critical dimensions of judgment about persons 


1 This research was supported by National Founda- 
tion Grant GS-1802 awarded to the first author. The 
authors are indebted to Joseph Curran and Carl 
Fredonis, who ably performed the roles of X and Y, 
and to Philip AuClaire and Nils Hovik, who helped 
considerably as experimental assistants. Martin 
Richter gave useful advice on certain aspects of the 
statistical analysis. 

2 Requests for reprints should be sent to Robert G. 
Jones, Department of Sociology and Anthropology, 
University of Virginia, Charlottesville, Virginia 22903, 
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is ability. In light of the pervasiveness of «bility 
attributions and the long-standing interest in 
social psychology in achievement motivation, 
it may be surprising that attempts social 
psychology to uncover the essential determi- 
nants of this sort of assessment of pecie are 
quite rare. Indeed, Jones, Rock, Shaver, 


Goethals, and Ward (1968) found that the 
matter has been “almost entirely ignored.” 
These investigators performed six separate 
experimental studies of ability attribution 
under different temporal patterns of success 
in solving intelligence-test-type items. In these 
studies, the subjects observed a person who 
either obtained an ascending, a descending, or 
a random pattern of success. In all cases, the 
person obtained 15 correct answers to 30 test 
items. Jones et al. repeatedly and somewhat 
unexpectedly found a strong primacy effect: 
the person with the descending pattern of suc- 
cess was seen as more able and more intelligent 
and as having obtained a higher score overall. 
The findings suggest, the authors stated, “that 
early information about ability is overweighted 
and leads to premature and persistent attribu- 
tions [Jones et al., 1968, p. 336].” This bad 
news for “late bloomers” supports the aphor- 
ism “First impressions are lasting impressions.” 

However, as Jones et al. suggested at the 
end of their article, the results do not establish 
whether this process of attribution is more 
intrinsic to the psychology of the judges or the 
conditions and nature of the task on which the 
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nts are made. The present study was 
d on the belief that the primacy effect 
reality a more limited effect, and in situa- 
more akin to the contingencies of every- 
y life, recency is the rule. Specifically, the 
es et al. study can be described as involving 
solution of a set of puzzles, while conditions 
everyday life almost invariably entail con- 
siderations of strategy. This distinction is de- 
ed from Moore and Anderson’s (1969) dis- 
"ussion of socialization processes, in which they 
gue that all known societies have relied upon 
nes as effective means of socialization, and 
different types of games enable the child 
o discover different things about his world 
about himself. Of their typology (games of 
ce, puzzles, games of strategy, and the 
ation of aesthetic objects), we consider 
es and games of strategy virtually ex- 
tive of the conditions under which judg- 
ents of ability in our society are made. 
The critical feature of zames of strategy that 
sels them apart from puzzles is the required 
Orientation of the actor to his task. Not only 
nust the player be able to retrieve relevant 
mation rapidly but he must utilize the 
mation that he derives from the play 
to plan his own behavior in light of what 
coming to know about the behavior of the 
ter, The useful deployment of feedback in his 
procal perspective on the play is the key 
the strategist’s success, which is to say 
it he learns and profits from experience with 
ongoing encounter. The puzzler either 
mows the answer, can assemble things he does 
mow to make an educated guess, or does not 
now the answer. The situation itself provides 
learning opportunities for the puzzler. 
= The present study endeavored to follow 
ghly the Jones et al. investigation, with the 
Shifted to a strategic task, and with the 
En poral pattern following a U-shaped, rather 
nana linear, trend. This variation was adopted 
Part because of convenience and in part 
ause it was hoped it could better illuminate 
* Processing of cues in ability attribution. 
otape recordings were made of two persons 
atl and Joe) playing a strategic game. The 
tn of success was varied so that in the 
mental conditions, one player (X) re- 
Y Outcomes which eventually led to an 
ding pattern of success, while the other 
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player (Y) eventually obtained decreasing 
success. The scoring was planned so that at 
the end of the tenth of 30 trials, X was 18 
points behind (or Y was 18 points ahead). At 
the end of the twentieth trial, both had main- 
tained their standing, so that X was still 18 
points down. By the end of the game, X gained 
18 points, so that the game actually came out 
even. 

The principal concerns of the study were, 
first, to determine whether X would be seen as 
the more able player, thus reflecting a recency 
effect in contrast to the findings of Jones et al.; 
and second, whether the eventual ascending 
and descending patterns would influence the 
evaluation of the personal characteristics of the 
player. In other words, would the pattern of 
success serve as a pivotal cue in the perception 
of the person so that under ascending success 
a trait would be evaluated more positively while 
evaluated more negatively under descending 
success ? 


METHOD 
Subjects 


Subjects in the experiment were 85 undergraduate 
men at Lehigh University, who volunteered from intro- 
ductory psychology classes. Two subjects in the Joe- 
as-X ascending condition and three subjects in the 
Carl-as-X ascending condition were eliminated from 
the data analysis either because they failed to identify 
the correct player as X, or because of extreme dis- 
crepancy in reporting the course of the game outcomes. 
Three subjects were eliminated for the former reason 
and two for the latter. Twenty subjects were thus 
represented in each condition. 


Procedure 


Subjects were run in groups of 5-8 until each condi- 
tion had 20 subjects. As each group arrived at the 
experimental room, they were asked by the experi- 
menter to have a seat and wait until all had arrived. 
When the group was complete, the experimenter briefly 
noted that he was going to play a videotape which 
could be seen on the monitor in front of them and that 
their instructions would be given in the recording. He 
then played one of four videotapes which had been 
prepared for the experiment. 

As each tape begins, an experimenter is seen drawing 
a game matrix on a blackboard. When finished, he 
stands back from it for a moment, checking it, and then 
pulls a large map down over it. He exits to the right 
and returns shortly with two men of college age who 
are motioned to sit behind a table in front of the black- 
board. The experimenter sits down across the table with 
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his back to the camera. Meanwhile, a voice has related 
the following instructions to the subjects. 


You are now looking at what is called a payoff 
matrix. The numbers in the matrix refer to the points 
that each of two players gets in a simple type of game. 
In a moment, you will see a study of the way people 
play a competitive game whose outcome is deter- 
mined by the matrix. You are looking at this study 
through a one-way mirror. The players in the game 
are not aware of this recording, although they later 
consented to our use of it for further research 
purposes. 

The investigator will describe the rules of play. 
Your task in watching this game is to determine the 
ability and the skill of the two players. This is a 
more difficult task than you are likely to expect. 
After the players play a game, you will be asked some 
questions pertaining to the way that they played. 
In watching, concentrate on how the two players do 
in the competition. See if you can perceive differences 
between them in terms of their success and their 
ability in playing the game. 


At this point, the sound track switched to the scene 
on the monitor where the experimenter was beginning 
his instructions to the two players before him. 


This is an experiment dealing with game theory. 
Your task is to play an elementary game which is 
designed to help us understand the way people play 
games. Before you is a sheet which has a 4 X 4 matrix 
drawn on it. You will note that it is a large square 
which has 16 boxes [both players look intently at a 
piece of paper in front of them]. Each box is identified 
by the row and column it lies in. 

This 4 X 4 square is what is called à payoff matrix 
in game theory. Each box actually contains two 
payoffs or scores: one for each person playing the 
game. I have not included the payoff values in the 
diagram which you have, and part of your task in 
playing the game will be to come to an understanding 
of the values in the matrix. I can tell you this much 
about the matrix: (1) the two values in each cell 
may or may not be the same for both of you; (2) 
the only values are — 1, 0, 1, or 4; (3) the values in 
the matrix will remain constant throughout the 
game; and (4) there may or may not be a general 
pattern by which values in the matrix have been 
assigned. 

The payoffs in the matrix are actually drawn on the 
board behind you, and therefore Iam going toask you 
not to turn around and look at it [the experimenter 
gets up, rolls up the map, and returns to his seat ]. 
When the game is over, of course, you can look at it. 

But if you saw it now, it wouldn't be much of a game. 

The game is played by both of you jointly identify- 
ing a box. To do this, we will alternate who goes first. 

and who goes second each time. There will be 30 
trials in a game. The object of the game is to win by 
the greatest number of points possible. That is, you 
should try to beat your opponent by as large a score 
as you can. 

We will proceed as follows. One of you will be 
Player "X" and the other will be Player “Y” [ 
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Fic, 1. Payoff matrix used in a! 
experimental conditions. 


have arbitrarily decided to make you X [the e i 
menter points to the player on his left], and thé 
means you will be Y. [The experimenter then place 
large cards with the letters X and Y in front of the 
appropriate player. ] X. will give a number from oi 
to four, and Y will follow with a letter, A, B, C, or. 
Thus, X might say “3,” and Y might follow with: 
“A.” This would identify the third cell in the te 
row of the payoff matrix, I will then tell you the seo 
you get in that cell. Then, Y goes first, identifying e 
row, and X follows, identifying a column. I will give 
you the payoffs in that cell, and we will continue th 
way for 30 tries. That will constitute a game. 


There followed a brief interchange, in which each of | 
the players asked the experimenter a question which he 
answered by repeating a relevant part of the instruc- 
tions. The players then played a trial game of 10 t 
which was the same for all conditions of the experim 
and which Y “won” four to two. Meanwhile, the camel 
had zoomed in so that the payoff matrix dominated the 
center screen, framed by the figures of the two players: 

The payoff matrix used in all conditions is given m 
Figure 1. The matrix was designed to be symmetrical 
in payoffs for X and Y, with the center yielding a low 
advantage and the marginals a high advantage. The 
mental task required to master the matrix in play looked 
easy to the subjects in the experiment, but naive sub 
jects actually required around 80 trials to become b 
proficient in play. [ 

All calls in the televised games were in fact predeler- 
mined, and the players were receiving cues from 
screen assistants for each call. Despite this, the pla 
had been extensively rehearsed, and the play appeare 
highly natural. No subject expressed suspicion about 
prearrangement of the games, In all, four separate tapes 
were made. The two ascending condition tapes had 
role of X played by Joe and by Carl. The role of Y in 
each case was played by the other person. The twa 
random condition tapes replicated this procedure so 
Joe and Carl played both roles in both conditions. The 
progress of the game in the ascending condition had 
X losing quite steadily in the first third of the game 
In the middle third, he managed to hold his own: 
the final third, he gained ground as steadily as he h: 
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5 10 15 20 25 30 
X CAL 41422 32313 353251 22444 31112 22324 
Y CA BDDBA BBADD ACAAC DBBDA CCBAA CDDCA 
OUTCO! 3-50 -1-3 -4-1-2-5 3 -21-3-23 2-1-30 3 1350-35 12315 
FOR X 
ASCENDING 
i 5 10 15 20 25 30 
X CA 242212323 34423 24334 323122 |4221 
Y CA BDDBA CCABC ABDDD BAACB DCAAB CCDAC 
OUTC 4:02: p0 11-24 1 -2-302 3 -15-21-5 3 10-3-1 3-52-35 
FOR 
RANDOM 
Fic. 2 1. of calls used in ascending and random conditions with outcome in terms of X's score. 
(Underlined call was given first in each trial.) 
TRIAL 
2 4 6 8 lo 12 14 16 18 20 22 24 26 28 30 
3 5 T. 9 Lu 15.5.15. ATS 19> el es 25 27 29 
R A 
AEAN R 
| « i ca ear AR 
| x RON a HD 
ot 
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oS! ed 
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-23 
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Fic. 3. Actual pattern of outcomes in both experimental conditions in terms of X's score. 
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lost it in the first third, and by the last trial he was 
back even with Y. In the random condition, no great 
advantage was ever gained by either X or Y, and at 
the end of every five trials the score was even. 

Even in the random condition, the game is fascinating 
to watch, and the attention paid by the subjects to the 
play was typically quite intense. The actual calls made 
under both conditions of the experiment are given in 
Figure 2, while the patterns of the payoffs in terms of 
X's position are given in Figure 3. Following the view- 
ing of the televised game, the subjects were asked to 
complete a questionnaire dealing with the game and 
the players. After being asked not to discuss the ex- 
periment with others, the subjects were given course 
credit for participating and were dismissed. 


RESULTS 
Perception of Outcomes 


The experimental conditions of the study in 
effect placed Player X against Player Y in a 
strategic game whose outcomes led from initial 
heavy losses by X to an eventual winning 
trend (ascending condition). By the game’s end, 
X was even with Y. The control conditions 
were designed to show the same players in a 
standoff, with both players gaining a grasp 
of the game at the same pace (random 
condition). 

As an indication of their perception of the 
game's outcome and their attribution of ability, 
the subjects were asked which player had won 
the game, who was leading at the one-third and 
two-thirds points, and who would be the winner 
in the next game. In the ascending condition, 
the subjects’ reconstruction of the game’s 
progress shows steadily increasing success for 
X. Such a trend is projected into the estimate 


by 2% 3 NEXT 


D NEXT 
Va ĉa 3% GAME 


X'S ADVANTAGE OVER Y 
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FIG. 4. Recalled pattern of outcomes in both experi- 
mental conditions in terms of X's score, 
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TABLE 1 


ANALYSIS OF VARIANCE OF RECALL? ND 
PREDICTED OUTCOMES 


Source df MS F 
Pattern of play (A) 1 61.25 
Outcomes (B) | 3 1171.15 10,50* 
AXB 3 1010.03 9,05* 
Error | 312 111.57 
i * p <.001, E 
of victory for the succeeding game. The pattern 


of results is shown in Figure 4. Separate trend 
analyses of these data for the Joe-as-X and 
Carl-as-X conditions show the linear trend to _ 
be highly significant in each case (p < .001). 
Further, in the Joe-as-X condition, X's ex- - 
pected margin of victory in the next game is 
significantly different from zero (¢ = 5.03, 
b < .001), while the Carl-as-X condition al- 
most reaches significance (! = 2.03, p < .06). 

In contrast, the random conditions, as ex- 
pected, showed no trend in the progress of the 
game. The outcomes are never recalled by the 
subjects as having differed significantly from 
zero, nor are the two predictions for the next 
game significantly different from zero. Inter- 
estingly, however, the contrast between the 
prediction for the next game for Joe and Carl 
approaches significance (f= 1.73, p < .10). 
This prediction of greater success by Joe sug- 
gests a difference in the attribution of ability 
to the two players which is examined in more 
detail below. 

As a test of the significance of the overall 
pattern of recalled and predicted outcomes, 
a 2 X 4 factorial analysis of variance was per- 
formed, contrasting the ascending and random 
conditions against the four outcome estimates. 
A significant interaction in this analysis would 
imply that the average slope of the lines de- 
scribing the outcome data was different be- 
tween the ascending and random conditions. 
As Table 1 summarizes, the interaction is 
highly significant. Moreover, separate analyses 
for Joe-as-X data and Carl-as-X data also 
indicate significant interactions (for Joe as X, 
F = 439, < 01; for Carl as X, F = 5.37, 
p < .005). 

These findings stand in clear contrast to 
those obtained by Jones et al. (1968). As they 
reported of their results, 
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We are unq: ly dealing with a rather robust 
and replicabl ; that the performer who does 
better at first ı as more capable than the per- 
former who e r in the series but solves the same 
number of pr 335]. 

Our genera that the player who does 
better at t f the game is considered to 
be the mor: onfirms the notion that the 


ty differs under conditions of 
n those of puzzle solving. 
et al. investigations show 
t a primacy effect occurs in 
f ability to the puzzler, the 
strategist do not appear 


attribution 
strategic p 
While the 

convincing] 
the attribut 
attributions the 


to follow th e pattern. 
Perceplion rjormers 

Through : tions asked of the subjects, it 
is also possit/« to assess other impressions of 
the perforn ;oth as role players whose be- 
havior was letermined by the script and 
as individu: hose behavior within the roles 
reflected pc:sonal variations. Each subject 
was asked to rate X on 16 pairs of adjectives 
separated by 7-point scales, and to repeat the 
procedure for Y on an identical questionnaire. 
Eight of the adjective pairs were polar op- 
posites, 4 of which were included to test the 
difference between X and Y, and 4 of which 
Were included to test the difference between 
Joe and Carl. In coaching Joe and Carl in 
preparation for the videotaping, the contrast 
in their natural styles of play was emphasized. 


Thus, Joe played as a more intense and rapid 
Player, while Carl played his roles much 
Slower, if not ploddingly. While Joe’s calls were 
given without hesitation, Carl invariably 
paused, mused for a moment, and then an- 
nounced his call. The other 8 adjective pairs 
Were designed to characterize contrasting 
Styles, but with one adjective more favorable 
in connotation than the other. As examples, 
for Joe it was designed so that subjects would 
ave to choose between impulsive and quick 
thinker, and for Carl, between sluggish and 
deliberate. The particular interest of these 
questions was to determine if the perform- 
ances of Joe and Carl as X and Y would 
Significantly influence how they were perceived 
aS persons, 

Table 2 presents the means and comparisons, 
With the significance level derived by one-way 
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analyses of variance. In the ascending condi- 
tion, X was perceived as clearly more retentive, 
successful, and generally smarter than Y, 
whether X was played by Joe or Carl (column 
1). None of the test adjective pairs 1-4 ap- 
proached significance in the random outcome 
games (column 2). These contrasts are 
attributable to the differences prescribed in the 
roles of the games. 

Columns 3-6 present comparisons between 
the two players when they played the same 
roles. The adjective comparisons 5-8 were in- 
serted into the questionnaire specifically to 
determine if the contrasting styles of play 
of the two persons would be perceived in spite 
of the standardization required by the script. 
In general, this occurred. In both the patterned 
and the random games, Joe was seen as the 
quicker player and the more nervous player. 
Carl, by contrast, was seen as slower and more 
composed. These differences validate the 
attempt in the experiment to contrast the 
players’ styles, and they are obtained whether 
the player is performing the role of X or Y. 

With clear effects thus attributable to the 
role and to the person, it is of interest to see 
if the interaction of role and person led to con- 
trasting attributions. Did the fact of having 
apparently learned a winning strategy from 
the outcome matrix alter the perception of Joe 
and Carl? This effect is tested by the forced 
choice between similar adjectives whose con- 
notations vary on a dimension of more or less 
favorable evaluation. The critical comparisons 
for Joe are the adjectives 9-12 in column 7, 
and those for Carl, the adjectives 13-16 in 
column 8. In general, the anticipated effect 
was obtained. When he had the ascending 
pattern, Joe was seen more as à quick thinker 
than impulsive, more astute than careless, and 
more decisive than rash. The opposite impres- 
sions were formed with the descending trend 
when Joe played the role of Y. By way of 
comparison, Carl, with his more methodical 
style, was perceived when his outcomes had 
the ascending pattern as more deliberate than 
sluggish, and more attentive than uncompre- 
hending, with the opposite impressions when 
he played as Y. 

The effect of the ascending pattern, then, 
was to elicit a more favorable evaluation of 
essentially the same behavior. This effect can 
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resulting from the centrality 
f success in a manner similar 
it findings of Asch (1946). 
trait effect was obtained from 
iptions of persons described 
list of adjectives in which 
1” were substituted. Kelley 
tly found that altering the 
old in introducing an instruc- 
ted students’ later interaction 
esent finding serves to extend 
by showing that the same 
ewed favorably or not by the 
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to them f: 
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this phenor 
behavior ca 


consequence hat behavior. 

To chara : the contrast to which these 
data point, Jos, when he plays the role of Y, 
appears as a {.<t player, but one who does not 
have much ı mind for the game. He is 
impulsive, | before he looks, even when 
events turn usiavorable. He seems rather un- 
interested in improving his position when the 
trend indicates some change in his behavior is 
required, Or e other hand, holding to this 
same style under conditions where they even- 


tually lead to an ascending success pattern 
Produces the impression of a person whose 
fundamental acumen surfaces in time because 
of his unusual persistence. In comparison, 
Carl, with his more deliberate manner, is seen 
as à very slow learner when playing the role 
of Y. It is not clear whether with 100 trials 
he would even learn the matrix. This style 
Yields a different impression when, as X, he 
Manages to reverse the downward trend to a 
Steady pattern of success. Under these cir- 
cumstances, he seems to be a person who has 
^ Systematic plan of analysis, who is patient 
and careful in working it through, and who 
eventually puts the elements of the task to- 
Sether to master the game. These comparisons 
are striking when the two ascending tapes are 
Viewed consecutively. In effect, they support 
the aphorism “Nothing succeeds like success." 
There is in addition a further contrast be- 
tween the ability attributions to Joe and Carl. 
eviewing the significant differences in columns 
36 of Table 2, there are several instances in 
Which Joe is generally seen as the more able 
Person. Obviously, the person who achieves 
Success with acts that require no hesitation 
Should be seen as more able than the person 
Who succeeds equally well, but with apparently 
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more effort. However, the scattered com- 
parisons that are significant are less evident 
in the comparison between Joe as X and Carl 
as X in the ascending condition. One might 
think that a person seemingly rushing toward 
defeat (Joe as Y, column 4) might be considered 
less able. This is not the case, and several com- 
parisons in columns 5 and 6 favor Joe when his 
actions yield no discernible advantage. One 
exception is where Carl is seen as more cautious 
and less laborious than Joe, but this comparison 
seems less relevant to successful play in the 
game. While the effect is unexpected, there 
does seem to be a characteristic about Joe's 
behavioral style which elicits an impression 
of a more able or smarter person. This is re- 
flected as well in the fact that both random 
conditions expect Joe to win in the next game. 
Perhaps a quicker, less hesitant performance 
by a person yields an impression of greater 
ability independent, to some extent, of the 
success of the performance. Or, mannerisms 
not identified in the study could be responsible 
for the effect. 


DISCUSSION 


The results of the experiment generally con- 
firm the expectations which guided its design. 
When the subjects are asked to predict the 
winner of the next game, the ascending player 
in the experimental conditions is reliably 
chosen. Such a finding would seem to indicate 
the operation of a recency effect under condi- 
tions of strategic play, in contrast to the pri- 
macy effect obtained under puzzle-solving con- 
ditions by Jones et al. The subjects seemingly 
find the ascending pattern of X to be the more 
successful, and they attribute greater enduring 
characteristics of ability to him. However, an 
unexpected anomaly arises in the pattern given 
by the subjects in their recall of the progress 
of the game in the two ascending conditions. 
As Figure 3 indicates, the actual progress of 
the game was a fairly smooth curvilinear down- 
ward-to-upward path for X. In contrast, as 
Figure 4 indicates, the subjects reconstruct the 
path in a linear slope upward for X. after the 
tenth trial. Moreover, the recall of Joe as X 
even produces in the trend analysis a signifi- 
cant cubic (p < .05) in the opposite direction 
from that which was presented in the recording. 


TABLE 3 


NeTwat axo Recanto Cuance or X Every 
10 Teiars: Borm Ascrxpmxa 


Cosbprmiows. C 
Change Ith trial wh trial 
Actual -150 0 1&0 
Kecalled -103 90 | 27 


This distortion in the recall of the game's 
progression is summarized in Table 3. The 
pooled results from Joe as X and Carl as X 
indicate that at the end of the first block of 10 
trials, the subjects underestimated X's losses 
by 7.7 points. During the second block of 10 
trials, their recall overestimated his gain from 
the first block by 9.0 points, while at the end 
of the game, they underestimated his gain from 
the second block by 15.3 points. The actual 
path from the fourteenth trial was a curve of 
Positive acceleration, and the recalled path 
from the tenth trial was a curve of negative 
acceleration, Moreover, this distortion is pro- 
jected into the prediction for the next game, 
since the anticipated Winning margin for X is 
only 7.2 points, while extrapolating the actual 
path of the last 15 trials over an additional 30 
trials would yield a winning margin of 42 
points. 

Several possible explanations could be ad- 
vanced to deal with this apparent displace- 
ment of the actual winning trend in the last 
block of trials to the middle block in the re- 
called trend. As one possibility, there might 
have been some interference in following the 
game, introduced by the novelty of the task 
and the time required for the subjects to be- 
come familiar with what was going on. While 
a preliminary game of 10 trials was included 
on each tape to minimize such confusion, per- 
haps this was not sufficient. This explanation 
has two difficulties, however. On the one hand, 
the subjects in the random conditions ap- 
parently did not have the problem, and, on the 
other, the argument would imply that recall in 
the later trials would be more accurate. This 
effect, however, was not manifest. 

A second possibility is the operation of a 
halo effect. From their observation of the 

eventual ascending success of X, the subjects 
formed a general impression at the end of the 
game of X as the more able player. When they 
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were then required to indicate in : detail 
the path by which X demonstrate: ibility, 
the subjects tended to reconstruct ath in 
its simplest possible form, a straigt e, and 
give X a more favorable position a: point 
In effect, the subjects ignored the ces of 
the proceedings by focusing atten m who 
was going to win overall. This exo... tion is 
plausible, and the only difficulty w t from 
the present data is the fact that is no 


evidence for it in the random-Cóm s, de- 
spite the almost signific contr. | the 


prediction of success for Jc over Ce- in the 
next game. a 

We tend to favor an explanation #0! based 
on biased reconstruction of the gam: t one 
dependent upon the perceptual imics 
operating as the game proceeded. s ex- 
planation posits that the salience oí i. forma- 
tion for impression formation is cont: led by 
contrast and assimilation effects as "^ íts in 
X's fortunes occur. Thus, the mos: iking 
impression formed over the course of : game 
is the contrast in X’s outcomes between the 
first and second blocks. This contrast could be 
heightened by the subjects’ awarenc-s that 
sheer luck plays an important role in each 
player's outcomes during the initial explora- 
tion of the matrix in the early trials. When 
considerations of strategy become meaningful 
by the second block, X may be seen as manag- 
ing a dramatic reversal of the game's course. 


The subsequent trend then established in the 
third block is assimilated to the established 
impression of X as the more able player, and 
as such is not as salient information, despite 
the fact that the gains in that block are the 
greatest. In effect, the subjects could be con- 
sidered to have adapted their expectations by 
the third block to X's developing a winning 
trend, and the fact that it occurs is not news. 

In terms of a more formal model for describ- 
ing this process, the idea of “information gain” 
in information theory is close to what we have 
in mind (Cherry, 1957). Information gain 1s 
defined as the logarithm of the ratio of the 4 
Posteriori probability to the a priori probability 
of Hypothesis x. Thus, 


Information gain = log b(x|y)/p(x), 


where p(x) is the a priori probability of Hy- 


y) is the a posteriori prob- 


ly of H s x given Evidence y. 
imp! s of this expression include 
tt tha: two probabilities are equal, 
" is zero, since the log of one 
Als zreater the a posteriori set 
dt priori set, the greater the 


However, if the a priori 
; the a posteriori, the a 
subject to rejection, since 
rith . fraction is negative: there 
i tion gain. It may be inferred, 
the a priori probability of an 
v high, the disconfirmation 
fr yw value in the a posteriori 
` be regarded as error (in the 
ise) instead. This will occur 
v has sufficiently shored up 
ility to some optimum level. 
is not at that level, it could 
umed that actual error will pass in the 
of infor» tion supporting an alternative 
hesis. >: ncipally, the effect of the a 
i ma'e more likely that the 
a posteriori set will be biased 
likely a priori hypothesis— 
indiscriminately facilitate recogni- 
inhibit rejection of error. 
information theory derivations re- 
integration with more strictly psycho- 
concepts, including most immediately 
ation and contrast effects. We thus 
Posit a range of a posteriori values in the 
orhood of the a priori value which 
fests assimilation to the initial hypothesis. 
Steriori values increasingly smaller or 
than the a priori eventually contribute 
trast effects, and the initial hypothesis 
n revised. In the present experiment, a 
st effect would have been more easily 
during the second block of trials 
the a priori hypothesis that X was a 
'" was tempered by the knowledge that 
much to do with his early outcomes. 
ming the tide in the middle block repre- 
| substantial information gain and the 
ion of a new hypothesis. His later 
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success provided less information gain and was 
assimilated to the hypothesis that X was the 
more able player. 

The data of the present investigation do not 
allow any rigorous test of this proposal, al- 
though it is not incompatible with any of the 
evidence at hand. Further research will be 
required to establish whether, indeed, the re- 
called trend is a reliable finding. Finally, this 
proposal requires a modification in our initial 
expectation regarding primacy and recency 
effects under strategic conditions. While the 
prediction of success in the next game ap- 
parently does manifest a recency effect, the 
more detailed examination of recalled cues 
points again to the overweighting of early in- 
formation in impression formation. The expla- 
nation we are led to support assumes a modifi- 
cation of the primacy effect under a curvilinear 
outcome pattern in a strategic game, but it 
does not exclude it. 
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TRANSGRESSION, SELF-IMAGE, AND COMPLIANT BEHA\ 


DAVID L. MCMILLEN ! 
Mississippi State University 


A hypothesis was tested that compliance increases following transgression, primarily 
because compliance restores self-esteem which was lost as a result of transgression 
Self-esteem and transgression were manipulated in a 2 X 2 factorial design. Differ 
ences in compliance between transgression and no-transgression subjects occurred 


only when the subjects received no self-esteem manipulation prior to the compliant 
request. The results are discussed in terms of consistency between self-image and 


behavior. 


Several studies have demonstrated that 
compliant behavior increases following trans- 
gression. Transgression has been operational- 
ized in a variety of ways, including destroying 
a machine (Brock & Becker, 1966; Wallace & 
Sadalla, 1966), upsetting the order of a 
graduate student’s index cards (Freedman, 
Wallington, & Bless, 1967), costing another 
person green stamps (Berscheid & Walster, 
1967), lying (Freedman et al, 1967), and 
administration of electric shocks (Carlsmith & 
Gross, 1969), and the results have been essen- 
tially the same. However, the theoretical ex- 
planations of the data are not uniform. Some 
investigators (e.g., Carlsmith & Gross, 1969; 
Freedman et al., 1967) employed a “guilt” in- 
terpretation to these findings. The guilt in- 
terpretation contends that when an individual 
transgresses, he feels guilty. The request for 
compliant behavior gives the individual an 
opportunity to alleviate his guilt. 

Brock (1969) stated that the guilt interpreta- 
tion of the relationship between transgression 
and compliance is not warranted on the basis 
of the existing data. Brock expressed concern 
over the imprecise conceptual status of guilt 
and the lack of manipulation checks. In addi- 
tion, he suggested that the body of data can 
be explained in terms of fate control and 


maintenance of social consistency, According 
to Brock (1969) : 


An individual who has affected the fate of another 
person in a certain magnitude will repeat that magni- 
tude of control over the other person (or a person in a 
similar role) if an opportunity to do so presents itself 


[p. 143]. 


! Requests for reprints should be sent to the author, 
Department of Psychology, Mississippi State Uni- 
versity, State College, Mississippi 39762. 


The “fate control" interpretatior eived 
support by McMillen (1970). He ío::-3 that 
subjects whose transgression enabled them to 


gain fate control over another person complied 
more than subjects who did not gain i:te con- 
trol by transgressing. 

An additional interpretation sugse::-d by 
Carlsmith and Gross (1969) and Lillen 
(1970) centers on the importance of compliant 
behavior as a means of bolstering one’s self- 
image following transgression. According to 
this interpretation, the individual's act of 


transgression is inconsistent with his «c/(-con- 
cept. Complying with a request for assistance 
gives the individual a chance to resolve this 


inconsistency by restoring his self-image. This 
interpretation is derived from a recent Lempt 
by Aronson (1969a, 1969b) to clarify disso- 
nance theory. According to Aronson, one of the 
major determinants of dissonance arousal is 
whether or not inconsistency exists between a 
cognition about the self and a cognition about 
a behavior which violates this self-concept. In 
the majority of the transgression-compliance 
experiments, the subject is faced with a cogni- 
tion concerning his incompetence (e.g, “I 
damaged the experimental apparatus") or a 
cognition concerning his immorality (e.g., “I 
cheated”). For most people (i.e., those with a 
positive self-concept), such a cognition should 
arouse dissonance. 

Indirect evidence for the role of compliance 
as a means of restoring the self-image comes 
from Carlsmith and Gross’s (1969) finding that 
the subject is more likely to comply when the 
request is made by someone other than the 
person transgressed against. Carlsmith and 
Gross pointed out that continued contact with 
the victim may serve as an unpleasant re- 
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minder of one’s transgression. By complying 
with a re t by someone other than the 
victim, the idual could restore his self- 
image and | the person he transgressed 
against. 

The pur of the present experiment was 
to assess t! portance of the self-concept 
in determi f compliance would increase 
following t: ession. The experiment con- 
sisted of a - factorial design. Half of the 
subjects were given an opportunity to cheat 
on a test; ti her half were not. Half of the 
subjects were given information designed to 
bolster their self-image; the other half were 
not, It was licted that the greatest amount 
of compliance would occur among subjects 
who cheated but received no information to 
bolster their self-image. 

METHOD 
Subjects 

Forty-four males were recruited from introductory 
psychology classes during the summer session at 
Mississippi > University. On the first day that 
classes met students in the introductory course 
were admir ed the self-esteem scale of the California 


Psychologics! Inventory (CPI). Students were told 
that the test was a personality inventory and was being 
administered to obtain the information concerning the 
characteristics of students at Mississippi State Uni- 
versity, The students were told that they would receive 
the results of their own test score in the next few days. 
Eleven subjects were randomly assigned to each of 
four groups. Subjects were scheduled in pairs. When 
they arrived, they were seated in a waiting room and 
told that the previous subjects had not completed the 
experiment. as yet. The experimenter left the room; 
1 minute later the confederate entered saying that he 
was looking for a book he had forgotten when he was 
in an experiment earlier in the day. The confederate 
engaged the subjects in a brief informal discussion. 
During the course of this discussion, the confederate 
furnished half of the subjects with information about 
the experiment. Specifically he said: 


Are you here for an experiment? [All subjects 
answered in the affrmative.] I was in one earlier 
where I had to take a psychology test. The guy told 
me if I had made a good score it would help my 
Psychology grade, but I didn't do well. He went over 
the test with me afterwards, and most of the answers 
were *B."? If only I had known. 


? Actually, Answer B was correct for 10 of the 50 
questions, During the course of debriefing, the experi- 
menter questioned the subject. to determine if he had 
been, for any reason, suspicious of the confederate. 
None of the subjects stated that they were suspicious. 


AND CoMPLIANT BEHAVIOR 177 


In the control condition, the confederate did not dis- 
cuss the experiment. If the subject asked about the ex- 
periment, the confederate claimed that he had been 
in a person-perception experiment. 

The confederate's decision to administer the experi- 
mental manipulation was determined by a coin flip 
prior to entering the waiting room. The experimenter 
was not made aware of the confederate's decision until 
the experiment had been completed. Four or 5 minutes 
after he entered the waiting room, the confederate 
exited. A few minutes later, the experimenter returned 
and began to explain the purpose of the experiment. 
The following preliminary instructions were given to 
all subjects: 


Have either of you been in this experiment before 
or heard anything about it?* 

As you probably know, the university is presently 
engaged in a large-scale self-study and self-improve- 
ment program. In this department we are trying to 
assess the effectiveness of our courses in meeting 
student needs. The course most frequently criticized 
both by students and instructors is introductory 
psychology. There are several changes we are con- 
sidering for the course; however, before making major 
changes, we need to determine the extent to which 
the average student is knowledgeable in the field of 
psychology. In this experiment we are administering 
diagnostic psychology tests to a large number of 
students in an attempt to find out how much psy- 
chology students know before taking the introductory 
course.‘ The test which you will take is a 50-question 
multiple-choice test. I want you to take your time 
and read each question carefully before you mark an 
answer. Remember, it is important that we get a 
true picture of your knowledge of psychology. This 
test will not effect your course grade unless you make 
a high score. If that occurs, I have made arrange- 
ments with your instructors, Dr. Hudson and Mr. 
Mapp, for you to receive credit for this test in your 
course grade. If your score is fair or poor, it will 
not effect your grade. Do you have any questions? 


At this point, the subjects were taken to separate 
cubicles to complete their tests. They were observed 
through a one-way mirror to determine when they had 
completed the test. At this point, the second confederate 
entered the cubicle. The second confederate was the 
same person who administered the personality test to 
all students on the first day of class. Before entering 
the cubicle, the confederate flipped a coin to determine 
whether or not he would administer the self-esteem 
manipulation to that particular subject. If the subject 
was to receive no manipulation, the confederate entered 
the cubicle and picked up a folder of papers from a table 


3 None of the subjects admitted knowing anything 
about the experiment, nor did anyone confess during 
the course of the experiment. 

“All subjects were run during the second and third 
day of classes, and the test questions were on topics 
usually covered during the last half of the course. 
In all cases, this was the first psychological experiment 
that the subject participated in. 


178 Davis L. McMitiex 
m= the wate of the cubicle from the subject. Resutrs AND Discuss: 
rA pi. "m To determine if the subjects n.: le 


[rb 
itl 
di 
[i b 
iti 


TIT 
EPR 
H 
P 
TH 
HT 


H: 
rl 
i 
i 
F 
B 
= 
$ 
H 
: 


| 
| 


Group 


Test information — | 
No test information | 


— -” 


the information furnished by the 
the frequency of “B” responses 
was analyzed. The means for the \ 
ment groups are shown in Table | n 
son of the information groups with the f 
information groups was highly signifies 
(F = 228, df = 1/40, p < .001). Subjects 
not admit to knowledge of the e:perimenl 
but they used this knowledge in atte 
to improve their test performance 
It could be argued that denying knc 
of the experiment was not considere! a ti 
gression by the subjects, Possibly, the 
menter’s question “Have either of vot 
this experiment before or heard anythi 
it?” was interpreted as “Did you hear anythin 
about this experiment before you xot here? 
However, a recent experiment by \icMil 
and Jackson (1971) indicated that : iking ust 
of the information provided by the confedera 
produced feelings of transgression in the sub 
jects. McMillen and Jackson were testing the 
effects of positive and negative requests Of 
compliance following transgression, 1 hey em 
ployed essentially the same procedure used ir 
the present Study, except that the sub 
were not induced to deny knowledge of 
experiment, and independent checks on 
variables were added. Nearly all of the sub; 
in their information groups indicated that they 
had used the information and that they felt 
they had transgressed. Apparently, the act 
using the information coupled with the exp 
menter's instruction “Remember, it is i 
tant that we get a true picture of your knowl 
edge of psychology” is sufficient to cause the 
subject to feel he has transgressed. 

The data on the compliant measure are 
shown in Table 2. Separate chi-square anal 
were performed to assess the effect of 
esteem on compliant behavior. Significant 
differences in compliance occurred in the n 
self-esteem manipulation groups (X= 
df = 1, p < 02). Nine of the 11 subjects 
had information displayed compliant behavio 
only 3 of the 11 subjects who did not 
information complied. 

There were no differences in compli 
among the self-esteem manipulation gr 
(6.18, df= 1, p « 90). Five of theii 


in 


the ti 


subjects » ransgressed showed compliant 
behavior the 11 subjects who did not 
transgres lied 

The 14 tion subjects who complied 
and the 5 iid not were compared as to 


the num! 
was done | 
jects who 
in psycho 
self-appra 
The mea: 
for those 

did not (/ 
not appe 

plied knes 


not comp 


! responses they made. This 
rine the likelihood that sub- 

ed were less knowledgeable 
d wanted an opportunity for 
r to increase their knowledge. 
er of B responses was 18.21 
plied and 19.87 for those who 

1, df = 1/20, p > .05). It does 
y that the subjects who com- 
sychology than those who did 
nce they did not differ in the 


extent. to they used the confederate's 
informatic 

The dat zest that self-image is an im- 
portant f. in determining if compliance 
increases © owing transgression. Evidently, 
the indivi is primarily concerned about 
how trans on affects him, rather than how 
it affects ti person transgressed against. The 
Cognition : he has transgressed is incon- 
sistent wit. ‘is self-image. The compliant re- 
quest gives the individual an opportunity to 
bolster his scif-image by doing something good. 


If an incident occurs which restores self-esteem 
before the compliance request is made, the 
Subject does not need to comply to restore his 
self-image. 

_ Some caution should be exercised in accept- 
ing this interpretation, since there are no data 
bearing precisely on the function of the self- 
esteem manipulation. It is possible that the 
self-esteem manipulation caused the subjects 
to feel so good about themselves that they were 
anxious to leave to tell their friends. This would 
tend to limit compliance, since the request 
required the subject to stay longer. Although 
this does not seem a likely interpretation, it 
es be negated on the basis of the present 


The “self-esteem” interpretation of com- 
Pliance following transgression is a refinement 
of the guilt interpretation. Transgression theo- 
Tetically may lead to guilt, but there is no 
obvious reason why compliance should alle- 
wate guilt. The individual knows he has trans- 
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Measure Bett No sat | Mae | wo sate 
Lui ever 
ld Wow | bm E: 
Complied P » ae 3 
Bd t esee | MA | 3 D 
gressed, regardless of whether he complies or 


not. However, if the compliance behavior can 
function to raise self-esteem, this may com- 


will be reduced. 
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REINFORCEMENT THEORY OF PERSONALITY 
AND SOCIAL SYSTEM: 


DOMINANCE AND POSITION IN A GROUP POWER STRUCTURE 


DAVID WATSON? 
University of Hawaii 


Social structures or systems, defined as the 
pattern of habitual, organized ways in which 
group members interact, may be viewed as 
means for the dispersal of reinforcements to 
participating members of groups. Such systems 
need not be deliberately rational. The act of 
delivering reinforcements will itself either be 
reinforced or not, and these reinforcements will 
influence the system devised. Certain per- 
sonality characteristics may be viewed as 
individual differences in responsiveness to 
particular reinforcements, or in tendencies to 
seek such reinforcements. This is a social learn- 
ing view of the interaction of personality and 
m systems (cf. Mischel, 1968; Staats, 
1968). 


Social Availability of Reinforcements 


Any particular social structure will offer 
frequent opportunities for gaining certain 
reinforcements and very infrequent oppor- 
tunities to get other rewards, For example, a 
group might be organized along competitive, 
status-oriented lines, which is to say that it 
offers the opportunity for the group members 
to be reinforced with achievement rewards 


* This research was supported initially by a grant 
from the University of Hawaii, which also supplied 
free computer time, and later by National Institute of 
Mental Health Grant MH 17747-01. Melinda Riola 
served as the experimenter, and aided in the analysis of 
the data, and my thanks are gratefully extended to her. 

* Requests for reprints should be sent to the author, 
Department of Psychology, University of Hawaii, 
2430 Campus Road, Honolulu, Hawaii 96822. 
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S vay that suca 
striving is irrelevant, in which case the achi 
ment reward is not available. Other reinforce- 
ments, such as gaining an affiliative relations 
ship, might be offered in that group. Grou 
become systematized in such a way that ce 
socially mediated rewards are made avai 
to the participants in the group, while oth 
socially dispersed rewards are not avai 
and different kinds of group organization v 
make available different kinds of soci 
instrumented reinforcements. 


through social intercourse. Some socially. i 
strumented reinforcements come through di à 
access to others, for instance, affiliation, wh 
others come through indirect inferences fro 
the reactions of others, for instance, achie 
ment or nurturance, but these rewards all 
depend upon social interaction for the imple- 
mentation, Some rewards are gained thro : 
Sharing the benefits gained by the group 
its work, while others are not directly: 
nected to the outcome of the work, but 
gained through social interaction as the grou 
performs its task, 

If social structures are habitually orgai 
means of accomplishing some group goal, the 
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the behavio the individual members of the 
group mus! ustained until the goal is 
reached. Th val-oriented behavior is gen- 
erally maintained through the application of 
reinforcem: incentives while movement 
toward the is in progress, and through 
offerings of : ds once the goal is reached. 
The group : er's participation in the goal- 
oriented be! of his group, and his morale 
as a group member, are determined by the 
availability of cinforcements contingent upon 
participation. Issues’ of productivity and 
morale hinge upon the ability of a 

sustain th: sk-oriented behavior of 1 
members. 

These rei ements or incentives need not 
be directly connected to the rewards envisaged 
by those who have established the group goal 
or who determine the group structure. A leader 
who direct group toward some achieve- 
ment-related goal might sustain the produc- 
tive behavior of the group members by 
organizing the croup so that affiliative rewards 
are available, contingent upon participation 


in the task. Groups that are optimally or- 
ganized will provide rewards that can be 
instrumented via participation in the group 
process, sustaining adequate performance and 
morale. 


Personality as Responsiveness to Reinforcements 


Now turn to the personal characteristics of 
the individual group members. Many hy- 
Pothesized personality characteristics can be 
viewed as individual differences in the effec- 
tiveness of a specific socially mediated reward, 
9r as differences in the tendency to seek such 
reinforcements. Many of Murray’s (1938) list 
of needs can be seen this way: achievement, 
dominance, nurturance, affiliation, social fear, 
etc. When a person scores highly on some 
Measure designed to assess one of these 
Personality variables, he is indirectly indicat- 
Ing that the type of social interaction or out- 
Come which the measure asks about is for him 
an effective reinforcement. This applies 
Whether the measure involves self-report or 
Some other process, such as a projective test 
like the TAT, The high scorer indicates that 
his behavior can be affected by the presence 
®r absence of the class of reinforcers asked 


about by the measure. Conversely, the low 
scorer indicates that for him that class of 
reinforcers is largely ineffective, that is, not a 
reinforcer, and thus his behavior will not be 
affected by its presence or absence. 

The interaction of personality and social 
structure now becomes explicable by examin- 
ing (a) the degree to which a given social 
system provides opportunities for a particular 
social reinforcement and (b) the degree to 
which each individual's behavior can be 
influenced by application of that reinforcement. 
Personality is defined as individual differences 
in responsivity to certain types of rewards, and 
social system is the organized pattern of inter- 
action which can either maximize or minimize 
those types of rewards. An individual's 
performance and morale in'a group will depend 
upon the group being so organized that the 
reinforcers for that person are made available. 


Given relatively homogeneous personality N 


measures, which assess responsiveness to only 
one class of reinforcers, and experimentally 
created homogeneous social situations—situa- 
tions in which only one important social rein- 
forcement is available—the prediction of the 
interaction of personality and social system 
is straightforward. Homogeneous personality 
measures are available, for example, Jackson’s 
(1966) Personality Research Form. When the 
social structures are not homogeneous, as is 
usually the case in real-life groups, more than 
one class of reinforcements is available, and 
prediction requires an analysis of all the 
relevant, available reinforcements. 


Dominance and Position in a Social Power 
Structure 


The frequency of communication and the 
satisfaction of group members are affected by 
their position within the social power structure 
of a task-oriented group. Those who hold 
positions of relatively high power enjoy their 
position more and communicate more fre- 
quently (Hurwitz, Zander, & Hymovitch, 
1968; Watson, 1965; Watson & Bromberg, 
1965). While previous research has examined 
the effects of different positions in social power 
structures, there have been no studies of the 
effects of relevant personality characteristics, 
such as dominance, upon the communication 
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or morale of group members, nor have there 
been studies of the possible interactions of 
relevant personality characteristics with posi- 
tion in a power structure. If the behavior of 
group members is partially determined by their 
position within the group power structure, it 
may also be determined by their degree of 
interest in the exercise of social power. In a 
2X2 design, this study manipulated the 
social power position of group members, 
either high or low, and their degree of expressed 
interest in dominance. 

Jackson's (1966) Dominance scale was used 
às the measure of interest in dominance. 
People who score highly on the scale are 
hypothesized to be prone to desire to influence, 
direct, and control the behavior of others; 
to enjoy arguing for their own point of view; 
and to enjoy guiding and supervising the 
activities of others. If these people were given 
à chance to direct others, they would be very 
likely to use this power frequently. In task- 
oriented groups with many decisions to be 
made, they would be expected to communicate 
frequently in an effort to guide group decisions, 
and even if they were not in a powerful posi- 
tion, they would be expected to communicate 
to others in an attempt to control group out- 
comes. Those low on dominance would be less 
likely to use power if it were given to them, 
and less likely to communicate in attempts to 
guide group decision processes. People high in 
dominance would enjoy positions of high power 
more than those low on dominance. 

The group power structure involved one 
powerful person and three relatively weak 
members of a task-oriented group, called a 
centralized and connected structure. Within 
such a structure, those who hold power com- 
municate more frequently and report that 
they enjoy their positions more than those 
who do not (Watson, 1965). 

Four main dependent variables were used: 
the number of times those who held high power 
used their power, the frequency of all group 
members’ communication to each other, the 
content of their communication, and their en- 
joyment of the situation. Both position within 
the power structure and the subject’s degree 

of dominance were expected to affect these 
variables. 
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METHOD 
Subjects 
The subjects were male undergradu: it the 
University of Hawaii who were strangers t other 
Jackson's Dominance scale was adminis: to 231 


males. From those scoring 12 or above > 75th 


percentile, and 4 or below, the 25th pe: e, 28 
high-dominance and 28 low-dominance s were 
randomly selected. Within the high- and low inance 
groups, the subjects were randomly assign her to 
be high or low power. The » was 14 in each xcept 
for the high-dominance-low-power group, w? ad 13. 
Procedure x 

The subjects were seated separately in small rooms. 


All communication was by messages written on 4 X 6 
cards delivered by the experimenter. The subjects 
believed that they were participating in functioning 
four-man groups, but actually all messages ‘hey sent 


were intercepted, and ed messages > sub- 
stituted. To simulate a - -oriented p, all 
messages were task oriented d directed to aed from 
the group leader, and the leader receive! | vice as 
many messages as did the followers. This re -ts the 
usual nature of communication in such pow’s struc- 
tures (Collins & Raven, 1969; Watson, 1965; ` ison & 
Bromberg, 1965) and is part of the power 'ipula- 
tion. Within the high- or low-power condi.:^ns, all 
subjects received identical messages. The subjects were 
encouraged to send messages—and received « rela- 
tively busy flow—in order to solve the group's task, 
which required the integration of the members’ activi- 
ties. The sole restriction was that they were not to 


ask direct questions, which made it possible for the 
experimenter to substitute the prepared messages 
without detection. No subject noticed the deception. 

In the preliminary instructions, the sub were 
told that in order for their group to solve its task, their 
work would require coordination, and the experimenter 
appointed one of the group members to be the co- 
ordinator. The coordinators were led to believe that 
they held the power to direct and guide the task be- 
havior of the noncoordinators. Al] subjects in the high- 
power condition were informed that they had been 
selected to be the coordinator, and those in the low- 
power condition were told that they were a non- 
coordinator and that someone else was the coordinator. 
This assignment to a power position placed no restric- 
tion on the messages that the subject might send, but 
only the coordinators had the right specifically t9 
direct others’ behavior, which they did via the written 
messages, 

The messages were designed to create the impression 
that control was exerted within the group. 


Task 


The task was a simple card game which each group 
member played against the experimenter. It was fully 
explained in an earlier report (Watson, 1965), and 
will only be sketched here. Each group member had @ 


numbered 1, 2, and 3. All hands 
ard was played each trick. For 
the experimenter and one sub- 
played the higher card won, but 
1 in case of a tie. A card was 
ich game. 
vere that if à group member won 
point. If three of the four group 
gained 1 additional point. If the 
or tied the trick, they lost 1 point, 
ost or tied, each lost 1 additional 
ring system was arranged in such 
bject could gain points unless a 
:do) group won. This required the 
its activities in order for each in- 
) win. 

‘ects were inexperienced at the game, 
perimenter intercepted their mes- 
` to control the pattern of wins and 

z the directions sent by the fictitious 

s predetermined that the subjects 

- two games, win the third, and tie 

"pleting four games, the subjects 
:estionnaire, and the purpose of the 
plained to them. 


RESULTS AND DISCUSSION 


ent variables were the amount 
on, the frequency of commands, 
indexes of morale, and the 
nessages. 


cy of Communication and Commands 


e amount of communication was mea- 
by simply counting the number of mes- 
es sent on the 4 X 6 cards. The means for 
four conditions were: high dominance- 
power = 27.0; high dominance-low 
) = 20.3; low dominance-high power 
8.6; and low-low = 6.5. People who were 
on dominance sent more messages than 
ose who were low (F = 98.49, df = 1/51, 
< .001), and those who held high power— 
Teceived more messages—sent more than 
those who held low power (F = 7.43, df = 
B < .01). Dominance made more dif- 
nee than did position in the power struc- 
The high-power subjects may have sent 
e messages simply because they received 
(cf. Watson & Bromberg, 1965), but 
| cannot be true of the high-dominance 
jects. Their large output was an expression 
their interest in guiding the outcome of the 
"oup. 
€ number of commands issued by the co- 
tors was compared, but only within the 
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high-power condition, since only these subjects 
could issue commands. The mean number 
issued by high-dominance subjects was 17.6, 
but was only 7.2 for the low-dominance people 
(F = 13.83, df = 1/27, p < .001). Jackson's 
scale clearly differentiates people according to 
their interest in directing others. 


Questionnaire Responses 


On the posttask questionnaire, the subjects 
were asked how much they had enjoyed their 
position within the group. They responded on 
a five-point scale. The means were high 
dominance-high power, 3.64; high dominance- 
low power, 3.00; low dominance-high power, 
2.64; and low-low, 2.43. Only the dominance 
condition produced an effect (F = 9.43, df = 
1/51, p < .01). The difference between high- 
and low-power conditions neared significance 
(F = 2.81, df = 1/51, p = .10), so the data 
were not interpreted as a failure to replicate 
an earlier finding (Watson, 1965), that people 
who hold power enjoy this kind of experiment 
more. 

The subjects were also asked how personally 
important it was for them to solve the group’s 
task. Again, those high on dominance scored 
higher (F = 5.58, df = 1/51, p < .05). The 
means were 3.0 for high dominance and 2.3 for 
low dominance. On a third question, the high- 
dominance subjects reported that they had 
enjoyed the experiment in general more 
(F = 3.76, df = 1/51, p = .06). The means 
were 3,46 for high dominance and 2,93 for low. 

Taken together, these three results from the 
questionnaire indicated that those who were 
high on dominance enjoyed the situation more 
and were more personally involved. These 
groups were extremely task oriented. The 
major job of each group member was to try 
to solve the task and convince the others that 
his solution was the correct one, which is a 
situation providing potential rewards for 
people who score highly on dominance but 
not for those who are low on this characteristic. 
Questions about enjoyment of one’s position 
in a group, personal involvement in the task, 
and general enjoyment of the experiment can 
be taken as different ways of inquiring of the 
subject if the situation contained events that 
were reinforcing for him. In the relatively 
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homogeneous social structure created in this 
experiment, only dominance-type rewards were 
available. The low-dominance subjects would 
have found some other social group more en- 
joyable and involving. 


Message Content 


The contents of the messages were analyzed 
using Bales’ (1950) system? Each message 
card was scored once. The groups were almost 
entirely task oriented, there being few instances 
of messages falling in Bales’ first three cate- 
gories, “positive social-emotional content.” 
The modal number there was zero. A simple 
count of the number of messages falling in 
Categories 1, 2, or 3 showed these totals for 
the four conditions: high dominance-high 
power = 8; high dominance-low power = 2; 
low dominance-high power = 6, and low- 
low = 1. Using an exact test, this is sig- 
nificant at the .01 level. This replicates the 
finding of Watson and Bromberg (1965), that 
those who receive a relatively large number of 
messages are more likely to send some messages 
with positive social-emotional content. Domi- 
nance had no effect. 

There were so few instances of messages 
falling in Bales’ Categories 7-12 that the 
data could not be analyzed. The subjects, who 
were strangers interacting for a brief period, 
were very polite to each other. 

The Bales’ categories dealing with positive 
task orientation, Categories 4 and 5 par- 
ticularly, produced effects. The means for 
messages in Category 4, “Gives suggestion, 
direction,” were high dominance-high power 
= 19.2, high dominance-low power = 2.2, 
low dominance-high power = 9.2, and low- 
low = 1.8. All three terms of the analysis of 
variance were significant. Those with high 
power sent many more Category 4 messages 
than the others (F = 91.96, df = 1/50, 
p < .0001), indicating that the subjects’ posi- 
tion in the power structure was very effective 
in determining this behavior. The high- 
dominance subjects sent more Category 4 
messages than did the low-dominance sub- 


? Unfortunately one subject's messages were lost, so 
the » for the low-dominance-low-power cell was reduced 
to 13. My thanks to Barbara Brown and Chantis 
Stinson, who performed the Bales analysis, 
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jects (F = 16.72, df = 1/50, p < .00!), which 
parallels the finding that within the h'»:-power 
condition, the high-dominance subjec'= issued 
more commands. They were both dire tly and 
indirectly attempting to direct the ot! rs. The 
most striking effect was the Domi ance X 
Position in the Power Structure interaction 
(F = 14.34, df = 1/50, p < .001), indicating 
that a person who was high in domin»ace re- 


to be 
ion of 
ıs low 


sponded much more to the opportu: 
directive when he was placed in a | 
high power than did the person wh: 
in dominance. 

The results for the Category 5 v cssages, 
“Gives opinion, evaluation, analys: . . ., 
paralleled those for Category 4. T! 
were high dominance-high power 
high dominance-low power = 1.6; k 
nance-high power — 1.6; low-low - 
effect of power position was signific: 
5.54, df — 1/50, p « .03) as was th 
dominance (F — 5.89, df — 1/50, ; 
The interaction effect, showing ex 
same pattern as above, approa 
nificance (F = 2.79, df = 1/50, p = .10). 

There were no effects on messages falli 
Category 6. This is not surprising, sinc 
structure of the task and experimental 
cedure emphasized the exchange of opini 
directions, hunches, etc., about how 
and not the simple exchange of information. 


4.1; 


Personality and Social Structure 


The interaction pattern in the analysis of 
message content is a very good example of the 
interaction of personality and social system. 
The low-dominance-high-power subjects did 
send more directive, Bales’ Category 4, mes 
sages than did the low-low subjects, which 
may have been a reaction to the social re 
sponsibility of being appointed as the group 
coordinator. It would be embarrassing to be 
the coordinator and do nothing, particularly 
as it had been stressed that the group required 
coordination. Being the coordinator may have 
also been positively reinforcing because the 
coordinators received more messages. But 
the high-dominance-high-power subjects sent 
twice as many directive messages as tht 
low-dominance-high-power group. They vet 
busily engaged in the task. 


he characteristics of the group’s task, and 
ed power structure, created a social 
in which directing others was an 
available reinforcement. It would be 
reinforcing for those who were high in 
minance. Given a person who finds direct- 
thers to be reinforcing, a high-dominance 
, and an opportunity to engage in direc- 
behavior, which was a position of high 
; the person zealously engaged in task- 
riented behavior. 
he interaction pattern in message content 
s à reinforcement theory of the inter- 
of personality and social structure. 
should have been a parallel effect in the 
onnaire data, but there was not. Whether 
due to a fault in the theory, or with the 
ement of the effect, is not clear. Since 
sory did successfully predict one inter- 
pattern, it might be most parsimonious 
earch first for faults in measurement. For 
ple, the questionnaire data were gathered 
‘the experimental task was completed, 
‘the message content analysis was 
ed on data produced while the task 
progress. Second, questionnaire data 
the subject to interpret his reactions 
e situation, while message content may be 
e direct index of his reaction. A rein- 
does not depend upon the subject being 
e to verbalize its effect in order to be 
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Some of the data support the theory. Social 
structure determines the availability of certain 
kinds of reinforcements, and the individual 
group members differ in the degree to which 
they happily seek those rewards. 
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REINFORCING EFFECTS OF SPEAKING IN REPLY ! 
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A definitive difference between the monologue of mass communications and the 
dialogue of personal conversation is the opportunity of each participant to reply 
to the other, Replying plays a fundamental role in determining the participation 
of each party to the conversation, rather than serving only as a mere dependent 
variable of each participant’s behavior or as an influencer of one participant by the 
other. Participation in conversation may be reinforced by the opportunity to speak 
in reply. Latency data from four experiments (N = 418) are isomorphic with an in- 
strumental escape conditioning model in demonstrating analogues of (a) acquisition, 
(6) extinction, (c) partial reinforcement effects, (d) delay of reinforcement effects, 


(e) Delay X Acquisition Trials effects, and (f) drive effects. The research employed 
the general approach which Neal Miller has called “extension of liberalized S-R 
theory,” and draws upon the motivation theories of Dollard and Miller, Byrne, 


and Festinger. 


Fortunately, most human communication 
does not yet take the form of the television 
commercial, the mass lecture, or the laboratory 
persuasive communication, in which messages 
are directed by a communicator to an audience 
that has no direct opportunity to reply. Still 
ubiquitous in human life is conversation be- 
tween two or a few more people, one of the 
distinguishing features of which is the fact 
that each participant has the opportunity to 
reply to the other “communicators.” Con- 
versations conducted under formal rules or 
professional auspices, such as interviews, 
psychotherapy, group discussion, negotiation, 
and bargaining, are each the subject of such 
intense specialized attention from various 
segments of the profession of psychology that 
they are seldom all mentioned in a single 
sentence. Researchers have devoted consider- 
able attention to the reinforcement of con- 
versational behavior (e.g., Aiken, 1965; Cervin, 
1955; Hastorf, 1965; Kanfer, 1964; Krasner, 
1958; Levin & Shapiro, 1962; Matarazzo, 
Wiens, & Saslow, 1965; Oakes, Droge, & 
August, 1960; Salzinger & Pisoni, 1960; Sarbin 
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the College of Arts and Sciences of the University of 
Oklahoma. The authors thank John S. Ezell for acutely 
needed interim support. The authors also thank John 
Lamberth and Philip G. Zimbardo for helpful com- 
ments on the manuscript. 

* Requests for reprints should be sent to Robert 
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Oklahoma 73069. 
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& Allen, 1968; Verplanck, 1955). D 
variables have included measures o 
behavior, or, as in the present study, indexes 
of social participation. The importance of 
studying one person's reply to another is 
generally acknowledged, but its signi/icance 
has not been fully apprehended; wen the 
focus of the research is on the person replying, 


ndent 
erbal 


the reply is seen as a mere dependent viable; 
and when the focus of the research is on the 
effects of the reply, the reply is seen on; as an 
influencer of the other person. It seen: likely 
that if the opportunity to reply is so funda- 


mental an aspect of human communication, 
then it might be very illuminating to focus 
upon the effect of replying upon the person 
replying himself. Common observation in- 
dicates that people engaged in conversation 
are often eager to present their viewpoint, to 
"put their two cents in," particularly in direct 
reply to a participant of differing opinion. It 
is possible that people show this eagerness to 
reply because there is something reinforcing 
about speaking in reply, and that participation 
in conversation depends, in part, upon such 
reinforcement. If this is so, then in an experi- 
mental conversation, a subject should learn an 
instrumental response, the reinforcement for 
which is the opportunity to speak in reply. The 
purpose of this research was to determine 
whether speaking in reply does indeed function 
as a reinforcer. If the proposed similarity be- 
tween speaking in reply and conventional 
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reers is to be morc 
me, it is essential to demon 
reply exhibits a numb 
ies of reinforce 
ht be acquisition e!ic: 
of reinforcement 
cement effects, and ar 
fects among such vari 
p The effects of reinfor 
ype of experimental learn ı 
byed; there are differen 
“tween instrumental c 
‘crimination learning, 
en instrumental res 
trumental escape cor 
conversation experimen! 
designed to be analogon 
trumental conditioning v 
question of whethe: : 
ments are analogou 


in a superficial 
te that speaking 
he fundamental 

among which 
‘tinction effects, 
, partial rein- 
iate interaction 


forc 


depend on the 
paradigm em- 

or example, be- 
‘ing and dis- 
as differences 
onditioning and 
ming. The four 
rted here were 
ne paradigm of 

ı discrete trials. 
onversation ex- 
cape or reward 
one: if speak- 

is the drive on 
sed appetitive or 
ʻe reinforcer like 
is it rather the 
ious drive like 
ho enjoys a good 
ing assumption 
sitive reinforcer, 
on this assump- 


this reinforcer is ! 
e? Is replying a p 


s of terminating 

nor anxiety? To anyo! 
discussion, an intuitively apy 
‘that speaking in reply i 
the research was ini 
. There is considerable theoretical founda- 
however, for the assumption that the 
Situation is aversive. The subjects reply to 
“Statements that disagree with their own 
nions. Dollard and Miller (1950, pp. 


on social punishment of behavior which 
‘insufficiently logical, lacks explanation, or 
fails to make a correct report of the environ- 
ment. Byrne and his associates (e.g, Byrne, 
1969; Byrne & Clore, 1967; Byrne, Young, & 
Griffitt, 1966; Stapert & Clore, 1969) have 
ther developed these concepts under the 
ne of ellectance drive and have brilliantly 
oited their implications for attraction and 
ed social processes. Their work indicates 
the noxious drive of effectance can be 
used by opinions which are dissimilar to 
Subject’s own (e.g., Bryne & Clore, 1967) 
disagreement with the subject by other 
8 (Stapert & Clore, 1969), and that the 
Sentation of such opinion statements fol- 
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lowing an instrumental response is punishing 
(Byrne, Griffitt, & Clore, 1968; Byrne et al., 
1966). Moreover, effectance induced by prior 
disagreement with other persons exhibits the 
characteristic differential energizing effect of 
drive on dominant and competing responses 
in paired-associate learning (Lombardo, Libku- 
man, & Weiss, 1970). Byrne (1969) and 
Dollard and Miller (1950) indicated that such 
drive may be reduced by evidence of one’s 
correctness, logicality, or having an explana- 
tion; it seems clear that speaking in reply 
would provide such evidence. 

Upon the assumption that the experimental 
conversation paradigm reported here is analo- 
gous to the learning paradigm of instrumental 
escape conditioning, the following predictions 
can be made about latency and extinction 
measures of the strength of an instrumental 
response reinforced by speaking in reply. 
Response speed (1/latency) is (a) an increasing 
function of the number of trials reinforced by 
the opportunity to speak in reply; (b) a de- 
creasing function of delay of the reinforcing 
opportunity to speak in reply; (c) a multi- 
plicative combination of the effects of trials 
and delay; (d) faster for continuous reinforce- 
ment (opportunity to reply on all trials) than 
for partial reinforcement (opportunity to 
reply on only some trials) ; (e) a decreasing 
function of the number of extinction trials 
(opportunity to reply withdrawn); (f) a de- 
creasing function of drive when the level of 
drive reduction is held constant. Moreover, 
(g) resistance to extinction is greater when 
acquisition has taken place under partial 
reinforcement than it is under continuous 
reinforcement. These predictions diverge from 
those of a reward conditioning model only in 
the case of partial reinforcement acquisition 
effects (Prediction d) and the dependence of 
drive effects upon the amount of reinforcing 
drive reduction (Prediction /). 


GENERAL METHOD 


The present experiments do not attempt to dupli- 
cate real-life conversations in all their richness and 
complexity. 


A laboratory experiment need not, and should not, 
be an attempt to duplicate a real-life situation. The 
laboratory experiment should be an attempt to 
create a situation in which the operation of variables 
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will be clearly seen under special identified and 
defined conditions [Festinger, 1953, p. 139] 


The present experiments attempted to create a situa- 
tion in which the operation of the replying variable will 
be clearly seen, under special “pure” conditions, 
which deliberately exclude such important but ob- 
scuring complexities as replies to the replies, etc. On 
each trial of the four experiments, the subject listened 
to another person's viewpoint and could then press a 
switch (the instrumental response), the reinforcement 
for which was the opportunity to speak in reply to the 
other person. 


Deception and Masking Task 


The experiment was represented to the subjects as 
a study of opinion change. “We are interested in how 
your opinion may be affected by what someone else 
says; how your opinion may be affected by what you 
yourself say; and how what you say may affect the 
opinions of someone else.” As a masking task, after 
each statement and reply, the subject indicated whether 
he had changed his opinion by moving a dial. Of 30 
subjects given a questionnaire after the experiment, all 
believed this deception; on an open-ended question, 
no one even mentioned the switch, which appeared to 
them as a mere incidental to the serious business of 
opinion change through conversation. 

The other person to whom the subjects spoke was a 
tape recorder, For this reason, the “other person" was 
said to be in another room from the subject, and the 
experimenter was in fact in another room. The subject 
was told that this would ensure confidentiality of his 
candid opinions on controversial issues. The subject 
addressed the other person and the experimenter over 
an intercom system, and could also hear the experi- 
menter give instructions and occasional comments 


(“speak a littler louder, Subject A") to the nonexistent 
other person. 


Apparatus 


The subject's room and the experimenter's control 
room shared a common wall. The subject was seated 
at a table facing this wall, which included four windows 
of transparent mirror glass. The. windows were opaque 
except when illuminated from behind, and instructional 
signals appeared in each window upon illumination. 
The signals were the large printed words (a) “Jisten,” 
(b) “throw switch if you wish to comment," (c) “talk,” 
and (d) “move dial to final opinion." A panel mounted 
on the table top contained the subject's “comment” 
switch (a telephone toggle switch with a spring return), 
a microphone, and a speaker, as well as the masking- 
task opinion-change dial. Also mounted in the wall were 
two Standard Electric timers, by means of which the 
subject could monitor the length of time he and the 
other subject, respectively, had spoken on each trial. 
On the experimenter's side of the wall were the con- 
trols for turning on the various signals, a microphone, 
a cassette tape recorder containing the other person, 
a control timer, and a Lafayette digital stop clock, 
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which automatically measured the subjec 
latency to .01 second. The subjects m 
opinion-change dial was attached to a 
motor which drove a matching dial in the c 
enabling the experimenter to determine w} 
ject was finished with the intertrial maskin 
therefore ready for a new trial. 


response 
ing-task 
hronous 
j| room, 
he sub- 
ask and 


Procedure 

'The subject and the other person (a sed as 
“Subject B" and “Subject A") received th eptive 
rationale and the operating instruction r the 
intercom. It was explained that the othe n had 
been provided with a list of topics. The otter person 


would begin with the first topic on his list, ng the 
topic to the subject and then offering his opi on the 
topic. The subject might the an rity to 
comment on the other person’: vished. 


The subject and the other pei 


a 


Í the 


but they were never interrupt: 
other person were alwa: 
An experimental trial 


"listen " signal 
topic and opinion by the 
message ended, the 


experimenter operated th which h (a) 
presented the CS, the sig nal à Witch if you wish 
to comment,” and (6) started th lacy timer. When 
the subject threw the comment switch, the jatency 
timer stopped. If the trial was a reinforced one, the 
"talk" signal followed the switch-press response, and 
the subject spoke in reply. Subjects were silowed to 
exceed the suggested 20-second speaking time if they 


wished, since interruption would have been punishing 
(Mandler & Watson, 1966). On nonreinforced trials 
(partial, extinction), the talk signal did not go on, and 
the subject waited 20 seconds until the “move dial 
to final opinion” signal went on. 

In later extinction, the subjects sometimes did not 
throw the switch within 20 seconds: latency was then 
considered to be infinite, and speed counted as zero. 
In acquisition, response probability quickly approxi- 
mated 100%, just as in a straight alley, but occasionally 
a subject did not press the switch on a particular trial. 
In this case, speed was also counted as zero, but an 
unscored make-up trial was given to equalize the 
number of reinforcements. 


Relationship to Instrumental Conditioning 


The procedure described was closely modeled on 
traditional discrete-trials instrumental conditioning. 
The reinforcement. (speaking in reply) was contingent 
on the instrumental response (switch pressing). The 
use of inappropriate dependent variables is one of the 
most common failings of social learning research. 
Cottrell (1968) and Weiss (1962, 1968; Weiss & Miller, 
1971) have strongly emphasized the importance of 
choosing dependent variables which are appropriate 
to the theory to be tested. In discrete-trials instru- 
mental escape conditioning research, and in the Hull- 
Miller-Spence learning theory which underlies much of 
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this research, response speed (1/latency) and resistance 
to extinction 3 ;e appropriate dependent variables. 
In the conv ion experiments, the dependent 
variable was (1/latency), measured from the 
time of the tation of the CS (signal “press 
switch if you » comment") to the instrumental 
switch-pressi; se. It is also worth mentioning 
that some rc rs who are not intimately familiar 
with the ty nditioning research on which the 


; modeled sometimes confuse trial- 
ith instrumental conditioning. The 
i above conforms to the paradigm 
onditioning, in which “the aim of the 
arrange for the occurrence of but 


of instrumenta; 
experimenter 


a single resp response chain which is followed 
by reinforce: and that the experimenter there- 
fore attempt 
maximize fr he first the occurrence of this to-be- 
learned response and to minimize the occurrence of 
competing : nses. In the maze type of instru- 
mental site; on, the likelihood of competing re- 
sponses occurring is reduced by having but a single 
straight-abeod alley instead of a number of alterna- 
tive ones [5r once, 1956, pp. 37-38]. 
The instruct to the subject maximized from the 
first the oceurrcace of this to-be-learned response and 
minimized the urrence of competing responses. A 
single switch was employed, rather than a number of 
alternative oncs. 
Materials 
The opinions expressed by the other person were 


selected by s of a 120-item questionnaire ad- 
ministered to 100 subjects from the Psychology I pool. 
Selection criteria were that the opinions should be 
Teasonably consistent with each other and as dissimilar 
às possible from the opinion of the Psychology I 
Students. The original intent of this selection was 
merely to provoke a lively conversation, but it is clear 
that its effect was to provide materials which would 
tend to be effectance arousing. Since the opinions were 
selected with a typical subject in mind, rather than 
1g individually chosen by pretesting of each subject, 
te did not necessarily encounter an opinion 
ferent from their own on each and every trial, and 
i Procedure thus resembled the intermittent shock 
edure in esca; itioni .g, Franchina 
1966, 10694). pe conditioning (e.g. Fi " 
The subject and the other person were always of the 
“ame sex. The other person's statements were taped 
m people, otherwise unconnected with the research, 
Who spoke with a regional accent of about the same 
Sand as the average in-state student. Considerable 
EUM taken to insure that the other person's opinions 
Mine be expressed with about the same degree of 
i ulateness and sophistication as a typical student, 
cept in Experiment IV where these tapes were com- 
Pared with highly articulate and sophisticated ones. 
Order to insure that the effect of trials would not 
co ounded with the effect of particular topics of 
Versation, the taped statements were arranged in 
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sets of three or four statements, with each set on an 
individual tape cassette, and the order of presentation 
of these sets was completely counterbalanced within 
each experimental condition. 


Subjects and Design 


The subjects were 418 students from the Psychology I 
pool. Each experiment was factorialized for sex, and 
there was no main or interactive effect of sex in any 
experiment. 

Typically, the different groups in a conditioning ex- 
periment begin at a similar low level of performance, 
with differences in performance developing over the 
course of trials. As a general rule, therefore, tests for the 
significance of the difference between the various ex- 
perimental groups are made late in learning, over the 


iaf 


J F 


last few trials. Latencies are transformed into speeds "i 


for each trial of each subject, and individual-subject 
mean speeds are calculated for each subject over a 
block of the last few trials in order to reduce variability. 
These individual-subject mean speeds then serve as the 
scores upon which the analyses to test the differences 
between groups are conducted. Analyses across the 
whole course of learning are not used to test for the 
significance of the difference between the experimental 
groups, since these differences are only fully developed 
late in learning, but, if there is a theoretical reason for 
doing so, such an analysis may be used to determine the 
interaction between trials and the experimental groups. 
Equivalent procedures were used throughout the 
present research, 

In conditioning experiments it is usually taken for 
granted, on the basis of previous experience, that 
performance will improve over trials, and statistical 
tests for the significance of this improvement are 
generally unnecessary. In the novelty of the present 
circumstances, however, the significance of improve- 
ment over trials could hardly be taken for granted, 
and has been tested for in each case. 


No-Reinforcement Controls 


Normally, instrumental conditioning studies do not 
include a no-reinforcement control group, and this 
procedure was followed in the present research. The 
logic of this procedure is not merely that previous re- 
search indicates the relative ineffectiveness of the no- 
reinforcement procedure, but, more importantly, that 
within each new experiment the effects of different 
reinforcement conditions test for the effectiveness of 
the reinforcer. That is, delay of reinforcement effects, 
partial reinforcement effects, or extinction effects 
when reinforcement is discontinued indicate that 
performance is being affected by reinforcement. The 
same logic applies to the present research; pilot re- 
search indicated the ineffectiveness of a no-reinforce- 
ment procedure, and within each experiment, delay of 
reinforcement effects, partial reinforcement effects, 
extinction effects, etc., test for the effectiveness of the 
reinforcer. In addition, pilot research indicated that 
practice effects on motor skills could be ruled out by the 
fact that in a simple reaction time test, subjects in- 
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Fic. 1. Acquisition curves of response speed at 
two levels of delay of the reinforcing opportunity to 
speak in reply. 


structed only to press the switch as quickly as they 
could following the CS signal performed faster on their 
first such reaction time trial than the conversation sub- 
jects did at asymptote. 


Experiment I: DELAY OF REINFORCEMENT 


If the opportunity to speak in reply func- 
tions as a reinforcer of the instrumental switch- 
pressing response, then the time interval be- 
tween the instrumental response and the 
reinforcing opportunity to reply should have 
the functional properties of delay of reinforce- 
ment. One property of delay of reinforcement 
is that responses reinforced with a short delay 
of reinforcement are stronger (as measured by 
response speed) than responses reinforced with 
a long delay of reinforcement (Fowler & 
Trapold, 1962; Tarpy & Koster, 1970; Woods 
& Feldman, 1966; see also Karras, 1967; 
Penney, 1967), and the corresponding result 
should hold in our conversation experiment. 
Another property of delay is that it interacts 
multiplicatively with the number of condi- 
tioning trials; delay effects are greater when 
the number of trials is larger (diverging curves 

on a graph), and again the Corresponding result 
should be expected here. Of course, a main 
effect of trials was also expected: speed should 
be an increasing function of the number of 
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trials reinforced by the opportunity 'o speak 
in reply. 
Method 

There were 160 subjects, half of whom ved the 
opportunity to speak in reply (talk-sig n) ime 
mediately after they pressed the switch « h trial” 
and the other half of whom received the rtunity: 
to speak in reply 3 seconds after they had sed 
switch on each trial. The delay of reinfor nt 
regulated automatically by a Lafayette 5001 terval 
timer which was started by the subject’sswi pressing: 


response and turned on the “talk” signal aater 0 or 
3 seconds. Each subject received five acquisition trials. 


Results and Discussion : 


Figure 1 shows learning curves oí nean re- 
sponse speed at two levels of del S 


proved over trials, and the acquis urv 
of the two delay groups dive over trials 
so that the zero-delay group gradually be- 
came superior to the 3-second-delay group. 
Following standard conditioning p 
individual subject means were cal d 
each subject over the last three trials, and & 


t test between the short and long delay groups 
indicated that the delay effect was signifi 
in the predicted direction (¢ = 2.06, df = 158, 
b < 025, one-tailed). In order to determine 
the significance of the trials effect and the ap- 
parent multiplicative interaction between delay 
and trials, a 5 X 2 analysis of variance was 
performed over all five acquisition trials. The 
trials effect was highly significant when tested 
against the within-subject error term (F = 
7.12, df = 4/632, p < .001) as was the Delay 
X Trials interaction (F = 8.62, df = 4/632, 
p < 001). a 

The results thus indicate that speed „isi 
faster for immediate than for delayed “reing 
forcement,” an increasing function of trials, 
and a multiplicative combination of the effects 
of delay and trials, as would be predicted from 
an instrumental conditioning model. Such re- 
sults are compatible with either an instrumen- 
tal escape conditioning or an instrumental 
reward conditioning model. 


EXPERIMENT II: PARTIAL REINFORCEMENT | 


If the opportunity to speak in reply func- - 
tions as a reinforcer, then giving that oppor- 
tunity on only half of the trials should have the 


T In escape c 
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functional properties of partial reinforcement. 
ditioning, partially reinforced 
lower than continuously rein- 
(Bower, 1960). This pattern 
from the characteristic effects 
ment in instrumental reward 

re it is only early in learning 
are slower than the con- 
tinuous, and ihe partials gradually overtake 
the continuous so that later in learning the 
pattern is the opposite of escape conditioning 
(e.g., Spence, 1960; Weinstock, 1958). In both 
reward and cscape conditioning, resistance to 
extinction is ¿reater for partial than for con- 
tinuous reinforcement, with escape-conditioned 


responses ar 
forced response: 
is quite diffe 

of partial rei 
conditioning 
that the p 


responses learned under partial reinforcement 
showing a particularly sustained resistance to 
extinction. 
Method 

There were 100 subjects, half of whom received the 
opportunity to speak in reply on every trial that they 


pressed the switch and half of whom received the op- 
portunity to speak in reply on only 50% of the trials on 
which they pressed the switch. Five different sequences 
of reinforced and nonreinforced trials were employed 
cinforcement group. Following eight 
, extinction. was begun: for six con- 


secutive trial 


t »ject in either group received the 
opportunity to when he pressed the bar. 
It was conside: irable to determine whether the 
opportunity to reply was equally reinforcing for rela- 
lively verbal and nonverbal subjects, and the experi- 


ment was therefore factorialized into five levels of 
“phe ability as measured by the American Council on 
ducation Psychological Exam. 


Results and Discussion 


Figure 2 shows acquisition effects in conver- 
sation which are analogous to those in instru- 
Mental escape conditioning: the continuous 
reinforcement group is faster than the partial 
reinforcement group. A £ test between the par- 
tial and continuous reinforcement groups, on a 
as of the last three acquisition trials, in- 
E that the partial reinforcement acquisi- 
10n effect was significant (t = 2.29, df = 98, 
d zi one-tailed) in the direction pre- 
i by the escape conditioning model. 
id yses of variance over the first nine trials 
cated that the continuous reinforcement 
"D improved significantly (F = 5.52, 
Hh 8/392, p — .001), as did the partial group 

7 2.39, df = 8/392, p < 05). 
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Fic. 2. Acquisition curves of response speed under 
partial and continuous opportunity to speak in reply. 


The continuous reinforcement group shows 
a clear gradual extinction effect which is 
significant by a trend analysis (linear F = 
13.03, df = 1/125, p < .001). The partials show 
no evidence of extinction, and the interaction 
of extinction trials with reinforcement schedule, 
analyzed over all six extinction trials, in- 
dicated that the partials are indeed more re- 
sistant to extinction than the continuous 
(F = 3.36, df = 5/490, p < .005). Both the 
acquisition and the extinction curves show a 
striking similarity to the escape conditioning 
curves, which extend even to the persistence 
of the partial reinforcement group in extinction 
(Bower, 1960). 

Analyses of variance revealed no main or 
interactive effect of verbal ability with rein- 
forcement schedule or trials in either acquisi- 
tion or extinction (all Fs < 1). 


EXPERIMENT IIT: PARTIAL REINFORCEMENT 
WITH EXTENDED TRIALS 


Since the partial reinforcement effect pro- 
vides a particularly clear basis for discriminat- 
ing between an escape conditioning and a 
reward conditioning model, it was important 
to ascertain whether the phenomenon was 
replicable and whether there was any pos- 
sibility that the partials would overtake the 
continuous, in the manner of reward condition- 
ing, if more acquisition trials were given. Ac- 
cordingly, the acquisition phase of Experiment 
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Fic. 3. Acquisition curves of response spced at two 
levels of formidability of the other person. 


II was replicated, with 16 acquisition trials 
rather than 8 (W = 98). A replication of the 
results of Experiment II was predicted. 


Results and Discussion 


The results replicated Experiment II: the 
curves resemble Figure 2 simply extended for 
another eight trials. Individual-subject mean 
asymptotic speeds were calculated for the last 
three trials of acquisition, and the continuous 
group was faster than the partial group (Mann- 
Whitney z = 1.60, p = .055, one-tailed), Evi- 
dence of acquisition was obtained once more. 
Analyses of variance over all 16 acquisition 
trials indicated that the continuous reinforce- 
ment group improved Significantly (F = 2.54, 
df = 15/720, p < .005), as did the partial group 
(F = 181, df = 15/720, p < 05). 

The curves asymptoted by about the eighth 
trial and were unmistakably asymptotic well 
before the final block of three trials (Trials 
14-16) compared above. It is therefore un- 
ambiguously clear that the pattern of results 
resembles instrumental escape conditioning, 
rather than instrumental reward conditioning. 
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EXPERIMENT IV: FORMIDARI OF 
THE OrHER PERSON 


This experiment was originally i led asa 
methodological study to determ ptimum 
levels of strength and articulat: of the 
other person's viewpoint in rela to the 
verbal ability of the subjects. The experi 
menters wished to determine whether subjects 


of high verbal ability might perform better if 
the other person were a more formidable con- 
versationalist than the average subject" tapes 


normally employed, whether subjcc'* of lesser 
verbal ability might be overwhel: y such 
a formidable conversationalist, similar 
questions, 
Method 

The design was a 2 X 2 factorial wi evels of 
formidability of the other person (standar d sharp) 
and 2 levels of verbal ability (V = 60). standard 
tapes were the tapes used in Experiments I, |T, and Ill, 
which had been very carefully designed to nble the 
limited articulateness and sophistication ı typical 
subject performing ad lib. The sharp tape re made 
by the same persons who spoke on the standard tapes, 
but now the logic, phrasing, and factual ::is of the 
opinions were strengthened. Of course, care was taken 
to insure that neither the standard nor the sh rp tapes 
were hostile, insulting, arrogant, etc.; and in oth cases, 
the other person spoke in his usual moderate regional 
accent. Verbal ability was manipulated by selecting the 


highest and lowest 20% of the subjects on the verbal 
scales of the American Council on Education Psy- 
chological Exam. 


Results and Discussion 


Figure 3 shows the acquisition curves for 
the sharp and standard tape groups. Perform- 
ance was superior in the standard tape condi- 
tion. Individual-subject means were calculated 
over the last 5 trials, and the analysis of 
variance indicated that the difference between 
the sharp and standard tape conditions was 
significant (F = 8.93, df = 1/56, p < .005) 
and that there were no significant main of 
interactive effects of verbal ability. The m- 
formal observations of the experimenters Te- 
vealed no tendency on the part of the subjects 
rise to the occasion provided by the sharp tapes 
to improve the quality or quantity of their 
own answers. Analyses of variance over all 
12 trials indicated that the standard tape 
group improved significantly (F = 4.02, df = 


, as did the sharp tape group 
1/319, p < .05). 
riment was conducted for 
rposes and yielded informa- 
f intrinsic value, its greatest 
! only when it is interpreted 
scape conditioning model. 
ı rests on two assumptions, 
tance theory and one on ob- 
e subjects. The first assump- 
| effectance theory: if the 
s arouse effectance by ques- 
Us beliefs, then the sharp 
ore effectance arousing than 
pes. “Disagreement from a 
er would be more punishing 
it from an incompetent stran- 
ess, Ch. 13].” The second 
sed on the observation that 
vive better or more extensive 
harp other person than to the 
on: the reinforcing drive 
‘ded by speaking in reply was 
! the amount of effectance drive 
hese assumptions are correct, 
n analogue of an escape condi- 
‘cedure in which the amount of drive 
> alley) is increased, but the 
inforcing drive reduction (shock 
tion) is kept constant. Such an experi- 
has been conducted by Campbell and 
ing (1953), who found that a constant 
ive reduction had progressively less effect 
higher the level of drive: that is, when all 
received 100-volts drive reduction, the 
olt high-drive group ran slowest, the 300- 
medium-drive group ran faster, and the 
volt low-drive group ran fastest. It there- 
follows that if the sharp tape induces a 
level of drive than the standard tape 
l the reply produces a constant amount of 
€ reduction, then the standard tape group 
d perform better than the sharp. Of 
this interpretation might seem too 
T by half, were it not for the evidence of 
other three experiments. 


GENERAL DISCUSSION 
this paper the general approach which 
Miller (1959) has called “extension of 
alized S-R theory" was employed to 
P and test a theory of certain aspects 
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of conversational behavior. Hull-Miller-Spence 
learning theory and its related research in 
escape conditioning (collectively referred to 
hereafter simply as “learning theory"; eg., 
Brown & Farber, 1968; D’Amato, 1969; 
Dollard & Miller, 1950; Hull, 1943; Logan, 
1959; Spence, 1956, 1960; Spence & Spence, 
1966) is used as a model for the effects of 
replying in conversation. Hullian learning 
theory has been developed primarily to predict 
individual behavior, but it has nevertheless 
been extended to numerous social processes 
with a considerable measure of success (e.g., 
Cottrell, 1968; Cottrell & Wack, 1967; 
Dollard & Miller, 1950; Lott & Lott, 1968; 
Miller & Dollard, 1941; Weiss, 1963, 1968; 
Weiss & Miller, 1971; Wheeler, 1966; Whiting 
& Child, 1953; Wolpe, 1958; Zajonc, 1965). 
Technique of theory construction, The use of 
a model in theory construction typically in- 
volves the specification of a dictionary of 
analogies (rules of correspondence) which 
relate the variables of the model to the vari- 
ables of the research area to be predicted and 
explained. Once this is done, the relations hold- 
ing among the variables of the model must, 
theoretically, also hold between the correspond- 
ing (analogous) variables in the data area to be 
explained (e.g., Brodbeck, 1959; Campbell, 
1920; Lachman, 1960; Oppenheimer, 1956). 
The systematic use of learning theory as a 
model for conversation makes it possible to 
use known principles of conditioning to deter- 
mine whether any analogous principles operate 
in conversation. For example, there are a 
number of principles regarding the effects of 
delay of reward on measures of instrumental 
escape conditioning, which may be stated 
informally as (a) the delay of reinforcement 
gradient is decreasing and negatively ac- 
celerated in shape; (b) delay of reinforcement 
and number of trials combine multiplicatively ; 
and (c) shorter delays of reinforcement maxi- 
mize the effects of differences in magnitude 
of reinforcement. If a social variable is to be 
analogous to delay of reinforcement in escape 
conditioning, its effects on appropriate de- 
pendent measures must show that (a) this 
social variable gradient is decreasing and 
negatively accelerated in shape; (b) this social 
variable and number of trials combine mul- 
tiplicatively; and (c) smaller values of this 
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social variable maximize the effects of dif- 
ferences in magnitude of reinforcement. 

The above points illustrate the development 
of an analogy between learning and social 
independent variables. The analogy does not 
have testable implications until analogies be- 
tween learning and social dependent variables 
are also developed. Moreover, it is necessary 
to clearly specify which learning paradigm the 
social conditions are analogous to; instrumental 
reward conditioning, instrumental escape con- 
ditioning, selective learning, etc., differ sharply 
in certain regards. 

Models and translations, 'The theory outlined 
here is not a “mere translation of one be- 
havioral science language into another," since 
the theory generates hypotheses that are not 
in the existing literature. A “mere translation” 
would require, for example, that the hypotheses 
concerning the previous paragraph's social 
variable already exist in the literature of con- 
versation research before they could be merely 
translated into statements about delay of 
reinforcement and number of trials. 

Analogy and reduction. Nowhere in this paper 
is it proposed that social behavior represents a 
“distinct higher level of phenomena which can 
only be fully explained by reducing it to a 
lower level of phenomena such as condi- 
tioning.” Nor is this dubious philosophical 
program implicit in our intentions. Rather, 
the intention has been to use the relatively 
well understood as a model for that which is 
presently less well understood, The program- 
matic assumptions involved are commonplaces 
of scientific method: well-constructed models 
and analogies help to stimulate and guide 
research and to integrate broad ranges of 
knowledge through an underlying set of 
common principles. Indeed, if the progress of 
this research on conversation were to outstrip 
escape conditioning research, it would be 
perfectly permissible to use the principles of 
conversation as a model for escape condi tioning. 

Rules of correspondence. The rules of cor- 
respondence relating the variables of escape 
conditioning to the variables of conversation 
are given below and numbered for later refer- 
ence. The concepts underlying the formal rules 
of correspondence have been developed earlier 
in the study: the initial disagreement of the 

other person with the subject functions as an 


"1 
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arouser of noxious drive (fear, eic nce, dis 
sonance), and speaking in reply í ions as a 
reinforcer. Corresponding to a re ed trial 
in escape conditioning is a conve- tion trial 
in which the other person initi lisagrees 
with the subject and in which the 1mental 
response affords the subject an opportunity te 
speak in reply [1]. Correspondin: to a non- 
reinforced trial in escape conditioning is ay 
conversation trial in which the person 
initially disagrees with the subj jut the 
instrumental response does not rd the 
subject an opportunity to speak i ly [2]. 
A series of such *nonreinforced" t follow- 
ing acquisition is analogous to extinction [3], | 
while a mixture of trials on which the instru- | 
mental response sometimes affords |). subject | 


the opportunity to reply and sometimes does 
not is analogous to partial reinforcement [4]. - 
It is also possible to have a trial on which the 


other person’s initial statement docs not dis- 
agree with the subject, correspondiag to a 
procedure sometimes used in escape condition- 
ing, in which the shock is omitted «.!‘ogether 


from some trials [5]. A mixture of such trials 
with the normal initial-disagreemen: trials is 
analogous to intermittent shock [6]. 

The time interval between the instrumental 
response and the opportunity to speak in reply _ 
corresponds to delay of reinforcement [7]. 
Correlated reinforcement, in which the presen- 
tation of the reinforcer is contingent upon the 
speed of the response, corresponds to the situa- 
tion in which the opportunity to reply 1s 
contingent upon the speed of the instrumental 
response [8 ]. If speaking in reply is a reinforcer, 
then the quantity [9] or quality [10] of the 
reply should correspond to magnitude of 
reinforcement, but controlled manipulation of 
these variables poses a considerable problem. 
Thus, for example, the duration of a subject’s 
reply to an interviewer has been found to be 
an increasing function of the duration of the 
interviewer’s statements (Matarazzo et al. 
1965), but a lengthy statement disagreelng 
with the subject could easily prove more drive 
inducing than a short one. In actual conversa- 
tions, one person’s reply sometimes change 
the other person’s mind. It seems likely that 
if the other person yields to the subject’s reply, 
this would be more reinforcing than replying 
alone; and this operation should therefore als 
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magnitude of reinforcement 
assumption is consistent with 
conception of the role of con- 

on in reinforcement and effect- 
. It is also in this instance of the 
ly that the relationship of the 
to dissonance theory becomes 
[he existence of disagreement 
s of a group on some issue or 
if perceived by the members, cer- 
s cognitive dissonance [Fest- 
178]," and one “way of reduc- 
nce would be to influence those 
disagree to change their opinion 
re closely corresponds to one's 
, 1957, p. 182] Even when 
pportunity for the reply to elicit 
upport, as in the present experi- 
nance interpretation still appears 
s, if replying has often elicited 
ort outside the laboratory, the sub- 
vve to come to expect replying to 
ial support. Rendered in learning 
s, replying would serve as a secondary 
cer (although it should be borne in mind 
t the evidence for secondary reinforcers 


i is no 
socia: 


d on ths reduction of aversive drives is 
biguous; £.g., Dinsmoor & Clayton, 1966; 
n, 1966). Alternatively, Festinger (1957) 


ated that such dissonance could be 
reduced by intensification of the original belief, 
‘speaking in reply might provide a mech- 
for intensification of belief. Festinger 
s dissonance as comparable to other 
, and it has been shown that dissonance 
exhibit the energizing property of drive 
conceived in learning theory (Cottrell, 
ki, & Smith, 1970; Cottrell & Wack, 
Waterman, 1969; Waterman & Katkin, 
Analogues of drive intensity are directly sug- 
d by the drive theories of Dollard and 
et, Byrne, and Festinger. Thus, for ex- 
ple, effectance theory suggests that drive 
d be an increasing function of the in- 
‘terest of the topic of conversation [12] (Byrne, 
London, & Griffitt, 1968; Clore & Baldridge, 
3968) and the competence or formidability of 
the other person [13] (Bryne, in press, Ch. 13; 
*almer, 1969), a decreasing function of the 
Yerifiability of the opinions expressed [14] 


or SPEAKING IN REPLY 195 
(Byrne, Nelson, & Reeves, 1966), and that less 
drive should be aroused by the other person's 
expression of opinions on impersonal topics 
than by his expression of negative personal 
evaluation of the subject [15] (Byrne & 
Lamberth, 1970; Byrne & Rhamey, 1965; 
Clore & Baldridge, 1970), Dissonance theory 
also clearly suggests the verifiability variable, 
as well as the attractiveness [16] and ex- 
pertness [17] of the other person, the extent 
of the disagreement [18], and the number of 
persons in the conversation who disagree with 
the subject [19] (Festinger, 1957, pp. 179-180). 
It is not, of course, suggested that these drive 
theories are necessarily compatible (e.g., on 
the attractiveness of the other person, cf. 
Festinger, 1957, with Byrne, Ervin, & Lam- 
berth, 1970; Byrne, London, & Reeves, 1968), 
but rather that it would be unfruitful to 
commit the present theory to one or another 
drive mechanism at the present time. 

Corresponding to the dependent variable of 
response speed is the speed of the instrumental 
response (e.g., switch pressing) [20]. 

Boundary conditions. Any theory is in- 
tended to apply only within a limited range 
of conditions. “Ideally all such limitations 
should be stated explicitly by the theorist, 
although, in practice, he may not have recog- 
nized some of these and may not yet be ready 
to specify others [Logan, 1959, p. 300].” Since 
the theory of conversation developed here is 
based on analogies with conditioning research 
and theory, some of the most essential bound- 
ary conditions are also based on analogies 
with conditioning. The Method section en- 
titled “Relationship to Instrumental Con- 
ditioning” may be read as an explicit state- 
ment of some of these boundary conditions, 
particularly in regard to the choice of depend- 
ent variables, the discrete-trials procedure, 
the study of a single response, the maximizing 
from the first of the occurrence of this single 
response, and the minimization of competing 
responses. The study of conversation requires, 
by definition, an articulate organism with 
cognitive capacities not granted to white rats, 
and since deception and masking procedures 
are necessary for the study of conditioning of 
voluntary responses in articulate organisms 
(e.g., Spence, 1966), such deception and mask- 
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Ta addition to drive reduction, there is also 
the question of drive arouml The present 
theory and are carefully specified 
for situations in which subjects reply to state- 
ments that are drive arousing because the 
statements disagree with the subject's own 
opinions. Nothing has been said about con 
Versations in which the subject replies to 
statements that agree with his own opinions, 
nor is it clear that there is a single answer for 
such a situation. Statements that agree with 
the subject's own opinions are not generally 
believed to be drive arousing, rather they are 
believed to be drive reducing and reinforcing 
through the social support (e.g, Festinger, 
1957) or consensual validation (eg., Byrne, 
1969) which they provide. In such a situation, 
listening might be far more reinforcing than 
replying (Kanfer, 1964), and the subject might 
then reply only sufficiently to elicit further 
reinforcing statements of agreement. The 
reinforcing nature of does not, 
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functional relationship: 
among the variables of the condition: 
should also hold among the correspon: 
versation variables. The following dc: 
are illustrative, not exhaustive. (a) cj 
in escape conditioning, response spec 


Rules of Correspondence 7 and 20, the speed of 
the instrumental response in the conversation 
experiments should be a decreasing function 

the delay in the opportunity to speak in replys 
as found in Experiment I. (c) Delay X Trials 
interaction: in escape conditioning, delay of 
reinforcement and number of reinforced trials 


s\ovely to determine 
Trepedd, 1962). Given eke 


4 the opportunity to speak 
mber of trials on which that 
zn should combine multi 
ermine the speed of the 
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enforcement. Given Rules 
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escape condit response 
sing function of the number 
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function of the number of trials 
ostrumental response no longer 
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ment II, (f) Drive: in escape 
peed is an increasing function 
he drive is reduced to zero after 

-., Franchina, 1969b, Trapold & 
HLowever, when the reinforcing 

; is held constant, speed is a 

nction of the level of drive (Camp- 
Campbell & Kraeling, 1953; 
Myers, 1969: sce also McAllister & McAllister, 
1967; Woods, Davidson, & Peters, 1964; 
Woods & Holland, 1966). Given Rules of 
Correspondenc € 13 and 20, and the assumption 
that subjects do not give a better answer to à 
Sharp other person than to a standard other 
person, it follows that the standard group of 
iment IV should be faster than the sharp 
Soup. (e) Partial reinforcement extinction 
cts: in escape conditioning, resistance to 
“itinction, as measured by response speed in 
ion, is greater when acquisition has 
taken place under partial reinforcement than 
pea continuous reinforcement vac 
- Given Rules of Correspondence 1, 3, 4, 

and 20, it therefore follows that resistance to 
“tinction in the conversation experiments 
be greater when subjects have received 

the Opportunity to reply on only some trials in 
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GENERALITY OF IMPRESSION-FORMATION 


PROCESSES FOR 


EVALUATIVE AND NONEVALUATIVE JUDGMENT: 


DAVID L. HAMILTON ! ano LEROY J: HUFFMAN * 
Yale University 


Most studies of impression-formation models and processes have used some 
form of evaluative judgment as a dependent variable. The present stud; 
examined the generality of the findings for evaluative judgments to two other 
response dimensions, potency and activity. The data for evaluative and 
potency judgments supported a weighted averaging, rather than a summation, 
model, while neither model adequately accounted for the activity judgments. 
It was further hypothesized that trait words reflecting unconventional, non 
normative behavior would receive greater weight in forming first impressions 
than would attributes descriptive of most people. The data were generally 
consistent with this proposition, Implications of the findings for the generality 
of impression-formation models are discussed. 


One issue in the continuing line of research 
initiated by Asch’s (1946) study of impres- 
sion formation has been concerned with de- 
scribing the model that represents the process 
of information integration in forming initial 
impressions. Almost all of the studies con- 
cerned with this issue have used some kind 
of evaluative response (e.g., the “likableness” 
of a stimulus person) as the dimension of 
judgment. First impressions, however, involve 
more than simply a judgment of whether or 
not the person is likable. In forming an im- 
pression of another, we may also want to 
know whether he is intelligent or stupid, 
liberal or conservative, energetic or lazy, etc. 
A first impression, then, is a conglomerate of 
tentative judgments about another person, 
our attraction to him being only one (al- 
though perhaps a major one) such compo- 
nent. Hence, in discussing models of impres- 
sion formation, two questions need further 
investigation: What are the dimensions of 
judgment that characterize the impression we 
form of others? and Do we arrive at these 
different judgments by the same process; 
that is, Will the same model of information 
integration provide an adequate description 
of the judgment process for all judgment 
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dimensions? The present research was con- 
cerned with the latter question. 


With respect to the first question, previous 


research has demonstrated that persons do 
make nonevaluative judgments as art of 
their impression of strangers (Levy & Dugan, 
1960; Osgood, Suci, & Tannenbaum, 1957; 
Passini & Norman, 1966; Rosenberg, Nelson, 
& Vivekananthan, 1968). Severai studies 
(e.g., Osgood et al., 1957; Rosenberg et al., 
1968) indicate that potency and ac: are 


important dimensions of judgment, i» addi- 
tion to evaluation. However, the generality 
of combinatory models to these dimensions 
has been largely assumed. As Rosenberg et al. 
have pointed out, the evidence favoring an 
impression-formation model would be sub- 
stantially increased if parallel findings could 
be demonstrated for both evaluative and non- 
evaluative dimensions of judgment. The pur- 
pose of the present study was to examine 
the generality of information integration 
processes across three judgmental dimensions. 

A substantial amount of research evidence 
has shown that rather simple models can 
yield accurate predictions of evaluative judg- 
ments of complex stimuli from knowledge of 
the evaluative properties of the individual 
components, The two models most promi- 
nently proposed argue that the value of the 
overall judgment is either a sum (e.g., Fish- 
bein & Hunter, 1964) or an average (eg 
Anderson, 1968a) of the values of the com- 
ponent elements. Although evidence has been 
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reported in support of both positions, results 
of comparative tests in which the two models 
made differing predictions have generally 
supported iveraging interpretation (e.g., 
Anderson, 1905; Rosnow & Arms, 1968). In 
the prese: udy, the generality of these 
findings wss determined by conducting such 
comparative {ests for judgments on the evalu- 
ative, potency, and activity dimensions. 

A simple overaging model would consider 
each elemet:t in a stimulus combination to 
be of equa! importance in forming an overall 
impression. That is, all trait words in a set 
would be given equal weight in the formula- 
tion, Suci simple model appears to be 
untenable. Several lines of evidence indicate 
that the verious elements of the stimulus set 
are differentially weighted in the integration 
process. An important research problem con- 
cerns discovering the properties of stimulus 
characteristics that result in their receiving 
disproportionately large or small weights in 
the judgment process. 

Briscoe, Woodyard, and Shaw (1967) 
found that an unfavorable first impression is 
More resistant to change than a favorable 
one. Their explanation was based on the no- 
tion that favorable information, being consist- 
ent with social norms, conveys less information 
about the individual’s personality than does 


unfavorable information. When a person is 
described as behaving in a favorable manner, 
it is not clear whether this is a revealing 
Manifestation of his personality, or simply an 
Instance of consistency with social expecta- 
tions. Thus, favorable attributes convey 
Somewhat ambiguous information about the 
stimulus person, making one’s impression of 
him relatively susceptible to change. Unfavor- 
able information, however, indicates a manner 
in which the person deviates from social 
norms and indicates something unique about 
him as an individual. Consequently, one is 
more certain of a first impression of him, 
tendering it more resistant to change. This 
line of reasoning is quite similar to recent 
Statements of attribution theorists (e.g., 
Jones & Davis, 1965). The finding that 


. evaluatively positive descriptive adjectives 


are used more frequently than evaluatively 
Negative terms (Boucher & Osgood, 1969) is 
also consistent with this interpretation. 
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If information about negative character- 
istics is more useful to the observer, then 
it should receive more weight in the impres- 
sion-formation process. It then follows that 
a stimulus condition whose average value is 
a certain number of units below the neutral 
point should be judged to be more polarized 
than a stimulus set the same number of units 
above the neutral point. In the present study, 
two tests of this hypothesis were made for 
each of the three dimensions of judgment. 


METHOD 


Preparation of Stimulus Materials 


The trait words used in the development of 
stimulus materials were taken from a study by 
Rosenberg and Sedlak (1969).8 These authors had 
100 undergraduate students describe five persons 
they knew well and five persons they knew mainly 
by reputation by listing at least 5 trait adjectives 
for each person. The initial sample of trait words 
used in the present investigation consisted of the 
110 adjectives that were used 10 or more times 
by Rosenberg and Sedlak’s subjects. This provided 
some assurance that the stimulus persons would be 
characterized by trait words used with at least 
moderate frequency in the college population. 
Twenty-eight of the words were eliminated on one 
of the following grounds: (a) words that were to 
be used as polar adjectives for the evaluative, 
potency, and activity dimensions, (b) words that 
might cause ambiguity and slang words, (c) in- 
stances of overspecificity, for example, very intel- 
ligent was eliminated, since intelligent was in the 
sample, and (d) where near-synonym pairs were 
present (e.g, outgoing and extraverted) only one 
of the words was retained. Since it appeared, based 
on data provided by Rosenberg, that the negative 
poles of the potency and activity dimensions might 
be underrepresented in the list as a result of these 
eliminations, 6 new words considered low on one 
or the other of these dimensions were added. This 
resulted in a final set of 88 adjectives which were 
arranged in three random orders for presentation 
to subjects to judge on the evaluation, potency, and 
activity dimensions. Since there were three word lists 
and six orders of the three dimensions, 18 unique 
questionnaire booklets were constructed. 

The judges were 55 undergraduate students in a 
residential college at Yale University. Questionnaires 
were administered individually or to small groups 
in the students’ rooms during the evening. All sub- 
jects rated all 88 trait words on each dimension, 
rating all words on one dimension before making 
ratings on another dimension. Ratings were made 


3The authors express their appreciation to Sey- 
mour Rosenberg for making these materials and 
accompanying data available to them. 
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TABLE 1 
Kanon of Scars Vauews or Taar Wows ex me H, 
M+, M, xo L Carsooxns ror Duazxmon 


ime ecm 
Word category €—————————— 
| Kwebestion |. Fotency i Activity 
M aas | se | vu fies 
Qmm | an | ss | &e 
+ 
Highest | 19 | m5 | 6 
Lowest | «n 6.76 i 693 
M- 
Highest | ag | Su | Sze 
Loves ! 4 | 4e | 449 
49 


on li-point scales defined by the polar adjectives 
"good" or "bad" for evaluation; “ domi- 
neering” and 
“active” and 
asked to rate 
make an effort 
the entire range 


n — subgroups on the potency and 
activity dimensions, where the differences were .56 
and .98, respectively.) Finally, in an attempt to 
reduce the influence of one dimension on the others, 
words were not used for a dimension if they had 
very high (900 or above) or very low (300 or 
below) mean values on either of the other two 


values for each pair of dimensions. These correla- 


* These data are available upon request from the 
first author, 
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different judgment dimensions | 
the present experiment are high!) 
two dimensions, it would be unck 
should be interpreted as evidence ! 
ty of the processes involved or sim 
potency and activity judgments bot! 
a "dynamism" factor. As will be seen 
results were such that this interpretive pr 
not arise. 

The trait adjectives in these subur 
combined to form the 21 stimulus co 


HE 
sE 


Ẹ 


s can be thought of 
imulus combinations were 
reported in this st: 


(eg, HM+HM+ M+HM+H). Three judges) 
were consulted to assure that all stimulus combini 

tions described “possible people.” Within each of the 
two forms for each dimension, four randor 
of the stimulus sets were used in cons ru 
the questionnaire, making eight different questio 
naires for each of the three dimensions. Thus, 
different questionnaire booklets were constru 
Order of judgment dimensions within booklets was 
counterbalanced. In the experiment, each of the 
eight questionnaires for each dimension was admin= 
istered to 10 subjects. 3 


Subjects 


Subjects in the experiment were 80 Yale under 
graduates, 50 of whom received course credit andi 
30 of whom were paid for their participation. 


Procedure 


Subjects were presented with a questionnaire book: 
let consisting of an instruction page and three p 
listing the 21 stimulus sets. The instructi 
described the task and indicated that each grou 
of adjectives was descriptive of a “young man: 
Each stimulus person was to be rated on a 21-p 
scale, the polar adjectives being the same ones 
in the preliminary study in which the indivi 
trait words were scaled. It was emphasized thi 
each combination was to be considered indepel 
dently, and subjects were asked to distribute their 
ratings across the entire 21-point scale. 
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Ri AND DISCUSSION 

Averaging Summation 

The pri urpose of this experiment 
was to p comparative tests of the 
averaging nmation models for three 
different d ins of judgment. Four such 
lests were the results of which are 
shown in 2. If we arbitrarily assign 
the values ind 3 to H and M+ words, 
respective! it is clear that HHM+M-+, 
as compar HH, has a larger sum (14 
versus 8) ! naller mean (3.5 versus 4.0). 
Similarly, ng 2 and 1 to the M— and 
L stimulus is, respectively, a comparable 
contradict etween the two models occurs 
with the | id M—M-LL sets. Table 2 
shows that data from these comparisons 
for the e ive judgments replicated the 
findings of erson (1965),* supporting the 
averaging el. Thus, the addition of 
moderate! able or unfavorable charac- 
teristics to ^» corresponding more polarized 
sets (HH an! LL, respectively) had a signifi- 
cant moder ing effect, rather than eliciting 
more extr judgments as the summation 
model woul: predict. Significant differences in 
the same direction, again supporting the 
averaging n:odel, were obtained for the po- 
tency (strong, domineering versus weak, 
submissive) dimension. Results for the activ- 


ity judgments (active versus passive) were 
less consistent. While a similar averaging 
effect was obtained in the comparison of LL 
and M—M LL, a strong summation effect 
Was found in the HH versus HHM--M-- 
Comparison. That is, HHM+M+ stimulus 
Sets were judged to be significantly more 
active than the HH combinations. 

Two additional comparisons were made in 
Which the two models make different predic- 
lions. Again, using the HH and LL combina- 
tions as a base line, two stimulus sets were 


E * Seventy-five of the 88 words scaled in the pre- 
aminary study have previously been scaled for 
alkableness* by Anderson (1968b). The correlations 
tween the likableness and evaluation (good-bad) 
cmd Values for these words was .97. Thus, the 
s ta of the present study for the evaluative dimen- 
on can be considered most comparable to the vast 
Majority of impression-formation experiments which 
BL a “likableness” judgment as the dependent 
le. 


TABLE 2 


Mean Ramos or Stimetos Sers rox COMFARATIEE 
Tests oF AVERAGING AND SUMMATION 


— — ——M— 15 


|  Simulas combination 


Judgment 
dimension | z EN -— , 

| HH HUM +M + 
Evalustion | 17.20 16.29 «0 
Potency 17.32 1647 « 05 
Activity 17.15 18.20 «0 

Li M -M -1L4. 

Evaluation 3.09 337 « 05 
Potency 3.99 8134 « ot 
Activity 407 $91 «0 

|. MH | HIM M —- | 
Evaluation 17.20 1$00 | < 
Potency 17.32 1542 < 0 
Activity 17.18 ! 16.76 m 

| LL |[M&M-LL 
Evaluation 3.09 5.20 «0 
Potency 3.99 | 5.75 « üt 
Activity | 4.07 645 « 0 


Note.—Simulus sets were rated on a 21«point scale. 


included which added M+M-— to these com- 
binations. Again, while the summation model 
predicts that ratings of the four-element sets 
will be more polarized than those for the 
two-element combinations, the averaging 
formulation predicts just the reverse relation- 
ship. In the bottom half of Table 2, it is 
seen that the results are parallel to the pre- 
vious findings. Both comparisons for both the 
evaluation and potency judgments yielded 
significant differences supporting the aver- 
aging model. A similar effect was obtained 
in the LL versus M--M-LL comparison for 
the activity dimension. However, there was 
no significant difference in the mean active- 
passive ratings of the HH and HHM--M— 
sets, a finding inconsistent with both the 
summation and averaging model predictions. 
These results form a consistent pattern of 
findings. For both the evaluation and potency 
dimensions, all four comparisons yielded sig- 
nificant differences in the direction consistent 
with the averaging formulation, For the 
active-passive judgments, the results were 
different for the positive and negative poles 
of this dimension. Both comparisons using 
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stimulus combinations representing the nega- 
tive (passive) pole yielded significant effects 
supporting an averaging interpretation, On 
the other hand, the results of both compari- 
sons of stimulus sets from the positive pole 
were inconsistent with the averaging model, 
and one of these represented a strong sum- 
mation effect. 

Just how much one should make of these 
latter findings is not clear. There is the 
temptation to consider them as due to either 
error effects or methodological artifacts; after 
all, only 2 of 12 comparisons were not con- 
sistent with the averaging formulation, and 
only one of them supported the summation 
model. Nevertheless, the fact that both oc- 
curred in comparisons at the positive pole of 
the activity dimension makes it difficult to 
consider them random findings. Moreover, a 
summation interpretation for the positive pole 
of the activity dimension may have psycho- 
logical meaning. We frequently refer to some- 
one as “having unlimited energy” or “not 
being able to sit still." If we know that some- 
one is kard working (H) and athletic (H), 
the further information that he is also cheer- 
ful (M+) or emotional (M+) does not seem 
to moderate our impression of his activity 
level in the same way that for the evaluative 
dimension moderately favorable information 
appears to qualify a highly favorable initial 
impression. It seems easier to conceive of a 
person with “unlimited energy” than one with 
“unbounded goodness.” If this interpretation 
is valid, why it should hold only for the 
positive pole of the activity dimension is 
not clear. And it should be emphasized that 
this interpretation must be viewed with 
caution, since even these data are not totally 
consistent with the summation position. 

In sum, the highly consistent findings for 
the evaluation and potency dimensions pro- 
vide strong support for the generality of the 
averaging model for these two dimensions of 
judgment. Results for the activity dimension, 
however, suggest that there may be limits to 
the generality of this model. Interpretation 
of these findings are discussed further fol- 
lowing consideration of the results on dif- 
ferential weighting. 
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Differential Weighting of Positi: 
Negative Characteristics 


Tt was argued above that rmation 
about negative characteristics i person 
indicates ways in which he is dis shable 
from our normative expectations every- 
man," and that therefore this iní tion is 
highly useful to the observer for i first 
impression of him. Thus, it was h esized 
that negative attributes would be heav- 
ily weighted than positive chara ics in 
the process of information integr 

Using a neutral impression (M +M- 
M-—) as base line, it can be that 
the mean values predicted fron simple 
averaging model for the HHM and 
LLM+M — sets are equally above elow, 
respectively, the M--M--M—M- : If the 


above reasoning is correct, however, the 


difference between ratings of the LII +M- 
and neutral sets should be great: in the 
difference between ratings of the HH i +M— 
and neutral sets. These two differe scores 
were determined for each subject ‘or his 
judgments of these three stimuli each 
judgment dimension. The difference between 
these difference scores was then calcviated for 
each subject, and a £ test for paired observa- 
tions was performed on the data {or each 
dimension. A similar analysis was conducted 
comparing the differences in judgment between 
M+M+M—M-— and the HM+M+ 


M — and LM — M + M — sets. Thus, the 
hypothesis was tested for stimulus combina- 
tions at two degrees of polarization from the 
neutral set, 

The mean judgments of each of these sets 
for each dimension are presented in Table 3. 
For the evaluation and potency dimension 
judgments, both tests of the hypothesis re- 
ceived strong support, the difference between 
the difference scores being highly significant 
in all cases. These data indicate that at- 
tributes at the negative pole of these dimen- 
sions are given greater weight in information 
integration than are positive characteristics- 
Results for the activity dimension judgments 
were not consistent with these findings. There 
was no significant difference in the extent to 
which judgments of the HHM--M- and 
LLM+M— sets were polarized from the 


ets were rated on a 21-point scale, 


1 the case of the less polarized 
inations, the difference between 
scores was significant at the 
level, but in the direction opposite to 
hypothesized above. In this case, posi- 
pole characteristics were weighted more 
ly than were negative attributes." 
us, for two of the three dimensions, the 
Midence indicates that negative character- 
are more heavily weighted than are 
itive attributes, and this was found to 
cur at two levels of polarization from the 
utral impression. The results for the evalu- 
and potency judgments were consistent 
h the reasoning outlined above, that nega- 
Characteristics convey more useful infor- 
tion because they indicate the unique way 
Which a person deviates from normative 
ations. Thus, with respect to the evalu- 
dimension, learning that a person is 
ritical (L) or quick-tempered (M—) 
differentiate him from others to a greater 


mulus comt 
€ difference 


-'In determining the difference scores for these 
Ss, the M+M-+M—M-— set was a part of 
subtraction operations. Consequently, the dif- 
Scores were not totally independent as is 
ally assumed for the # test, and this may have 
_ in somewhat inflated t values. However, 
of the differences were highly significant (for 
evaluation and potency, all were at least 
92), so the results obtained probably do reflect 
erences, 
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TABLE 3 
Ratings AND Mean Dirrerence Scores rox Tests or DirrkRENTIAL WEIOMTING 
or POSITIVE AND NEGATIVE. CHARACTERISTICS 
Stimulus set M Mean diference i score Li 
(a) Polarization from neutral set : High Y 
i) B. `} 
HHMSM— | M+MÍM-M-| LLMTM- (AY =B) | (BY) = 1 
15.09 11.57 5.20 3.52 63 « 02 3 
15.42 12.51 535 2.91 6.76 <0 
16.76 11.38 645 5.38 4.93 ns 
(b) Polarization from neutral set: Moderately high 
(A) B i -1 
MAMAM- | M&MTM-M- | LM-I 4M- PAL O SEAS 
žan sive aN ee Gee eee ore 
14.09 | 11.57 | 7.20 252 | 437 | <0 
14.79 12.51 | 7.34 2.28 5.17 < 01 
16.01 | 11.38 | < .0S 


8.96 4.63 242 | 


extent than learning that he is friendly (H) 
or nice (M+). Similarly, for potency judg- 
ments, knowing that a person is dependent 
(L) may tell us more about him as an indi- 
vidual than would information that he is 
determined (H). A similar argument might 
explain the result for the activity dimension 
which suggested that positive characteristics 
on this dimension may be weighted more 
heavily than negative-pole attributes. In this 
case, normative behavior may be represented 
by terms reflecting moderately low levels of 
activity, such as easygoing (M—), calm 
(M—), and polite (M—). If so, then attri- 
butes such as emotional (M+), hard working 
(H), and demanding (H) would be non- 
normative and thus provide relatively unique 
information about a person. Consequently, 
they would receive greater weight in making 
activity judgments. This latter interpretation 
must be considered tentative, since only one 
of the tests of the hypothesis showed a 
significant difference in this direction. 


GENERAL DISCUSSION 


It was argued above that forming an 
impression of another person involves judging 
that person's position on each of several 
dimensions, and that impression-formation 
researchers have been unnecessarily narrow in 
focusing their attention almost exclusively on 
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ratings. 
neutral set, 
M+M+M~—M-, where the variances of the 
ratings for the evaluation, potency, and activ- 
ity dimensions were 9.2, 9.7, and 15.43, respec- 
tively. Such data suggest that the activity 


ratings may have presented the su and 
a fairly atypical task, whereas were 
more accustomed to making eva and 
potency judgments, To the exte id 
interpretation is valid, the res r the 
activity dimension present on; wrak 
challenge to the generality of t! izing 
model. 

It should be noted that the thre ent 
dimensions employed in the pre tudy 
may not be the only dimensions th wise 
a “first impression.” The presen: was 
based on the results of previ udies 
which have found that these dime: pro- 
vide an adequate representation o/ struc 


ture of interpersonal judgment (í 
al, 1957; Rosenberg et al, 19: )ther 


types of judgment may be import: 


as reflected in the dimensions oí 

found by other authors (e.g., Levy jugan, 
1960; Passini & Norman, 1966) ough 
these studies all indicate the multid ional 
nature of the way we judge other earch 
to date has not clearly substant what 
those dimensions are. Consequi the 
dimensions included in the pres study 
cannot be considered as necessari! fining 
the most important dimensions on w>.ch first 


impressions are based. 

Although most of the findings 
evaluative and potency dimensions ca 
accounted for by a weighted averaging : 
such a model gives a representatio: 
explanation, of the impression-formatio 
cess. First impressions may be desc 
adequately by a weighted average of the 
scale values of the individual components, 
but the weights are determined postdictively. 
An understanding of the process of forming 
first impressions will come only when we 
are able to predict the pattern of differential 
weighting to be applied to the single ele- 
ments, The degree to which an attribute 
reflects deviation from some social norm ap- 
pears to be one determinant of the resulting 
weights. Further theoretical statements are 
needed to provide a basis for anticipating 
the weighting process. 
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THE DEFENSIVE FUNCTION OF PROJECTION 


DAVID $. HOLMES * Ax» B. KENT HOUSTON 


University of Kansas 


Subjects in two experimental groups were led to believe that they were ma 

adjusted, while those in a control group were not. Subjects in one experimenta 
group were then given the opportunity to project maladjustment onto thre 

friends, Subjects’ subjective reports and pulse rates indicated that the person 
ality feedback significantly increased the anxiety levels of experimental 5 
compared to control subjects. Neither the subjective reports nor pulse rate 

revealed a differential decrease in anxiety in experimental subjects as a function 
of whether or not the subjects projected the anxiety-provoking trait onto 
their friends. These results raise serious questions concerning the hypothesized 
defensive function of attributive projection. 


Most of the research attention directed 
toward projection is devoted to the value an 
individual’s projective responses may have in 
revealing aspects of his personality. Tradi- 
tionally, however, projection has been con- 
ceived of as a defense mechanism. Freud 
(1959, orig. publ. 1896) suggested that at- 
tributing one’s unacceptable traits to others 
prevents the projecting individual from be- 
coming aware that he has these traits, thereby 
reducing his anxiety over possession of the 
traits.* In contrast to Freud's hypothesis, the 
available studies of projection indicate that 
people project only those characteristics of 
which they are conscious (Holmes, 1968). 
Consequently, projection is clearly not used 
to exclude unacceptable traits from conscious- 
ness. Projection, however, may serve a defen- 
sive function by another means. There is 
good evidence that persons do project traits 
of which they are conscious, and it may be 
that the projection of conscious but unaccept- 


1 The authors would like to thank Roger Ratliff, 
for his assistance in carrying out the experiment, and 
Lawrence Stricker, Walter Emmerich, and E. Femme, 
for their helpful comments on the manuscript. Final 
presentation of the manuscript was supported by 
United States Public Health Service Grant MH-20819 
to the first author, 

* Requests for reprints should be sent to David 
S. Holmes, University of Kansas, Lawrence, Kansas 
66044. 

*It should be noted that in this experiment we 
are dealing with "attributive projection" in which 
an individual projects his own feelings or traits 
onto another person, rather than "complementary 
projection” in which an individual projects the 
complement or cause of his feelings onto another 

person. 
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able traits onto others makes the session 
of these traits less disturbing. Bra (1962) 
and Secord, Backman, and Eac (1964 
have suggested that the projectio i nega 
tive trait onto respected persons or peen 
may enable the projector to reev:'vate th 
trait or to convince himself that he does no 
deviate from his reference group. ‘ther d 
these processes could. be tension reducing. 
Some experiments have revealed - pattem 
of projections that can be interpreted asii 
defensive maneuver (ie., negative tvaits arei 
projected onto liked and respecte: persons 
rather than disliked or out-group persons 
Bramel, 1962, 1963; Edlow & Kiesle i 
Secord et al., 1964). Unfortunately, none of 


these studies has investigated the anxiety: 
reducing function of projection. Therefore 
the present experiment was designed spe ifi- 
cally to determine whether projection of one’ 
unacceptable traits onto a peer reduces the 
anxiety associated with the conscious posses 
sion of the trait. To test this hypothesis, 
subjects in two experimental groups were led 
to believe that they were maladjusted. One ol 
these groups was then given the opportunity 
to project maladjustment onto peers, whil 
the second group was not given this oppor 
tunity. Subjects in a control group were led 


to believe that they were normally adjusted 


dicted that immediately after receiving y 
false personality feedback, the two expe : 
mental groups would evidence higher leve 
of anxiety than would the control group, anti 
more importantly, that the experimental sub: 


jets who projected would show a subsequent 
reduction in an while those who did not 
project would : aow the reduction. Anx- 
ity was mea both by the subjects’ 
reports and a | logical measure, 
TETHOD 
Subjects 
The subjects experiment, 48 females from 


introductory ps) ry classes at the University 
of Texas, Austir > randomly assigned in equal 
numbers to thr roups: experimental-projection, 
typerimental-no ion, and control During the 
experiment, the cL sat facing a screen on which 
slides could be cied. Attached to the arm of 
the subject’s ch s a response panel containing 
four switches an gnal light. The experimenter 


St behind the : at a control center from 
which he could e the apparatus without being 
observed, 

Procedure 

Prior to begir the experiment, the experi- 
menter asked the subject to give him the names of 
three female fr whom she knew well. He 
Pointed out tha would be used in a second 
part of the exp nt. He then placed a photo- 
electric plethysmosrsph transducer on the middle 
finger of the ts nondominant hand. This was 
‘onnected to an E & M physiograph (Model 


DMP-4A) which made continuous recordings of the 
subject’s pulse rate. While the experimenter was 
iltaching the transducer, he explained to the subject 
that the device would record physiological responses, 
that it would not hurt her, and that the main part 
of the experiment was controlled by a computer. 

Aller the transducer was attached and operating 
Properly, the subject was given a rest period of 
 "Pproximately 30 seconds. The experimenter then 
%ked the subject to fill out Zuckerman’s (1960) 
Affect Adjective Check List (AACL), which asked 
the subject to check the adjectives which described 
ET she felt at that particular moment. The AACL 
it aoe as the self-report measure of anxiety, and 
Mtg as justified to the subject by telling her that 
pu derstand your physiological data, it is im- 
"EN to know how you are feeling.” After this 
o Completed, the experimenter gave the subject 

m Bou test score sheet" which she would 
wi xd in the session, and then threw the switch 
: Supposedly activated the computer. In fact, 

hs Switch started a concealed tape recorder which 
| Vias ed the experiment and gave the instructions 
Io ken mounted in front of the subject. The 
Rica indicated that the subject was par- 
test ae in a test “of a very sensitive and accurate 
ture ptsonality . . . which can be used to predict 

M adjustment and performance of college stu- 
P And which “is especially helpful in identifying 

Students who are going to have adjustment 


—— Lad 
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problems and who should be seen for further diag- 
nostic evaluation in the Health Center" It was 
explained that "the famous Rorschach Psychodiag- 
nostic Inkblots" would be projected onto the screen 
one by one, and that while each blot was on the 
screen, four nouns would be read aloud to the 
subject. The subject's task was to decide which of 
the four nouns best described the blot at which she 
was looking and to indicate her choice by throwing 
the appropriate switch on her response panel. The 
subject was led to believe that this would register 
her response in the computer, and the computer 
would then score the response and immediately give 
her feedback. The feedback was given with the green 
light on the response panel. The subject was told 
that if her response indicated some pathology the 
light would flash, but if the light did not flash, her 
response had been a normal one. The subject was 
asked to record each occurrence of the light on the 
personality test score sheet. The subject was told 
that she should “not be concerned if the light flashes 
on once, for this is normal for college students. 
It does not become serious until the light has 
flashed four or five times" The interpretations 
associated with various numbers of flashes were 
available on the score sheet. 

The response alternatives presented for each card 
were, in fact, all normal responses (W, F+; Beck, 
Beck, Levitt, & Molish, 1961). In the experimental- 
projection and experimental-no-projection conditions, 
independent of the subjects’ responses, the light 
indicating pathology flashed after Blots 1, 3, 4, 6, 
8, 9, and 10, while it never flashed for subjects 
in the control condition. The interpretation for “7 
or more flashes" at the bottom of the personality 
test score sheet indicated that "students receiving 
this feedback should be seen immediately in the 
Health Center or Counseling Center." 

After the subject responded to the last blot and 
was given time to total up the number of flashes 
she had received and to read the interpretation 
schedule, the subject was asked to fill out the AACL 
again under the guise that "it is important that 
we know how you feel right now so we can accu- 
rately interpret your Rorschach responses Time 
was allowed to complete the AACL, and then the 
taped instructions went on to indicate that "for the 
second part of the experiment, we need to know 
something about people whom you know well" At 
this point the experimenter gave the subjects in the 
experimental-projection and control conditions a 
form on which the names of the three friends she had 
identified earlier had been printed. The form asked 
the following question about each friend: “If this 
person took the personality test you just took, what 
is the total number of flashes you think she would 
have received?" This gave the subject an opportu- 
nity to project maladjustment onto three friends, 
and the total number of flashes she indicated was 
used as the measure of attributive projection. Sub- 
jects in the experimental-no-projection condition 
were given forms on which they were asked to esti- 
mate the height and weight of the three friends they 
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Mean AACL Scores 


Experimento! Periods 


Fic. 1. Mean AACL scores for (a) before person- 
ality test; (b) after personality feedback, but before 
projecting or judging height and weight; (c) im- 
mediately after projecting or judging height and 
weight; (d) end of a waiting period. 


had named earlier, and, therefore, these subjects 
were not provided with an opportunity to project 
the anxiety-provoking trait. 

When the subject completed her peer rating form, 
the experimenter asked her to complete the third 
AACL. After it was filled out, the experimenter 
indicated that that concluded the part of the experi- 
ment which was controlled by the computer and 
that he was going to turn the computer off and get 
Dr. Karlschmidt who was going to go over the 
subject’s personality test responses with her. The 
experimenter then left the room for 1 minute. Upon 
returning, he asked the subject to fill out the fourth 
AACL while she was waiting for Dr. Karlschmidt. 
This postprojection waiting period was included to 
allow for the possibility that the anxiety-reducing 
effect of projection might be somewhat delayed. The 
repeated administration of the AACL was justified 
by telling the subject that to interpret the results 
of the test, it was necessary to know how the subject 
felt at all times. After the last AACL was completed, 
the subject was questioned Concerning the deception, 
thoroughly debriefed, and dismissed. 

In summary, the procedures for the experimental 
session were as follows: AACL 1, manipulation of 
threat through feedback from personality test; 
AACL 2, differential opportunity to project; AACL 3, 
waiting period; AACL 4, debriefing. Pulse rates 
were recorded continuously, but the data for the 
analyses were based on the number of pulse beats 
for the 30-second intervals during which the subject 
(a) completed the first AACL, (b) totaled up her 
flashes and read the interpretation schedule, (c) com- 
pleted the third AACL, and (4) waited for the 
experimenter to return. 
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RESULTS AND Discussion 


The mean AACL scores and the mean pul 
rates for the four periods of the experim 
[(a) before personality test; (5) after per 
sonality feedback, but before projecting 
judging height and weight; (c) immediately 
after projecting or judging height and weigh! 
(d) end of a waiting period] are presente 
graphically in Figures 1 and 2, respectively, 


Effect of Personality Feedback 


It was predicted that after being led to 
believe that they were maladjusted, the sub 
jects in the two experimental groups woul 
evidence greater increases in anxiety as me 
sured by AACL responses and pulse rate 
than would control subjects. Consistent 
the prediction, a between-within group an 
sis of variance comparing the AACL 
of the three groups over the prepersona 
and postpersonality feedback periods (Pe 
riods 1 and 2) revealed a significant Group 
X Periods interaction (F = 10.85, df = 2/45 
P < 01). To explore this interaction her 
several nonorthogonal comparisons were caf 


perimental groups with the control group 
showed that subjects in the experimenta 
groups increased significantly more in neg 
tive affect than subjects in the control grou 
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Fic. 2. Mean pulse rate for 30-second interval 
(a) before personality test; (b) after personal 
feedback, but before projecting or judging H 
and weight; (c) immediately after projecting ° 
judging height and weight; (d) end of a waiti 
period. ] 


THE DEFENSIVE FUNCTION oF PROJECTION 


projection, F = 20.07, dj = 


esperimen 

1/45, p < When compared with each 
other, the experimental groups did not 
differ signi y in their increases in nega- 
tive affect 1,29, df = 1/45), These find- 
ings indic the negative personality 
feedback g o experimental subjects was 
successful erating anxiety as measured 
by the AAC id that there were no differ- 
ences betv 1e two experimental groups in 
the degree ich their anxiety levels were 
increased. 

The res vith regard to the pulse rate 
measures a sentially parallel to those de- 
rived from \ACL measures. A between— 
within grou alysis of variance comparing 
the pulse : a of the three groups over 
the preper lity and postpersonality feed- 
back perio Periods 1 and 2) revealed a 
significant raction (F = 6.99, df = 2/45, 
p< 01). orthogonal comparisons indi- 
cated that 1 of the experimental groups 
showed greaics increases in pulse rates after 
receiving the personality feedback than did 
the control croup (experimental-projection, 
P= 436, .05; experimental-no projec- 
tion, F = 13.90, df = 1/45, p < 001), while 
the two experimental groups did not differ 


to which their pulse rates in- 
0, df = 1/45). From the re- 
sults of the statistical analyses carried out on 
the data as well as the inspection of the data 
Plotted in Figures 1 and 2, it is clear that 
after being led to believe that they were 
Poorly adjusted, the subjects in the two ex- 
perimental groups showed a greater increase 
M anxiety as determined both by the AACL 
and pulse rate measures than did the subjects 
in the control group. 


Effects of Threat on Projection 


Since the personality feedback was success- 
ful in altering the levels of anxiety, analyses 
Were carried out to determine whether these 
differences in anxiety were reflected in the 
Subjects’ projections. On the basis of the theory 
and previous research on attributive projec- 
tion, it was predicted that experimental sub- 
Jects who had been led to believe that they 
Were maladjusted would project more malad- 
Justment onto their peers than would control 
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subjects who had not been led to believe that 
they were maladjusted. A comparison of the 
experimental-projection and control groups 
in terms of the total number of flashes (indi- 
cating pathological responses) which the sub- 
jects estimated their three friends would have 
received on the personality test revealed that 
the experimental subjects projected signifi- 
cantly more pathology (F — 93.36, df — 1/ 
30, p < .001) than did the control subjects. 
In fact, there was no overlap between the 
distributions for the two groups. Thus, it is 
clear that the subjects in the experimental- 
projection group did use attributive projec- 
tion, but the question remains whether the 
use of projection served to reduce the anxiety 
induced by the experimental manipulation. 


Effects of Projection on Threat Reduction 


On the basis of the theory of defensive 
projection, it was predicted that subjects in 
the experimental-projection group who were 
given the opportunity to project would evi- 
dence a greater reduction in anxiety than 
would subjects in the experimental-no-projec- 
tion group who were not given the oppor- 
tunity to project. A between-within group 
analysis of variance comparing the AACL 
scores of these two groups for the periods of 
time before and after the opportunity to 
project (Periods 2 and 3) did not reveal the 
expected interaction (F = .55, df = 1/30), 
although both groups did show an overall 
reduction in anxiety (F — 19.81, df — 1/30, 
p< 001). Furthermore, a similar analysis 
comparing the AACL responses given by the 
subjects in the experimental-projection and 
experimental-no-projection groups before and 
after the waiting period (Periods 3 and 4) 
also failed to reveal the expected interaction 
(F = 01, df = 1/30). Neither of these com- 
parisons, then, supported the prediction that 
projection would lead to anxiety reduction. 

The results for the pulse rate data were 
again parallel to those of the AACL mea- 
sures, There was no significant interaction 
(F — .25, df — 1/30) in pulse rates between 
the experimental-projection and experi- 
mental-no-projection groups over the prepro- 
jection and postprojection periods (Periods 2 
and 3), and the same was true when the data 
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for the prewaiting and postwaiting periods 
(Periods 3 and 4) were compared (F = .33, 
dj = 1/30). The consistency of results com- 
pels one to conclude that the subjects who 
were given the opportunity to project did not 
evidence a greater reduction in anxiety than 
did the subjects who were not given this op- 
portunity, 

In considering these results, it is important 
to note that not all individuals may employ 
projection in the face of threat and, there- 
fore, probably not all of the subjects in the 
experimental-projection group used this de- 
fense. As Belmont and Birch (1951) have 
pointed out, "since we cannot expect uni- 
formity of defense mechanisms in any ran- 
domly selected population, the assumption of 
constancy leads necessarily to contradictory 
results. and unresolved confusions in the 
available experimental data [p. 226].” If not 
all of the subjects within the experimental- 
projection group used projection or used it to 
the same degree, then the inclusion of non- 
projecting subjects in the experimental-pro- 
jection group would have attenuated differ- 
ences in the reduction of anxiety between the 
experimental-projection and experimental-no- 
projection groups. In view of this possible 
problem, internal analyses were carried out 
within the experimental-projection group. 
Subjects in this group were divided into high 
and low projectors according to whether they 
were above or below the median in terms of 
the number of flashes they projected onto 
their three friends. There were no differences 
between the high- and low-projection groups 
in terms of increases in anxiety as measured 
by AACL and pulse rates (F = 1:52:48 = 
.25, df = 1/14) after receiving the feedback, 
but before projecting. On the basis of the 
theory of defensive projection, it would be 
expected that the high projectors would evi- 
dence greater postprojection threat reduction 
than would low projectors. Between-within 
group analyses of variance on the AACL and 
pulse rate data for the two groups for the 
periods before and after the projection. did 
not reveal a significant interaction (F — 45; 
F = 27, df = 1/14, respectively), The same 
was true when the AACL and pulse rate data 
for the periods before and after the waiting 


Davi S. Hotwxs anp B. Kent Houston 


period were analyzed (F = 3.15; / 05, 
= 1/14, respectively). 

The results of these internal s 
also relevant to another possi! robl 
Although the subjects in the expe 


projection group were not prov vith a 
specific opportunity to project a ere Í 
stead given another task to perfor dire 
assurance exists that these subj: lid 
also project in some way. Tha n the 
absence of th portunity to pr by us 
ing a rating form, some of the subjec!s in the 
experimental-no-projection grou; have 
projected covertly in thought. Suc cove 
projection in -experimental~1 jectio 
group might have reduced the difi::cnces 
anxiety reduction between the two ex 
mental groups. However, this cri: ; could 
not be directed at the comparisons of high 
and low projectors within the ex iental- 


projection group since it is unlikely that 
use of projection would not hav 
flected in the forms completed by : 
jects. The results of the analyses « 1 
within the experimental-projection :-oup 4 
consistent with those stemming ! 
parisons between the experimenta! j 
and experimental-no-projection groups 
by suggesting that covert projection had 
influenced the results. 


CONCLUSIONS AND IMPLICATIONS 


The results of this investigation clearly 
provide no evidence for the hypothesis that 
the projection of an undesirable trait ont 
peers results in a reduction of the anxiety) 
associated with the possession of that trai 
These results were highly consistent, being 
observed both in the comparisons of the ex^ 
perimental groups, as well as in the internal 
analyses carried out within these groups. 
Furthermore, the results were found regard 
less of whether anxiety was measured by sub 
jective self-reports (AACL) or an objective 
physiological indicator (pulse rate). The high 
degree of similarity in the results for these 
two independent and very different measures 
of anxiety provides some degree of convergent 
validity for the measures, as well as addi 
tional strength for the conclusion that projec i 
tion did not result in anxiety reduction. One 
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After 
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of elements performed correctly would be expected t 

pay much closer attention to the relevant modeling 
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The positive results of many studies cannot 
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;odel ex- 


due to improved attending durin 


posure and due to other process sifecting 
later test performance. Others (c... Bergen 
1966; Berger, Irwin, & Frommer, | /70) have 

bserver’s 


emphasized the importance of t! 
practice during exposure to the m 
requiring adequate attention. T' 
study was designed to investigate observing 
behaviors directly, since no support was found 
in previous literature for the assumption that 
the variables commonly used in vicarious 
learning do indeed influence attending to à 
model. 

The specific purpose of the present study 
was to examine the frequency of observing * 
performing model in a two-choice situation 
when (a) an observing child either expected of 
did not expect to engage in the same task 
subsequently; (6) reinforcement to the modd 
was portrayed as high or low; and (c) a seco 
observation period either preceded or follo 
the actual execution of the modeled task. 
expectation to engage in the same game su 
sequent to observation should enhance atten 
ing (Bandura, 1969). Such a prediction is also 
consistent with the conceptualization of im 
structions as facilitating cue utilization (8 
McLaughlin, 1965). 

The effects of observing reinforcement to the 
model have been studied in adults, in children, 
and under different conditions, Imitative be- 


1, clearly 
present 
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ut not always enhanced by 
y positive reinforcing conse- 
z inple, in some problem-solving 
T ike situations, the fear that 
kt : will be considered cheating 
N nay be disapproved can result 
(Bandura, 1968). The third 
(o Í a second observation phase) 
lu » evaluate the degree to which 
of the task would affect 

ses, It was argued that this 
erve as a check on the effects 
nce for the group not expecting 
its execution should nullify the 
tion effects. Second, somewhat 
the prefeeding procedure in 
, additional involvement and in- 
tion could serve as motivational 
Y ing attending behavior follow- 
ticipation. In the present study, 
with which the subject chose to 
himself with a model picture rather 
tral picture constituted measure- 
»bserving response. The nature of 
ıttending activities, if any, during 
"on of the slides was thus not 
mc:-ured. As in previous studies, the 
esumes that rate of selí-presenta- 
reasonable index of attending 


METHOD 


and Procedure 


Subjects were 145 second-grade boys from the 
nd Public School System. They were randomly 
d to eight experimental groups of 15 subjects 
a single control group of 25 subjects. 
apparatus consisted of equipment for display of 
WO Separate sets of slides on one i screen, 
iate timing devices, a ready signal light, and 
h buttons for selecting either of two slide sets. 
child was seated before the screen. A blue ready 
indicated that the subjects could choose either of 
of slides by pressing a button to his left or right. 
Tesponse buttons activated either of two slide 


of the second slide, a blank slide remained in 
1 until another response was made. The ready 
came on again after ure of the second slide in 
quence had been completed. Decade interval timers 
tolled total intervals, slide exposures, and inter- 
‘intervals. To facilitate the discrimination for the 
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a hith-grade boy was shown seated in (ront 
of items taken from the Columbia Mental 
Scale (CMMS). The model was 
under tbe correct CMMS item, that 
an item set which be judged to be 
others. A light above bis chosen 
The CMMS items and correct 
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When the subject entered the room, he was asked 
be seated and told that he would be playing games with 
experimenter. He was asked to read the word 


Lj 


‘The experimenter then explained the free choice situa- 


Phase 2. The children were given no choice during 
Phase 2, and all slides were shown in a predetermined 
random order, 

In Phase 3, half of the subjects in all subgroups were 
given their second opportunity to freely choose among 
the slides. The remaining half were given the CMMS 
and performed this task for the same 14 items as shown 
on the model slides. In Phase 4, those subjects who had 
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had a free choice on the preceding phase performed the 
CMMS, while the remaining subjects who had com- 
pleted the CMMS previously were now given a free 
Choice. In addition to the eight experimental groups that 
underwent the above procedure, a control group was 
exposed only to performance of the CMMS. 


Design 


The experimental procedure consisted of four phases 
and cight experimental treatments. All groups were 
treated the same way in Phase 1, In Phase 2, the 
subjects were split into groups with high and low model 
reinforcement and further divided into groups ex- 
pecting to play the game or expecting not to play the 
game. These four subgroups were further divided by 
the order in which a second free choice sequence and the 
CMMS were given. This design yielded eight groups in 
a factorial design with 2 reinforcement conditions 
(high, low) X 2 Instruction conditions (expecting or not 
expecting to play the game) X 2 order factors (early 
or late). The control group was used for special com- 
parisons only. The main dependent variables consisted 
of each child's choices of the model slides in the first 
phase and on the second presentation, and of a difference 
score between choices on the first and second presenta- 
tion. Raw scores on the CMMS were also recorded, even 
though an examination of the amount of vicarious 
learning was not of interest in this study. 


RESULTS AND DISCUSSION 


Evaluation of changes in later preferences for 
either set of slides requires demonstration of 
similar base rates during Phase 1. A2 X 2 X 2 
analysis of variance was carried out on the 
mean number of modeled choices for the first 
14 (free) presentations. The analysis revealed 
no significant differences (p> .10) for all 
main effects and interactions. Thus, all groups 
were comparable in their initial choice prefer- 
ences. A mean of 6.70 for model choices 
suggests that the two slide sets were of almost 
equal attractiveness to the children. 

The main analysis of the effects of the 
independent variables on observing responses 
was carried out on the mean difference scores 
for each subject’s model choices from his first 
choice situation (Phase 1) to his second choice 
situation (either Phase 3 or Phase 4). For the 
experimental groups, mean changes in model 
observations ranged from +2.73 to —2.40. A 

2 X 2 X 2 analysis of variance yielded signifi- 
cant main effects for instructions (F — 4.18, 
df = 1/112, p < .05) and order (F = 33.65, 
df = 1/112, p < .001). The presentation of 
slides with an adult reinforcing agent did not 
significantly alter model observation fre- 
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quencies. The Instructions X Order 
barely fell short of significance at t 
(F = 3.91; with 1 and 112 df, an 
is required). Since the interactio: 
closely approached significance, si 
tests were performed. An analysi: 
structions effects indicated that th: 
for whom the second free choi 1 Į 
preceded participation in the CMMS were sig- 
nificantly affected by instructions 
df = 1/56, p < .01). The groups 
given the CMMS after the second ch 
and had expected to play, showed 
decrease of .19 model observatio: 
groups which had not expected to 
game showed a mean decrease of 2 
ever, those children who obtained t? 
first and the second choice period las! 
affected by instructions (F < 1). 
bined subgroups that were given th 
prior to their second choice period 
mean increase of 1.9 model observations, 
no significant differences due to reinforc 
or instructions. Thus, the main contr! 
the instruction variable, consistent wi 
nature of the experimental design, wa 
prior to the child’s actual participation in 
CMMS. When the subjects were told t! 
game had been fixed and they were allow 
play, the earlier instructions were o! í 
overridden, and the following observing re- 
sponses were no longer affected by them. — 
The results suggest a surprising increase in 
observing responses after the child had played 
the game himself. Since the performance of the 
CMMS was not reinforced nor followed by the 
experimenter’s correction, it seems plausible 
that the children chose to observe the model’s 
performance subsequent to their own in order 
to obtain information about their own achieve- 
ment. Such an interpretation, however, cannot 
be differentiated from one which assumes that” 
actual execution of the task increases subject- 
model similarity, and thereby secondarily in- 
creases the subject’s attention to the model. 
The significant effects of instructions prior te 
performance by the subject himself do suggest 
that such instructions, similar to those used in. 
vicarious learning designs, tend to maint 
attending at a higher level than instructio! 
suggesting that the child will not be able to 
engage in the same behavior, The termination 
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Some EFFECTS oN MODEL OBSERVATION 


ol the ins m effect, by permitting the 
no-expecta groups to play, supports the 
assumption children attended to these 
instructio: ! believed them. 

The perí :ce increments on the CMMS 
due to m posure were examined by a ¢ 
test on t! erence between pooled experi- 
mental gro neans and the control group 
mean. A / 12 (p < .005) indicated that the 
subjects w itched the model made signifi- 
cant gains olving the CMMS items. The 
items had selected to include items which 
were abov erage difficulty for the average 
second graicr, While the control group mean 
was 8.68, combined experimental groups 
yielded a n of 11.18. Since the range of 
means amo og experimental groups was ex- 
tremely limited (10.64-11.72 for the eight 
groups), no further analysis of correct CMMS 
items was «:*empted. Results clearly indicate 
that, regarsiess of the particular conditions 
under whic) observation took place, such 
observation produced better performance on 
these nonverbal intelligence test items. 

The present experiment attempted to ex- 
amine the cTects of instructional and rein- 
forcement variables on the frequency of 
observing nses, in order to contribute 


toward understanding whether these variables 
carry their main effect in vicarious learning 
Studies during the period of model observation 
9r during later subject performance. The failure 
to obtain effects due to differential model re- 
inforcements suggests that the effectiveness of 
Vicarious reinforcement may lie not so much in 
stimulating the subject to increased attending 
during model exposure as in enhancing the 
Subject’s performance during test trials. These 
Conclusions are somewhat limited because of 
the possibility that the particular levels of 
Presented model reinforcement may not have 
been sufficient to affect the subjects differ- 
entially. Although consequences to the model 
of similar magnitude have been shown to be 
efective in previous studies with children 
(Bandura, 1969), the present study utilized a 
level of low reinforcement in which the subject 
Was shown a model who always had guessed 

€ Correct response. Perhaps, the model’s 
achievement of accuracy on each trial may 


217 


have been sufficiently impressive so that the 
additional observation of a material reinforcer 
in the slide for the high-reinforcement groups 
did not increase the strength of this variable 
very much. In addition, the children for whom 
the CMMS preceded the second choice period 
had experienced that neither reinforcement 
nor feedback followed their execution of the 
items. 

The instructions indicating that the subject 
would later engage in the same task resulted in 
significant increases in attending behavior. If 
such instructions enhance attending, then one 
would expect a greater amount of observation 
during co-learning, a paradigm in which the 
subject and model alternate on the same task, 
than during no-trial-learning designs. It would 
be of further interest to investigate the effects 
of the subject’s preexperimental experience 
with the game, since the significant order 
effects in the present study suggest that prior 
experience with the task strongly increases the 
subject’s attending behavior during exposure 
of a model to the same task. 

The present results suggest that the role of 
attending behavior in observational learning 
requires detailed examination of the attending 
behavior itself, in order to avoid acceptance of 
plausible but partly incorrect assumptions 
about the role of attention during observational 
learning. 
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A nonreactive field research technique was used to indicate the extent to which rac» 
attitudes affert helping behavior in the general adult population. Five hundred a 
forty black subjects and 3 white subjects received what was ostensibly a wror 
number telephone call. The caller, clearly identifiable by his voice characteris! 
as being black or white, explained that he was attempting to reach his mecha 


used as an indication of racial 


extended relatively equivalent levels of assistance to blacks and to whites. Howeve: 
black callers somewhat less frequently than white callers. | 
subject and the caller, male subj 


indication of racism. Public opinion polls and 
. questionnaires have also been used to indicate 
the extent of discrimination. However, these 


The experimental study of racial conflict 
has almost entirely been limited to the labora- 
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from a phone booth located on the parkway because his car had broken do» 
perge om mover rir ar ow gemi cee as which to make anot} 


call to the Tbe subject could help the caller by contacting his gars 
Kin. The riii ec wih wich Mack and white men were hele» 


+ The results showed that black subject 


concerned with white racism, we 
to discover whether there was any anti 
discrimination by blacks. Marx (1969), i 
study of black attitudes toward whi 


1964, found little evidence of the intolerance 
that whites show toward blacks. llowever, 
since 1964, many changes appear to have 
occurred in the black community. The riots 
and the voices of militant leaders cad one to 
assume that the attitudes of most blacks have 
changed. However, in reviewing the numerous 
surveys of black attitudes since 1964, Marx 
(1969) concluded that black attitudes have not 
changed. Marx feels that the mass media have 
given a distorted picture of how the majority 
of blacks feel toward whites. He concluded 
that “blacks remain perhaps the most tolerant 
group in American society [p. 224].” 

Marx pointed out that there are alternative 
explanations for the data collected from 
public opinion polls. Blacks could have been 
lying to white poll takers, single males could 
have been undersampled, and the real militants 
may have refused to be interviewed. In other 
words, public opinion polls have many of the 
weaknesses of the more reactive social psy- 
chological experiments. One of the aims of the 
present study was to assess how blacks behave 
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hout the biases introduced 
rectly. 

» the study of racism was 

vation of helping behavior. 

olleagues (Berkowitz, 1966; 

cls, 1964; Berkowitz, Klander- 

^4) have conducted a number 

the general area of helping 

ve assumed the existence of a 

lty norm" in our society that 

son's helping another when 

is can be gained. If anti- 

is present, then we would 

social responsibility norm 

‘ed more frequently for a black 

clp than for a white man in 

the same situation. Therefore, we have used 

relative íze juencies with which a black 

a white mon are helped as an indicator of 


tio 
To date, i! cre have been three studies on 
i bchoor that have studied race. 


in, Rodin, and Piliavin (1969) found 
tendenc, for more same-race helping in 
face-to-face emergency situation. Bryan 


And Test (1967) found significant differences 
the number of persons who con- 
! Salvation Army kettle, depend- 
Mg upon the race of the person who was 

iting. White persons in a shopping center 
fended not to contribute as often when the 
tle was manned by a black woman. Wispé 
and Freshley (1970), however, failed to 
e a race effect in the willingness of 
Passersby to assist a black or white female 
bag of groceries had just broken in 
front of a supermarket. Wispé and Freshley 
Suggested that the immediacy of the onset of 
Ihe incident may have attenuated a race effect. 
Insummary, we predicted that whites would 
inate against blacks by not helping 
them as frequently as they help their fellow 
Whites. In addition, based on the results of 
Polls of blacks, we expected that blacks would 
not show any discrimination against whites, 
that is, that blacks would extend equivalent 
of assistance to black and white victims. 


METHOD 


Subjects 


Ma Subjects were 1,109 residents of Brooklyn, New 
ok. Subjects were sampled from the 1968-1969 
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the parkway ...nd Im wondering if 
you'd be able to come out here and take a 


Caller: This isn't Ralph's Garage! Listen, I'm 
terribly sorry to have disturbed you, but 
listen . . . I’m stuck out here on the high- 
way .. . and that was the last dime I had! 
Thave bills in my pocket, but no more change 
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to make another phone call . . . Now I'm 
really stuck out here. What am I going to 


do now? 

Subject: ... [Subject might volunteer to call the 
garage.) 

Calle: Listen . . . do you think you could do me 


the favor of calling the garage and letting 
them know where I am... ? I'll give 
you the number. . . . They know me over 
there. 
Oh brother . . . listen, I’m stuck out here 
. . . Couldn't you PLEASE help me out 
by simply calling the garage for me [plead- 
ingly]? 
Listen . . . If YOU were in my situation 
. wouldn't you want someone to help 
you? 


Prod A: 
Prod B: 


If after Prod B the subject refused to place the call 
but did not hang up, he was relieved of any concern he 
may have had for the stranded motorist when the 
caller reported: “Oh, one second . . . here comes a 
police car . . . I think he will be able to give me a 
hand." 

If the subject agreed to help, the victim gave him a 
telephone number to call. In fact, the subject's call 
was received by an assistant acting as the garage at- 
tendant. The assistant assured the subject that the 
victim would be helped immediately, and graciously 
thanked him for his helpfulness. 

To identify which subjects actually helped, the follow- 
ing procedures were employed. First, black and white 
callers gave the subjects different telephone numbers to 
call. Second, each time a subject agreed to call, the caller 
Changed his location on the parkway, rotating eight 
different locations. At the “garage,” the subject was 
asked for the location of the stranded motorist. Finally, 
the time at which the call to the subject was completed, 
along with the time at which subject’s call was re- 
ceived at the garage, was recorded. 


TABLE 1 


PERCENTAGES OF BLACK AND WHITE SUBJECTS 
Herpine BLACK AND Warte VicriMS 


White subjects Black subjects 
Item 
White Black White Black 
victim victim victim victim 
Geoverall | — 65 53 7 | dw 
n 251 236 247 | 239 
x? = 7140599 x? = 2.34* 
% males 73 59 69 | 68 
n 97 97 84 87 
x? = 4.50*** x = .05 
% females 60 49 66 57 
n 154 163 163 152 
x? = 389*** x? = 3.12" 
l 
*p <.20. 2 
"Es <.10. 
** p 4.05, 
skp c 01. 


Only if the subject actually called t rage was he 
credited with a helping response. If t ject refused 
to help after Prod B, or hung up after aller stated 
“and that was the last dime . . .” a Ip response 
was scored. However, if the subject up prior to 
the word “dime,” a premature han esponse was 
recorded and was considered sepa from the 
help, no-help response categories. ! case of a 
premature hang-up, the subject cou reasonably 
be expected to realize that the needed kis 


assistance. 


RESULTS 


Perceived Characteristics of the nd While 


Callers 


College students listening to tape recordings 
of the callers’ practice performances of the ex- 
perimental dialogue judged the white callers 
as “white” 92% of the time = 89) and 
judged the black callers as “black” 97% of the 
time (n = 97). 

In addition, the students pé ved social 
and personality differences betwee. the black 
and white callers. Although blacks were 
perceived to be of lower social class (£ = 2.51, 
p < .01) and less education (/ = 1.97, p <.05), 


both groups were judged as ig lower 
middle class and as having completed a high 
school education. Furthermore, blacks were 
perceived to be more good-natured (/ = 2.13, 
p < .05), while, in general, both groups of 
callers were judged favorably on character 
istics such as good-naturedness, friendliness 
courteousness, reliability, truthfulness, and 
sincerity. 


Effects of the Race of the Victim on Helping 
Behavior 


The results (see Table 1) of this experiment 
show that white subjects helped the white 
victims 12% more frequently than they hel 
the black victims (x? = 7.40, df = 1, p < 01). 
Therefore, the results support the hypothesi: 
However, although this 12% difference P 
statistically significant, the magnitude of the 
effect is rather small. i 

Blacks, on the other hand, were not at al 
antiwhite. In fact, among black subjects 
white victims were accorded help 6.6% Dw 
frequently than black victims. This 6.6% w 
ference, however, is not statistically significan 
(@ = 2.34, df = 1, p < .20). 


relative extent to which the subjects’ 
behavior appeared prowhite was not 
ntly greater for the white than the 
ck subjects (X? = .83). Thus, although 
tes extended help less frequently to the 
tk than to the white victims, whites did 
bt favor whites to a greater extent than blacks 
ed whites. 
complexion of the findings changes 
hat when the subject's sex is considered. 
1 reveals that among whites, both 
and females helped the black victims 
requently than white victims. Black 
es similarly helped white victims some- 
more frequently than black victims 
3.12, df = 1, p < .10). Black males, 
er, extended equivalent levels of as- 
to the black and white victims. 
ing the subject's and the victim's race, 
anded motorists were assisted by 67% 
male subjects and by only 58% of the 
subjects. The difference of 9% in the 
uency with which males and females 
was statistically significant (X* — 7.35, 
1» «.01) 


lure Hang-U ps 


-subject's response was categorized as a 
ture hang-up rather than help or no 
if he hung up before the stranded motorist 
the opportunity to state “and that was 
last dime." The results show that neither 
White nor the black subjects discriminated 
terms of the premature hang-up. The white 
e and black female subjects, however, 
eemed to hang up prematurely somewhat more 
uently on the black than on the white 
S (p< .10). Overall, white subjects 
g up prematurely more frequently than 
ck subjects. That is, 14.2% of the white 
ects, but only 9.4% of the black subjects, 
g up prematurely (x?= 6.07, df = 1, 
02). A sex difference also emerged in the 
of premature hang-ups. Fourteen percent 
the females, compared to 5% of the males, 
a prematurely (x? = 1841, df = 1, 


DISCUSSION 


this nonreactive study, it appears that 
whites, the social responsibility norm 


EFFECTS OF RACE ON ELICITATION OF HELPING BEHAVIOR 
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was violated somewhat more frequently for 
black than for white victims. It should be 
stressed, however, that for 88% of the white 
subjects, the race of the victim did not seem to 
affect whether or not they helped. ‘Therefore, 
the race of the victim (inferred from his 
dialect) has a small but detectable influence 
upon the helping behavior of white residents 
of New York City. 

Black subjects, confirming the findings of 
public opinion polls, did not discriminate 
against whites. Somewhat surprising, but 
approached cautiously because of the low level 
of statistical significance, was the unpredicted 
finding that black females also tended to help 
white victims somewhat more frequently 
(9%, p < .10) than the black victims. 

Comparing the wrong number technique 
with more traditional measures, some ad- 
vantages of the technique can be noted: (a) 
The method is nonreactive; (6) it measures 
overt behavior, rather than belief or intention 
to behave; (c) it permits the investigator to 
expand the sampling base beyond the college 
student population; (d) large numbers of sub- 
jects may be tested in a relatively short period 
of time (i.e., each caller averaged 12 subjects 
per hour). Speed is especially important if we 
wish to gauge the immediate effects of current 
events or programs to reduce intergroup 
tension. 

However, there are some disadvantages and 
limitations associated with the wrong number 
technique: (a) Although the sampling base 
is greatly expanded, the sample must be limited 
to those people with telephones whose phone 
numbers are listed in the public directory. 
However, the widespread use of telephone 
service tends to minimize this problem. Never- 
theless, investigators interested in sampling 
black and white households must recognize 
that their sample of the black population will 
probably be less representative than their 
sample of whites, based solely upon economic 
factors. An examination of elementary school 
records in areas populated largely by blacks 
revealed that approximately 7095 of the 
children's homes maintained private telephone 
service. Thus, nearly 30% of the black popula- 
tion may be inaccessible via telephone. (b) 
The investigator has no control over the sub- 
jects’ behavior prior to the telephone call for 


K a mani 


222 SAMUEL GAERTNER 
help. The subject may have been doing some- 
thing at home which might have interfered with 
or increased his willingness to help, and thus 
the relationship between race and helping may 
have been attenuated. However, this may be 
only an apparent disadvantage since the com- 
plexity of the situation was real, and thus the 
result might be more representative of how 
people actually behave in real-life situations. 
(c) A more serious disadvantage of the wrong 
number technique is its possible insensitivity 
to varying degrees of prejudice necessitated by 
the restrictive response categories (ie., help- 
no help). Are all white subjects who help the 
black victim nonprejudiced? Most probably 
not. They are, however, nondiscriminators in 
this situation. This method may be relatively 
insensitive, but it need not be the sole in- 
strument in a researcher's battery of measures. 


. Also, it would be a mistake to view the wrong 


number situation solely as a one-item test 


. With a yes-no response. Actually, the subject 


makes a number of responses in this situation 
which may be systematically recorded; for 
example, does he hang up immediately, does he 
volunteer to help without being asked, does 
he agree to help only after the victim pleads 
for help, does he help grudgingly or in friendly 
spirits, does he actually call the garage? A 
cursory view of this complex set of responess 
indicated, however, that only the help-no-help 
response yielded any meaningful differences. 

Other studies using this technique are in 
progress. An investigation of whether the sex of 
the victim is important in determining the 
subject's response is being conducted. Will a 
black female get more help than a black male? 
In addition, the political affiliations of the sub- 
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jects are being considered. Will members 
different political parties with their differs 
ideologies (conservative versus liberal party 
behave in different ways? 

The usefulness of this method is not 
to the study of intergroup attitudes. 
ample, the manner in which the vi 


help is under investigation. Determination 6 
whether help is more likely when the victim 
requests or demands help is being made. 


addition, the relationship between the 
the victim and the sex of the subj 
studied. 
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CON 


Heider 58) proposed that people ana- 
lyze human action as the joint product of an 
ability component (“can”) and a motiva- 
tional component (“try”) as follows: Effect 


= Ability x Motivation. This analysis pre- 
dicts that given an effect or an outcome, the 
attributed magnitude of exertion by an actor 
Will be inversely related to the perceived 
ability of the actor. Thus, if two persons 
achieved comparable goals, greater motivation 
would be attributed to the one with lesser 
ability: similarly, greater ability would be 
attributed to the one with lower motivation. 
Although they are both terms in the same 
equation, and both possible explanations of 
the same event, there is reason to think that 
ability and motivation are in fact quite dif- 
ferent kinds of constructs, and that the bases 
for attributing them are also quite different. 
Ability is a structural component. Structural 
Components are expected to persist with little 
change over reasonable periods of time—like 
Intelligence (Hunt, 1961). Thus, simultane- 
Ous or successive observations will be pooled 
in triangulating an underlying structural 
Construct, as if the observer recognizes the 
falibility of any single observation. The 
Mitial assumption of permanence suggests 
error as a plausible explanation of what 
otherwise might be construed as change. 


_' This research was partially supported by Na- 
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ISTENCY VERSUS DISCREPANCY AS CLUES IN THE 
ATTRIBUTION OF INTELLIGENCE AND MOTIVATION ' 
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m Kelley's attribution theory, it was predicted that when a person's ability 
id his behavior are congruent, estimates of that ability will be more extreme 
i will be made with greater confidence than when the ability and the behav- 
ure discrepant. From Jones and Davis’ attribution theory, it was predicted 
when a person's ability and his behavior are discrepant, estimates of that 
rson's motivation will be more extreme and will be made with greater con- 
nce than when the ability and the behavior are congruent. Two experiments 
e conducted that lend general support to these propositions. Results are 
scussed with a view to clarifying the distinction between structural and 
ivnamic constructs and the different bases for inferring each. 


On the other hand, as Ryle (1949) has 
argued, dynamic terms are constructs implic- 
itly summarizing change or a potential for 
change, which often acquire the connotation 
of cause in an explanatory sense or in a 
volitional sense. Motivation is a dynamic 
construct. Where motivation is concerned, 
discrepancy between behavior on two 
occasions does not suggest observational 
error, but rather enhances the perception of 
motivational influence and leads to greater 
certainty in attribution. 

The two major offshoots of Heider's “com- 
monsense” psychology—the attribution theo- 
ries of Kelley (1967) and Jones and Davis 
(1965)—may be seen as in some respects 
best suited to deal with the attributions of 
structural and dynamic properties, respec- 
tively. In Kelley’s scheme, attributions about 
entities are made on the basis of the con- 
sistency of observations (consistency over 
time, modalities, situations, and other observ- 
ers). In accord with an empirical epistemol- 
ogy, Kelley’s theory states that as observa- 
tional consistency increases, the resultant 
knowledge is more likely to be a true reflec- 
tion of the underlying environmental compo- 
nent. Accordingly, Kelley suggests that confi- 
dence in an attribution will increase with ob- 
servational consistency. 

In Jones and Davis’ scheme, attributions 
about motivation emphasize unique or idio- 
syncratic events. In accordance with infor- 
mation theory (Shannon & Weaver, 1949), 
these authors noted that the less likely an 
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TABLE 1 


ATTRIBUTED INTELLIGENCE 
Aes = 


Achievement. (grades) 


Ability (SAT score) 


: Below 
MEN Average average M 
Far above average 5.75 5.50 | $30 | 5.52 
Average 4.95 470 | 480 | 482 
Far below average 485 | 395 | 340 | 407 
M 5.18 4.50 


4.72 


event, the greater the information obtained 
if this event occurs. When a person performs 
in a manner discrepant from his expected 
role, not only are attributions about him 
more extreme, but confidence in such attribu- 
tions increases (Jones, Davis, & Gergen, 
1961). Thus, for Jones and Davis (1965), 
confidence in an attribution increases with 
increasing discrepancy between observation 
and expectation. 

Now there is a sense in which Kelley’s 
scheme is more general than the Jones and 
Davis analysis, and may indeed subsume the 
latter as a special case. Attribution of motiva- 
tion, like attribution of ability, should be- 
come more confident as an increasing number 
of instances record the same pattern of 
events—even though the pattern in question 
involves a contradiction between an expected 
behavior and an observed behavior. None- 
theless, it remains true that the emphasis in 
Kelley’s formulation on consistency between 
behavioral indicators in bolstering confidence 
in attribution seems more appropriate to 
attributions about structural components, 
like intelligence, which are believed to be 
stable over time and sources of stability over 
time, while the emphasis in Jones and Davis’ 
formulation on discrepancy between behav- 
ioral indicators in bolstering confidence in 
attribution seems more appropriate to at- 
tributions about dynamic components, like 
motivation, which are believed to change 
over time and be the sources of change over 
time. 

Experiment I was designed to test these 
predictions. From Kelley’s attribution theory, 
it was predicted that when a person’s ability 
and his behavioral outcomes are congruent, 
estimates of that person’s ability will be more 
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extreme and will be made with 


dence than when the ability and beh 
are discrepant. From Jones anc Davis 
tribution theory, it was predicte! that 


a person's ability and his behavior are d 
crepant, estimates of that person's moti 1 
will be more extreme and will be made will 
greater confidence than when the ability 
the behavior are congruent. 


EXPERIMENT | 


Method 
Subjects. A total of 270 Northw: Univer 
students served in this experiment. } rei 


subject-pool credit in his introduci 
course. 
Independent variables. A hypoth: 


person was described as having one ree 
of achievement in college, in that ! nsistei 
received above average, average, or iclow a 


college grades. He was also described 
of three levels of ability, in that he scc 
above average, average, or far belo 
his college entrance exam, the Scholii 
Test (SAT). The additional qualific: 
used in designating SAT levels to 
effect on this primary dimension, the 
sion. Finally, each subject made attribution rm 
on either the intelligence scale, the motivation 
or both scales. Thus, the total design was a : 
factorial, with 10 subjects in each cell 
Dependent variables. The intelligence sca 
7-point semantic differential anchored by ir 


ility d 


unintelligent. The motivation scale was a 

scale anchored by weakly motivated/strongly m 
vated. In addition, the subjects also rated 
confidence in these attributions on a 12-point $ a 
divided into four major categories: not sure at all 
somewhat sure, moderately confident, and extremel 
confident. 

Results 


Intelligence. Attributed intelligence 
Table 1) increased as SAT scores incre 
(F = 31.55, df = 2/162, p < .001) and a 
as college grades increased (F = 7.65, dj 
2/162, p < .001). There was no interact 
between these two factors in their effects 0l 
attribution of intelligence. _ 

Confidence in attributions of intelligent 
(see Table 2) was virtually unaffected 
information about college grades, but V 
strongly influenced by SAT scores, with : 
subjects reporting least confidence in thi 
attribution when the target person was @ 
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xtibed as ving 
Again, ther 
factors; th 
for certain 
ability an 
main diag: 
ity and gra 


an average SAT score. 
s no interaction of these two 
there was only a weak trend 
timates to be higher when 
les were congruent (on the 
M = 6.70) than when abil- 
vere incongruent (off the main 


diagonal—.1/ 57). 

Motivati \ttributed motivation (see 
Table 3) i ised as grades increased (F = 
169,09, d 162, p < 001) and—as pre- 
dicted by ler—decreased as SAT scores 
increased 16.60, df = 2/162, p < .001). 
In addition, there was a significant inter- 
action between grades and SAT (F — 6.64, 
df = 4/162 .001), apparently reflecting 
the fact that the import of information about 
SAT scores occurred primarily when the 


target person was reported to have average 
grades. 


While confidence in attributions of motiva- 
tion (see Table 4) was virtually unaffected 
by information about the SAT score, this 
confidence was strongly affected by the grades 
Manipulation——precisely the reverse of the 
pattern for confidence in attributions of intel- 
ligence, As ior SAT and intelligence, however, 
with grades and motivation, confidence was 
least when the target person was reported as 
having average grades, and greatest when he 


Was reported as having above average grades. 
Finally, confidence in estimates of motiva- 
tion was higher when ability and grades 
Were inconsistent (off the main diagonal— 

= 7.32) than when they were consistent 
(on the main diagonal—M = 5.77; t = 2.46, 
dj = 162, p < .05, two-tailed). 

As an alternative way of stating the pat- 
ten of findings in Tables 1-2 and Tables 
~4, we may report the results of two joint 
analyses of variance in which SAT, grade, 
and type of attribution (motivation or intel- 
gence) were the three factors. SAT informa- 
"on had a significantly different influence 
on intelligence attributions than on motiva- 
tion attributions (F = 20.02, df = 2/162, 
P<.001), and so did grade information 
E 23.96, dj = 2/162, p< 001). To a 
“sser extent, SAT information also had a 
Significantly different influence on confidence 
"intelligence attributions than on confidence 
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TABLE 2 
CONFIDENCE IN ATTRIBUTED INTELLIGENCE 
Achievement (grades) 
Ability (SAT score) | i 
Abow E 
average | Averuse| eem. | A 
Far above average 7.80 7.70 7.20 | 757 
Average 5.60 | 5.70 | 5.50 | 5.00 
Farbelowaverage | 6.70 | 6.70 | 6.60 | 6.67 
4 | 6.70 | 6.70 643 | 
TABLE 3 
ÄTERIBUTED MOTIVATION 
= SSS ———— 
| Achievement (grades) 
Ability (SAT score). |7 R n Tj = 
Abowv ^ Bel 
| av erage Average | average | at 
= Res ee i 
Faraboveaverag | 5.50 | 335 | 250 | 3.78 
Average | 620 | 435 | 280 | 445 
Farbelowaverage | 6.20 | 5.30 | 285 | 4.78 
M | $97 433 | 272 | 
TABLE 4 
CONFIDENCE IN ATTRIBUTED MOTIVATION 
Achievement (grades) 
Ability (SAT scores) a] ial 
Abov ; Bel 
Recta 139 erage | ay rage ar 
Faraboveaverage | 7.00 | 5:70 | 630 | 633 
Average 8.30 | 470 | 7.60 | 6.87 
Far below average 9.60 | 6.40 | 5.60 | 7.20 
M 8.30 5.60 6.50 | 


in motivation attributions (F = 3.13, dj = 
2/162, p < .05), and so did grade informa- 
tion (F = 3.47, df = 2/162, p < 05). 

These patterns were the same for the 
between-subjects analyses (considering just 
those subjects who made only one attribu- 
tion, either motivation or intelligence) as for 
the within-subject analyses (considering just 
those subjects who made both intelligence 
and motivation attributions). Hence, the 
means in Tables 1-4 are pooled over scale 
conditions, and it may be concluded that dis- 
criminations between the two constructs are 
made even when the salience of the distinction 
is not inflated by the measurement procedure. 


EXPERIMENT IT 


The finding that attributed motivation 
declined with increasing SAT, although pre- 
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dicted from Heider’s equation, still seemed 
surprising to us—in part, because it seemed 
very likely that SAT scores alone would have 
served as a direct clue to motivation, or that, 
by themselves, higher SAT scores would be 
associated with higher attributed motivation. 
This fact, if true, would be important, for it 
would imply even more strongly than the 
somewhat equivocal interaction effect in Table 
3 that motivation cannot be predicted from 
existing additive models of impression forma- 
tion (Anderson, 1965; Rosenberg, 1968), 
since the ordering of predictions from ability 
taken alone would be the reverse of the order- 
ing of predictions from ability taken to- 
gether with behavioral outcomes. This would 
very much strengthen the notion of motiva- 
tion as a different kind of construct than 
ability, that is, as a concept derived from 
discrepancies between other indicators, rather 
than a structural construct in its own right. 

Experiment IT was conducted to gather 
base-line data on the effects of SAT and 
grades on attributions when only one of 
these indicators was made available, and 
also, secondarily, to replicate the results of 
Experiment T. 


Method 


Subjects. A total of 160 Northwestern University 
students served in this experiment. Each received 
subject-pool credit in his introductory psychology 
course. 

Independent variables. These were identical to 
Experiment I except that (a) additional conditions 
were included in which the subjects received infor- 
mation about only one of the two indicators (grades 
and SAT), and (b) since the one-scale and the two- 
scale conditions of Experiment I proved so similar, 
all subjects were asked to make ratings on both 
scales (intelligence and motivation), The order of 
presentation of scales was counterbalanced. Thus, the 
design was a 4 X4 factorial, with 10 subjects in 
cach cell. 

Dependent 
Experiment I, 


variables. "These were identical to 


Results 


Replication of Experiment I. The results 
of Experiment IT closely replicated the com- 
parable figures of Tables 1-4. For both intel- 
ligence and motivation, the average squared 
deviation between the respective cell means 
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of Experiment I and Experiment II was 
less than the error terms in the analyses 
of variance, | 

Intelligence. A contrast was computed 
(Myers, 1966) that compared the mean of 
the ratings made across the three levels of 
grade information, at a given level of SAT. 
score, against the rating made at that level 
of SAT score without any grade information. 
As expected, rated intelligence increased as 
SAT scores increased; however, the rate of 
increase was faster when no information was” 
provided about grades (F = 4.79, 7/ = 3/128, 
p < .01). In the absence of grade informa- 


tion, mean intelligence attributions were 5.90, 
4.70, and 3.20 for far above average, average, | 
and far below average SAT scores, respec- 
tively. It will be noted that the and last 
of these means are more extreme than any 
estimate in Table 1, where grades also influ- 
enced attribution. This finding becomes less. 
surprising when we note that indicators of 
ability were more extremely worded in de 
first place than the indicators of achievement | 
(i.e. far above average or far below average 
SAT versus above average or below average 
grades), so that the two indicators taken 
together would yield an inference less extreme 
than that for the ability indicator alome if 


the subjects were averaging the values of 
these indicators (Anderson, 1965; Rosenberg, 
1968). 

Motivation. As indicated in Table 3, higher 


SAT scores, in conjunction with information 
about grades, led to lower estimates of moti- 
vation. However, in the absence of grade 
information, exactly the opposite is true— 
higher SAT scores lead to higher estimates 
of motivation, Mean motivation attributions 
were 5.20, 4,20, and 3.60 for the far above 
average, average, and far below average SAT 
scores, respectively. This trend is significantly 
different from the trend observed when infor- 
mation is given about grades (F= 12.18) 
df = 3/128, p < .001). 


Discussion 
Implications for Attribution Theories 


The present results lend some support ' 
the hypothesis that ability, as a structura 
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construct, will be inferred primarily to the 
‘extent that data from several indicators are 
consistent in all pointing in a single direction. 
Thus, in attribution of ability, information 
from both SAT scores and college grades 
contributed independently to the final esti- 
mate in a commonsense fashion. However, 
there was no significant indication that con- 
fidence in this estimate increased if these two 
indicators were consistent. The present results 
lend even stronger support to the hypothesis 
‘that motivation, as a dynamic construct, will 
be inferred primarily to the extent that data 
from severa! indicators (e.g. gathered at 
several points in time) are inconsistent, call- 
ing for some explanation of the divergence. 
Thus, in attribution of motivation, informa- 
tion from both SAT scores and college grades 
contributed to the final estimate in the 
manner predicted by Heider—inferred moti- 
vation increasing as achievement increased 
and decreasing as ability increased. Further- 
more, and most significantly, confidence in 
these estimates increased if the two indicators 
(SAT and grades) were inconsistent. 

It is suggestive to think of the distinction 
between the structural factor (ability) and 
the dynamic factor (motivation) as analogous 
to the distinction between the “first factor” 
and the “second factor" in a principal- 
Components factor analysis. A second factor 
d5 only extracted if the first factor provides 
an inadequate fit to the data. As a first 
Approximation, expectations of ability from 
the separate indicators of SAT scores and 
-Brades may be “averaged” to yield a struc- 
| tural factor score that attempts to sum- 

Matize performance. When the discrepancy 
etween indicators or ability expectations is 
minimal, the single construct—ability—is suf- 
ficient to account for the available informa- 
tion. When performance indicators are not 
Consistent, however, an average of the two 
indicators does not provide an adequate “fit” 
to the data, and a second factor (motivation) 
'S evolved to further describe and “explain” 
the data. 

The Present data yield better support for 
the Jones and Davis model for attribution 
than for the Kelley model, in that the sub- 
lect?’ confidence in attributions of motivation 
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increased as a function of discrepancy, while 
their confidence in attributions of intelligence 
did not increase as a function of consistency. 
Furthermore, very much in the spirit of 
the Jones and Davis model, the subjects’ 
confidence in their attributions of both 
motivation and intelligence was less when the 
dominant cue (SAT for intelligence, grades 
for motivation) described the target person 
as being average. The target person’s being 
described as different from average thus not 
only facilitated confidence in attributing a 
motivational disposition to him, but also facil- 
itated confidence in attributing a structural 
capacity to him. 

Grades and SAT were both used as clues 
for both intelligence and motivation, but SAT 
constituted the primary clue for intelligence, 
and grades, the primary clue for motivation. 
It is perhaps worthy to note that in explain- 
ing actual achievement, the subjects drew 
primarily upon a motivational construct 
rather than an ability one. If this is a general- 
izable finding, it constitutes a significant ad- 
dition to our understanding of how common 
sense partitions explanations of behavior into 
ability and motivational components. On the 
other hand, the finding may be limited by 
the fact that the target person was presented 
as a college student, and that the subjects 
presumed (regardless of the SAT manipula- 
tion) that college students in general were 
selectively recruited so that they all had the 
ability to do well. 


Implications for Linear Models of Impression 
Formation 


Present results for attributions of intel- 
ligence can be described with a simple linear 
model; both increasing SAT scores and in- 
creasing college grades, in isolation, lead to 
increasing intelligence attributions, and the 
joint effects of these two variables on intel- 
ligence attributions (see Table 1) can be 
described as an additive combination of these 
two cues. Attributions of motivation, how- 
ever, can be described in this fashion only if 
we are willing to make the unusual as- 
sumption that one of the cue dimensions— 
SAT scores—be assigned negative importance 
weights. This is because, in isolation, in- 
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creasing SAT scores lead to attribution of 
higher motivation, whereas in conjunction 
with grade information, increasing SAT scores 
lead to attribution of lower motivation. If we 
scale the implications of SAT scores for 
motivation on the basis of the attributions 
made when these scores are presented in 
isolation, we would get scale values that were 
inversely proportionate to the ones needed 
io describe the pattern in Table 3—which 
we could rectify by assigning the SAT values 
negative importance weights. 

It is also interesting to note that while 
confidence in attribution of intelligence is 
well fit by a linear model (see Table 2), as 
is intelligence itself, confidence in attribution 
of motivation shows a significant departure 
from linearity (see Table 4), that is, a sig- 
nificant interaction effect. It may perhaps be 
that structural constructs, like intelligence, 
are better described by linear models than are 

. dynamic constructs, like motivation; and in- 
deed, it may well be that one of the reasons 
that negative importance weights and similar 
issues have not occurred in the various models 
of impression formation is that this literature 
has concerned itself almost exclusively with 
how much subjects like the target person— 
or in other words, the attribution of 
“likability,” which is probably a structural 
rather than a dynamic construct. 


Implications for Education 


Besides the present study, there are now 
several investigations which indicate that 
ability, as a structural concept, will be resist- 
ant to change and will not be used to explain 
change; instead, discrepancies may be dealt 
with by attribution of differences in motiva- 
tion—with obvious dysfunctional conse- 
quences for those believed to have low ability 
(Rosenthal & Jacobson, 1968). Jones, Rock, 
Shaver, Goethals, and Ward (1968) discov- 
ered a strong primacy effect in ability attribu- 
tion, that is, that such attributions were 
relatively resistant to disconfirmation by per- 
formance changes. Thus, success on the part 
of persons attributed low ability is seen as 
due to “extra” motivation (Weiner & Kukla, 
1970), or attributed to external factors such 
as luck (Feather, 1969; Grzelak, 1965) or 
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the instructional skill of the teacher (John- 


son, Fiegenbaum, & Weibly, 1964). All of 
these things mitigate against the permanent 
incorporation or internalization of the respon- 
sibility for success as a structura! component 
of the person and hardly encourage further 
hard work and commitment by the person. 
Conversely, failure on the pari of persons 
attributed high ability is seen as due to lack 
of motivation (Lanzetta & Hannah, 1969; 
Weiner & Kukla, 1970), or else is attributed 
to external factors such as luck (Feather, 
1969; Grzelak, 1965). In this case, these 
interpretations mitigate against the incorpora- 
tion of responsibility for failure as a struc- 
tural component of the person, and encourage 
further hard work and commitment as the 
basis for altering the unacceptable outcome. 
It should be noted that the discounting of 
outcomes discrepant with ability levels by 
attributing them to motivational or external 
factors is as likely to be done by ‘he subjects 
themselves (e.g., Feather, 1969) as by exter- 
nal observers (e.g., Johnson et al, 1964). 

An ability attribution, like any «tiribution, 
is usually a convenient summary of past data 
and a useful predictor of future data; but 
we should try to prevent such attributions 
from acquiring a spurious validity simply 


because there are too many easy ways t0 
discount discrepancy events. If we cannot 
change the educational system so that ability 
attributions are not made, we should at least 
work to devise explicit whereby 
changes or discrepancies in people's per 
formances can be seen as occasions for 
revising attributions of ability, rather than 
always as cues for motivational or external 
explanations, 


means 
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This experiment investigated the processes 


norms emerge in a bargaining situation. Two independent variables, the level oí 
threat capability and the probability that newly established norms would sub 
sequently be enforced, were manipulated in a 2 X 2 factorial design. Results in 
dicate that threat capability determined the points received by the bargainers but, 
contrary to expectation, had no effect on the penalties assigned to the rules. Prob- 
ability of rule enforceability exerted a significant impact on penalties, with higher 
indemnities being assigned under lower probability. Internal analyses of the da! 

elucidate why threat capability did not affect penalties. 


Although the determinants of compliance 
with social norms have received extensive 
scrutiny by researchers, only recently has much 
attention been devoted to a related problem, 
namely, the conditions under which such 
norms emerge. Several recent studies (Michener 
& Zeller, 1970; Murdoch, 1967; Thibaut & 
Faucheux, 1965) have investigated how in- 
dividuals establish agreements that regulate 
behavior in situations involving conflicts of 
interest, These studies have emanated from 
the general premise, enunciated by Thibaut 
and Kelley (1959, pp. 127-135), that the 
content or form of such norms reflects the 
underlying power capabilities of the contract- 
ing parties. This perspective maintains that 
once norms have been established, individuals 
can influence one another merely by invoking 
these impersonal regulations. Persons, in fact, 
may prefer to exercise influence by invoking 
norms, because this avoids the direct use of 
power and conserves the valuable resources on 
which interpersonal power is based. Since the 
consequences of normative invocation resemble 
those of direct power usage, individuals will 
strive to institute norms that favor their own 
interests. Therefore, in the process of establish- 
ing norms, bargainers espousing divergent 
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SITUATION ! 


by which conflict-regulating contractu: 


use) t 
s. In 


interests may use (or threaten t 
power to attain favorable agreem: 
sequence, the balance of^powe: betwe 
bargainers becomes important, and the mor 
phological structure of emergent contractu 
norms will mirror the underlying 
capabilities of the negotiating parties. 
Testing a hypothesis derived from 
general perspective, Thibaut and Faucheu 
(1965) varied the power capabilities of 
bargainers in a mixed-motive situation am 
found that when each bargainer had 
capacity to substantially damage the othe 
interests, they enacted a large number 
normative contracts bearing stringent pen 
to limit the internecine use of power. Fe 
contracts with smaller penalties emerged Wi 
only one (or neither) of the bargainers ha 
such power capabilities. In a parallel study 
Murdoch (1967) varied the subjects’ peri 
tions of the likelihood that their adversi 
would actually use his power in the situati 
As expected, this affected the number of € 
tractual agreements, as well as the attend 
sanctions. More agreements with severe 
alties materialized when bargainers percel 
one another as likely to use their respec 
power capabilities than when they viewed e 
other as unlikely to do so. More recen 
Michener and Zeller (1970) extended this 
of investigation to coalitions and observed th: 
the strength of a coalition—that is, the da 
that it could inflict on its target—afiected 
magnitude of the penalities assigned to nor 
limiting both the activities of the coalition s. 


target and those of the coalition itself. Greater 


strength led veater normative restriction 
and stiffer p: s for violation. 

Considere ther, these studies indicate 
that power « ities can determine both the 
occasion and the form of emergent contractual 
regulations. > studies share several char- 
acteristics. T: h case, the only norms that 
subjects cou! plicitly establish were those 
placing a li n power capabilities. When 
establishing norms, the bargainers were 
required to st te the sanction to be imposed 
in the even ‘iolation. Moreover, with the 
exception of -doch’s study, these sanctions 

re indem: That is, fines forfeited for 

e violation were paid directly to the other 

rty in the ::-zotiations, thereby compensat- 

g him for damage caused by the trans- 
gressor’s act Further—and perhaps most 
importantly ch of these studies assured 
perfectly reli enforcement (via the experi- 
menter) of auy and all contractual norms 
tstablished by the bargainers. Whenever they 
treated a ru's, both negotiators realized that 
Violation wes untenable and enforcement, 
inevitable. 

In everyday interaction, however, norms 


(contractual or otherwise) are not perfectly 
enforced. As attested by the statistics on crime 
in natural settings presented by the President's 
Commission on Law Enforcement and the 
Administration of Justice (1967, pp. 1-46), 
Violators frequently avoid apprehension, escape 
Punishment, or both. Some rules, of course, 
may prove more difficult to enforce than others. 
The probability that a rule or agreement 
could be successfully enforced may become a 
Pivotal concern at the time when that agree- 
ment is established, Certainly, negotiators will 
Ponder the likelihood of successful enforcement 
efore assenting to contractual stipulations. 
Norms that individuals can flout with impunity 
àre frequently of limited utility. 

Ih situations where adversaries can unleash 
Power tactics against one another and where 
| COntractual agreements constitute the only 
ve at least, the primary) means to constrain 

* usage of power, the probability of sub- 
Sequent enforcement should especially affect 

© sanctions appended to contractual agree- 
iud The present study hypothesized that 

der low probability of enforcement, bar- 
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gainers will place high penalities on the agree- 
ments in the hope of deterring violations by 
raising the expected cost of aberrance. In 
contrast, high probability should occasion 
lower penalties. Bargainers will realize that 
when delitescent transgressors view appre- 
hension as very probable, less stringent penal- 
ties will suffice to deter violations. 

In addition to reflecting the probability of 
enforcement, penalties should vary as a func- 
tion of power and threat capabilities. Con- 
sistent with earlier studies, higher levels of 
threat potential should lead to stiffer penalties. 
Moreover, there may be an interaction effect 
between probability of enforcement and threat 
potential in determining penalties assigned to 
norms. The present study hypothesized that 
when a high level of threat occurs in conjunc- 
tion with a low probability of enforcement, 
penalties will be especially severe. Faced with 
extreme threat, bargainers will desire to fore- 
stall potential disruption, and for this reason 
they may seek the establishment of norms to 
limit the use of power. If they anticipate, 
however, that such agreements will defy 
reliable enforcement and prove only sporadi- 
cally effective, they will try to bolster deterr- 
ence by imposing acute penalties. Only by this 
means can they insure that these norms will 
be sufficiently robust to prevent rampant 
violation. 

In sum, this line of reasoning, which empha- 
sizes the deterrent effect of normative sanc- 
tions, leads to the hypotheses that (a) penalties 
will be more severe under low probability of 
enforcement than under high, (b) penalties 
will be more severe under high threat than 
under low, and (c) penalties will be especially 
severe under the interactive conditions of low 
probability and high threat. 


METHOD 
Overview 


Participating in two-person groups, the subjects 
bargained over the allocation of rewards in a mixed- 
motive situation, The negotiations were based on a 
payoff matrix that delineated four possible outcomes; 
a fifth, independent outcome was available outside the 
matrix, Although the subjects in a dyad occupied differ- 
ent roles, each exercised some power over the other. 
One bargainer held the prerogative of allocating points 
whenever one of the four outcomes within the matrix 
was chosen. The other derived power from his capacity 
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Bargainer 
BDP 
ci c2 
| Total = 195 points | Total = 50 points 
R1 BT must receive at BT must receive at 
least 10 points least 10 points 
Eas 
T UTE NEM 
Total = 50 points | Total = 75 points 
R2 BT must receive at | BT must receive at 
least 10 points | least 20 points 


Fic. 1. Payoff matrix for Trial 1. (In the low-threat 
condition, the external alternative equals 80 points 
for the bargainer choosing the external alternative and 
70 points for the other; in the high-threat condition, 
the external alternative equals 80 points for the 
bargainer choosing the external alternative and 0 
points for the other.) 


to choose the alternative outside the matrix, thereby 
damaging the interests of the first bargainer. Subjects 
could limit these power capabilities by written con- 
tracts that regulated the point distribution and out- 
lawed the external alternative. 

Two independent variables were manipulated in a 
2 X 2 factorial design. The study varied threat capa- 
bility—that is, the damage that one bargainer could 
inflict on the other—by differentiating the payoffs in 
the external alternative. Similarly, by introducing 
differences in the procedures for apprehending violators, 
the design manipulated the probability that contrac- 
tual norms would be enforceable. Subjects bargained 
over a series of six trials, during which several depen- 
dent variables were measured. These included the 
allocation of points, the number and types of contrac- 
tual norms formed, the penalties imposed for violation 
of these norms, and the number of resulting violations. 
A postexperimental questionnaire assessed the sub- 
ject's perceptions of the negotiations. 


Subjects and Procedure 


Eighty male undergraduates at the University of 
Wisconsin participated as subjects in the experiment. 
They constituted 40 two-person groups. Members 
within a dyad understood that they represented oppos- 
ing teams and that their goal was to win points for their 
respective sides. Subjects expected to earn $1.50 for 
the J-hour experiment, and, to heighten competitive 
motivation, they were informed that a bonus of $1 
would subsequently be paid (by mail) to each of the 40 
bargainers comprising the winning team, 

Each dyad was randomly assigned to one of the four 
experimental treatments in the 2 X 2 factorial design 
(Threat Capability X Probability of Enforcement). 
The bargainers within each dyad occupied different 
roles (called *Red" and *Blue" in the experimental 
setting), and they were assigned to these roles ran- 
domly. Subjects sat at the opposite ends of a table, 
separated by a screen of eye-level height that enabled 
them to make decisions in private. The seating positions 
of the bargainers (roles) were counterbalanced across 
experimental treatments to offset any systematic loca- 
tion effects. 
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The subjects read printed instructi 
situation; these were subsequentl; 
experimenter to assure uniformity 


The instructions explained that th: iment con 
sisted of six trials, the first two of v re practice 
trials to familiarize subjects with t off matrix 
and the bargaining procedures. Points ing toward 
the team totals could be scored durin our regular 
trials, but not during the two practic i 

With the experimenter absent fror bargaining 
room, the subjects began the first tri ich allotted 
4 minutes for negotiations. The pay ix confront- 
ing the subjects on this trial appears in igure 1; the 
same matrix was also used on all subsequent trials 
The matrix portrays four internal outcomes (RC; 
Ri,C2; R2,Ci; Re,C2), as well as an external alternative. 
While the internal matrix values remained constant 
across conditions, the value of the ex alternative 
varied, depending on the experin treatment 
(high- versus low-threat capability). ! z the bar- 
gaining session, the subjects discussed +! ssible out- 
comes and attempted to strike a deal fa ble to their 
respective teams. After time had elap the experi- 
menter spoke to the subjects over an i m system, 
instructing them to cease negotiation d to write 
their final decisions on printed decisio» forms. One 
bargainer chose, in private, among thre options. He 
could pick either column 1 or column 2 ('«., C; or C2), 


or he could choose the external alte ve. Simul- 
taneously, the other bargainer privately decided among 
three similar options: row 1 or «ow 2 (i.c., Ri or Re) 
or the external alternative. The inte ion of the 
bargainers’ choices determined the outcome for the 
trial. Thus, one of five possible outcomes would result 


(RyCi; RyCs; Re,Ci; Ros; or the external alter 
native). If either (or both) subjects chose the external 
alternative, that preference dominated other out- 
comes, and the payoffs for the trial were those specified 
by the external alternative. After the 
dicated their choices in writing, the ex; ri 
entered the bargaining room, inspected the decision 
forms, and announced the joint outcomes. ; 

As indicated in Figure 1, the four outcomes in the 
payoff matrix each specified only a range of point out- 
comes, not the precise values to be allotted to the two 
bargainers. Thus, whenever the subjects chose one of 
these four outcomes, it was still necessary to determine 
the exact point distribution (within the limits delineated 
by that outcome). This final decision was the preroga- | 
tive of one of the bargainers throughout the experi- 
ment, as designated by the role assignments at the 
experiment's outset. Whenever one of the four matrix 
outcomes resulted, the experimenter left the room; 
and the bargainer with the right to divide the pou 
(hereafter labeled BDP) indicated (in writing) the fina! 
allocation of points for the trial. Clearly, the preros® 
tive to divide points gave BDP power over the other 
bargainer. 

The other bargainer was not impuissant, however 
because he derived power from the use of the ere 
alternative. While both bargainers could choose F 
external alternative, BDP had little motivation to 
so, for it provided poorer outcomes than he wou 


cally obtain within the matrix. In contrast, the 
bargaüiner (who may be designated as BT, 
he could use the external alternative as a 
found the external alternative relatively more 
i . In choosing it, he could not only deliver 
point outcomes to BDP, but also nullify BDP's 
viding prerogative (because BDP could allo- 
nts only if the resulting outcome was one of the 
n the matrix). 
the subjects had completed Trial 1, the experi- 
reentered the bargaining room and gave them 
er set of written instructions. These introduced 
procedures by which the subjects could establish 
etual rules during the remaining trials. Three 
the division rule, the external alternative rule, 
the row-column rule—could be formed. The 
rule placed a normative constraint on BDP, 
how he must divide the points in the event 
‘outcome RC; occurred. Since Ri,C; was the 
jointly profitable and attractive outcome in the 
ix, this rule limited BDP's power over BT's re- 
Tn contrast, the external alternative rule pro- 
BT from choosing the external alternative, and 
constrained him to one of the four outcomes 
the matrix. In essence, this rule disarmed BT by 
ng the threat of the external alternative. The 
umn rule required BDP to choose a particular 
(whichever the bargainers specified) and BT to 
a particular row. This rule not only limited the 
s capacity to disrupt, but also enhanced co- 
lation by prohibiting deviation from a specified 
ome within the payoff matrix. Structurally, then, 
experimental situation incorporated three rules 
ng the use of power by the opposing parties. 
les were established by written contract. Using 
ed contract forms provided at the start of each 
“(Trials 2-6), the subjects could establish as 
rules as they wanted (i.e., from zero to three on 
trial). A rule emerged only when both bargainers 
ed concurrence by signing the contract and 
lating a penalty for violation. These penalties, 
h were indemnities to be paid by the transgressor 
other, always ranged from a minimum of 1 to 
aximum of 500 points. Penalties of 0 points were 
bited, as were those exceeding 500. Indemnities 
established separately for each rule formed, so 
ent rules could carry different penalties if the 
ainers desired, To assure that subjects would not 
ü penalties profligately, each bargainer was re- 
td to pay the experimenter a “contract fee" of 3 
ts per 100 penalty points assigned to each rule; 
Procedure emulates that used by Thibaut and 
ler (1969). Although rules could be enacted only for 
trial at a time, they could be reestablished on 
quent trials if the bargainers concurred. 
e rules had been established during the 4-minute 
tiating period, they were considered binding on the 
ects’ behavior for that trial. Any bargainer trans- 
ing a rule was required to indemnify the other 
iner by the specified penalty if he was appre- 
during the violation. Apprehension was not 
however, for the likelihood of enforcement was 
1 or $, depending on experimental treatment. 
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Enforcement occurred via the experimenter, After the 
bargainers rendered their decisions, the experimenter 
(who had been absent during the negotiations) returned 
to the room, examined both the rule contracts and the 
bargainers' outcome choices and determined whether 
any violations had occurred. In the event of violation, 
the experimenter tossed a die to determine whether the 
rule would be enforced or not. 

After the subjects had completed Practice Trials 1 
and 2, they began the regular trials (3-6), during which 
they could gain points for their respective teams. 
Trials 3-6 resembled Trial 2; no other procedural differ- 
ences were introduced. After the subjects had com- 
pleted five of the six trials (and were therefore deeply 
involved in the bargaining), the experimenter adminis- 
tered a questionnaire eliciting perceptual and attitu- 
dinal information about the normative contracts, the 
relative power of the bargainers, and the rule violations. 
Subjects then resumed bargaining and completed Trial 
6. Subsequently, they were debriefed and paid $1.50. 
The extra $1 bonuses promised as inducements for 
successful bargaining were later mailed to the 40 
members of the winning team. 


Rule Contracts 


Rules assumed the following form. The division rule 
stated: “If outcome R;,C; results on this trial, [BDP] 
will divide the points as follows: [BDP] will receive__ 
points, [BT] will receive —— points.” In other words, 
the division rule was a written agreement between 
BDP and BT specifying how BDP would apportion 
the 195 points available from the highly attractive 
outcome Ri,C;. As such, the division rule constituted 
a normative instrument by which BT could handcuff 
BDP, requiring him to allot the points from RyCiin a 
fashion acceptable (i.e., favorable) to BT. 

In contrast, the external alternative rule restricted 
the behavioral latitude of BT. This rule stated: “On 
this trial, [BT] agrees that he will not choose the ex- 
ternal alternative." Since BT's strength rested on the 
threat of picking the external alternative, a contractual 
norm prohibiting this choice nullified the threat and 
bridled BT's power. 

The row-column rule required the bargainers to 
choose a specified outcome within the matrix. It stated : 
“On this trial, we agree that [BT] will choose: Ry or 
Rs (indicate), and that [BDP] will choose: C; or Cz 
(indicate)." The row-column rule, then, stipulated the 
exact choices to be made by each bargainer. It provided 
a basis for coordination between the bargainers and 
also in some degree limited the power of BDP and BT. 


Independent Variables 


This study included two independent variables. One 
of these, BT's threat capability, was manipulated by 
variations in the point outcomes attached to the ex- 
ternal alternative. Figure 1 shows that if one bargainer 
chose the external alternative while the other did not, 
the former would receive 80 points, and the latter 
would receive either 70 (in the low-threat condition) 
or 0 (in the high-threat condition). If both picked the 
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external alternative, they each received 80 points. 
Since threat capability is defined theoretically as 
one's capacity to block or damage another's outcomes, 
the external alternative option afforded substantially 
more power to the person picking it when it gave 0 
points to the other than when it gave 70 points to the 
other. While both bargainers could pick the external 
alternative, BDP was unlikely to do so, for he could 
usually obtain more than 80 points by staying within 
the matrix; generally, his only motivation for choosing 
the external alternative would be self-defense against a 
threat from BT. In essence, then, differences in the 
payoffs associated with the external alternative varied 
the power wielded by B'T, but did not materially alter 
BDP's capacity to damage BT's interests. 

"The other independent variable was the probability 
that contractual rules would be enforced. The likeli- 
hood of enforcement was either ġ or §, depending on the 
experimental treatment. Under low probability (1), 
a toss of 6 on the die meant that the violator had been 
"caught," while any other value meant that he escaped 
with impunity. Under high probability (2), the obverse 
Obtained: a toss of 6 indicated successful violation, 
while any other value meant apprehension of the 
violator. If several violations occurred on a given trial 
(either by the same bargainer or by different bargainers), 
these were treated as independent instances, and a die 
was tossed separately for each infringement. 


Dependent Measures 


Several dependent variables were measured. The 
most important dependent variable—the penalty 
points assigned to each of the three types of rules—was 
predicted to vary as a function of both threat and prob- 
ability of enforcement. Ancillary dependent variables 
included the number of rules formed and the number 
of points obtained by each bargainer. For each depen- 
dent variable, summative scores (i.e., the sum of values 
for Trials 3-5) were used in the analysis. To obviate 
possible ending effects, responses from the sixth trial 
(after the administration of the questionnaire) were not 
treated as data and were not analyzed. 

Data on the magnitude of penalties originated from 
the contract forms used by the subjects to establish 
rules, Separate scores (summed over Trials 3-5) were 
obtained for each of the three rules, In calculating these 
scores, a penalty of O points was tallied for those in- 
stances where bargainers enacted no rules. 

Information on the number of rules formed by the 
bargainers over Trials 3-5 also stemmed from the con- 
tract forms. Again, separate scores were obtained for 
each type of rule. 

Data on the points allocated to each bargainer arose 
from the printed decision forms used by the bargainers 
in specifying their outcome choices. Separate scores 
(summed over Trials 3-5) were obtained for BDP and 

BT. These scores were the number of points obtained 
prior to payment of penalties for violation and prior to 
receipt of indemnities from violations by the other. 

Additional information on the bargainers’ perceptions 
of relative power and the importance of rules and 
penalties emanated from the questionnaires adminis- 
tered between the fifth and sixth trials, 
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RESULTS 

Check on Experimental Manipulc ions 

Threat capability. The threat manipulation 
was designed primarily to var; BT’s power 
in the situation, since BDP’s power rested in 
his capacity to allocate points w ‘vin the pay- 
off matrix. Several tests indicate t: «t variations 
in the external alternative payoffs were success- 
ful in manipulating BT’s threat capability, 
First, BDP perceived BT as more powerful 
under high threat (80-0). His responses on the 
postexperimental questionnaire ‘o the item 
“How much influence did [BT | have in the 
bargaining situation?" showed » significant 


threat main effect (F = 4.2 if = 1/36) 
p < .05), Moreover, as expected, D P showed 
less enthusiasm than BT about ‘he external 
alternative outcome. A £ test difference 


of means, comparing BDP's axl BT's re 
sponses to the question “As a poss; ie outcome, 
how attractive was the external : t crnative to 
you?" showed a significant differesce (£ = 3.16, 
p «.005), with BDP finding the external 
alternative significantly less desirs^!c than BT. 
The variation in payoff values for the external 
alternative choice, then, did effectively mant- 
pulate BDP's perception of B's power to 
damage his interests. 

Probability of enforcement. Variation in the 
probability of enforcement affected both 


BDP and BT, since each could violate contrat- 
tual norms. Both bargainers accurately pef 
ceived the manipulation. They were asked, 
“What was the actual probability of getting 
caught if you violated any rules formed during 
the experiment?" Both BDP’s response 
(F = 95.61, df = 1/36, p < .001) and BTS 
responses (F = 125.38, df = 1/36, p < 001) 
evince the expected probability main effect. 
Moreover, this manipulation directly affected 
the subjects’ perceptions of the probability 
that they would be apprehended in the event 
of transgression. The bargainers were ask Í 
“In your opinion, what was the likelihood 0 
getting caught if you violated any rules tha 
you formed during the experiment?” BDF 
responses showed a probability main effe 
(F = 54.14, df = 1/36, p < .001), and so di 
BT's (F = 47.60, df = 1/36, p < 
Clearly, subjects perceived the likelihood ° 
enforcement as greater under $ then under ë 
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Dependent Variables 
Points. Table 1 reports the mean points 
(summed o frials 3-5) received by BDP 
and BT, as a function of threat capability and 
probability enforcement. An analysis of 
variance of |!) P’s points indicates a significant 
threat main effect (F = 7.00, df = 1/36, 
p € 05), a nonsignificant probability of en- 
- forcement main effect (F < 1, ns), and a non- 
significant reat X Probability interaction 
(F < 1, ns). \ similar analysis of BT's points 
evinces n :nificant effects (for threat, 
F = 2.32, df = 1/36, ns; for probability of 
enforcemen 1, ns; and for the interaction, 
F < 1, ns). Thus, when BT had higher threat 
capability, BDP received fewer points, but 
BT did no: succeed in obtaining more. In 
terms of points attained, threat benefited no 
one. Althou:;'; the data indicate that BT did 
not actually make the aggressive move of choos- 
ing the external alternative more frequently 
under high threat than under low threat 
(E = 1.02, ¿f = 1/36, ns), the smaller quantity 
of points attained by BDP when the external 
alternative was chosen under high threat (0, 


in contrast to 70 under low threat) obviously 
accounted for BD P’s reduced outcomes. 

Number of rules formed. Table 2 reports the 
Mean number of rules formed over Trials 3-5, 
for each of tlie three types of rules, as a func- 
tion of threat capability and probability of 
enforcement. 

An analysis of variance for the frequency of 
formation of the division rule shows no signifi- 
cant effects (for threat capability, F < 1, ns; 
for probability of enforcement, F = 2.88, 
df = 1/36, ns; and for the interaction, F < 1, 
ns). A similar analysis for the external alter- 


TABLE 1 
Mean Points RECEIVED BY Each BARGAINER AS A 
Function or THREAT CAPABILITY AND 


E PROBABILITY OF ENFORCEMENT 


| Bargainer 
Treatment —~ -4 
E-- : Lys BDP | BT 
l threat capability | 
probability 234.5 175.5 
Lor obability 234.0 188.0 
s reat capability 
$ probability 278.6 221.4 
4 Probability 306.4 201.1 
Coli 
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TABLE 2 


Mean NUMBER OF RULES ESTABLISHED AS A 
FuNCTION OF THREAT CAPABILITY AND 
PROBABILITY OF ENFORCEMENT 


Rule type 
Treatment 
A External Row- 
Division atctontive column 
High threat capability 
$ probability 1.20 60 .90 
& probability 2.10 1.90 1.90 
Low threat capability 
$ probability 1.40 .60 1.30 
è probability 1.70 1.20 .90 


native rule, however, indicates a significant 
probability of enforcement main effect (F 
= 8.71, df = 1/36, p < .01), a nonsignificant 
threat effect (F = 1.18, df = 1/36), and a 
nonsignificant interaction (F — 1.18, df — 1/ 
36). The bargainers, apparently at BDP's 
behest, established the external alternative 
rule more frequently under $ probability of 
enforcement than under $ probability. 

The row-column rule, intended as a vehicle 
both to limit power and to promote coordina- 
tion, displays a different pattern regarding the 
frequency of formation. While the analysis of 
variance indicates that neither the threat 
capability main effect (F — 1.02, df — 1/36), 
nor the probability of enforcement main eflect 
(F = 1.02, df = 1/36) is significant, the 
Threat X Probability interaction effect does 
emerge as substantial (F = 5.53, df = 1/36, 
p € .05). The mean frequency of row-column 
rules forming under $ probability-high threat 
is greater than the means for the other treat- 
ments in the design. Evidently, this occurs 
because BDP strives for maximum protection 
under this condition; he does this by demand- 
ing that the bargainers establish not only the 
external alternative rule but also the row- 
column rule. Thus, if BT chooses the external 
alternative when both norms prevail, he is 
subject to apprehension for two violations, and 
this is the maximum level of protection that 
BDP can achieve in the situation. 

Rule penalties. Table 3 reports the mean 
value of rule penalties over Trials 3-5, for each 
of the three rules, as a function of threat 
capability and probability of enforcement. 
Tables 4, 5, and 6 summarize the analyses of 
variance for the division rule, the external 
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TABLE 3 


Mean Rute Penatties as A Function or TukkAT 
Carauury AND PROBABILITY OF ENFORCEMENT 


Rule type 


Treatment 
External Row- 


| Division Tl column 


High threat capability 


$ probability 115.0 41.0 | 463 

į probability | 374.7 | 3008 296.5 
Low threat capability | 

1 probability 170.0 | 925 | 93.5 

| 237.5 157.5 


1 probability 298.1 


alternative rule, and the row-column rule, 
respectively. All rules evince the same pattern : 
a strongly significant probability of enforce- 
ment main effect, a nonsignificant threat capa- 
bility main effect, and a nonsignificant inter- 
action effect, For each rule, the mean penalties 
are significantly higher under 3 probability of 
enforcement than under § probability. As 
predicted, bargainers assign higher penalties 
when enforcement is less certain. Contrary to 
expectation, however, the data fail to show a 
significant threat capability main effect. 


Discussion 


The data on rule penalties, therefore, do not 
fully substantiate the hypotheses advanced in 
the earlier theoretical discussion. While they 
do evidence a strong probability of enforcement 
main effect (as predicted), they fail to replicate 
the threat capability main effect found by 
earlier studies, and they disconfirm the pre- 
dicted Threat X Probability interaction effect. 

To elucidate this pattern of results, one must 
ask what function was fulfilled by the rules 
in the bargaining situation. The answer to this 
proves to be complex, and it begins with a 


TABLE 4 


ANALYSIS OF VARIANCE SHOWING THE EFFECTS OF 
THREAT CAPABILITY AND PROBABILITY or EN- 
FORCEMENT ON DIVISION RULE PENALTIES 


bal o. nd miM. ee e = 
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TABLE 5 
ANALYSIS or VARIANCE SHOWING LPPECTS oF 
THREAT CAPABILITY AND PROBA: or Ex- 


FORCEMENT ON EXTERNAL ALT IVE 
RULE PENALTIES 


| 
Source of variation df F 


Threat capability (A) Du 1| «1 
Probability of enforcement 

(B) 1 | A | 9.86* 
AXB 1 «Lr 

Error 36 

*p <0. T 
determination of which barga wanted 
which rule(s). 

Rule Preference a Trai: 

The postesperimental question:: c asked 
both BDP and BT: “In your opi: to what 
degree was it desirable to adopt tie division 
rule, whether the rule was actua!!: :blished 
or not?" Parallel questions were asked for the 
external alternative rule and the row-column 
rule. Tests for difference of means indicate that 
BT found the division rule more desirable 
than did BDP (¢ = 2.05, p < .025), and BDP 


found the external alternative rule more de 
sirable than did BT (t = 3.22, p < .005). The 
data reveal no significant difference in the 
extent to which BT and BDP desired the row- 
column rule (t = 1.64, ns). 

Importantly, these preferences were actually 
translated into the formation of rules during 
the negotiations. The partial correlation be 
tween BT's response to the above question 0n 
the desirability of the division rule and the 
number of such rules actually formed, holding 
statistically constant BDP's answer to the 
same question, yields a value of .389 (p = .014), 


TABLE 6 


ANALYSIS OF VARIANCE SHOWING THE EFFECIS OF 
THREAT CAPABILITY AND PROBABILITY OF EN- 
FORCEMENT ON Row-COLUMN RULE 


= PENALTIES 
Source of variation ae MS F Source of variation df MS rs 
Threat capability (A) 1| 11664|«1 Threat capability (A) 1| 21068.1 | «1 
Probability of enforcement Probability of DOE " 
(B) 1|3750421| 6.76* 1|2468041| 794 
AXB 1| 432064 | «1 AXB 1| 86676.1| 27 
Error 36 | 556169 Error 36 | 31077.8 
*p «.025. TS 


*p <01. 


indi: BT’s advocacy played a cen- 
i blishing this rule. The partial 
BD P's answer to this item and 
ivision rules formed, controlling 
nse, is a nonsignificant .266 
enever the division rule, which 
P’s power, was established by 
it emerged primarily because 
vanted it, 
| correlation analyses for the 
"native rule yield a value of .380 
DP and a nonsignificant value 
937) for BT. As would be ex- 
<ternal alternative rule, which 
s threat capability, was estab- 
’s behest, not at BT's. 
:rtial correlations for the row- 
hich limited the behavior of both 
| also facilitated coordination, 
significant .675 (p < .001) for 
) (p = .073) for BT. The bar- 
instiiuted this rule especially when 


gs, then, are consistent with 
T wanted the rule that limited 
aneuverability, and BDP wanted 
rule that constrained BT's threat capa- 
y. The division rule emerged when BT 
ited it; the external alternative rule, when 
| it; and the row-column rule, when 
(and especially BDP) wanted it. 
erestingly, the data indicate that the 
sion rule and the external alternative rule 
to occur together. The correlation 
en the number of division rules formed 
ind the number of external alternative rules 
omed is 533 (p < .001), and the correlation 
en the penalties assigned to these rules is 
(p < .001). When the bargainers estab- 
hed one of these rules, they usually instituted 
the other. This may constitute a process of 
trading, whereby BDP gave his consent 
the division rule in exchange for BT's 
liescence to the external alternative rule. 
arently, however, this trading process 
down under high threat, which may 
n why the data do not substantiate the 
licted threat main effect and the predicted 
at X Probability of Enforcement inter- 
effect on the penalties assigned to the 
BT strongly opposed the external alter- 
rule under high, threat. This seems to 
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have occurred not only because the external 
alternative rule would disarm him but also 
because trading this rule for a division rule 
would not provide adequate protection against 
BDP. The situation permitted both BT and 
BDP to choose the external alternative. 
Normally, BDP would have no motive for 
doing so (except to protect himself against 
BT's threats) because he could gain more 
points within the matrix. Under high threat, 
however, there arose the slight possibility that 
BDP would (at some sacrifice to himself) pick 
the maa alternative in order to severely 


damage int total. BT noted this po- 
tentiality. i repens to the question 
“How threatening w e possibility that the 


other person might choose the external alter- 
native?” show a threat effect (F = 4.04, 
df = 1/36, p < .06). In consequence, BT re- 
sisted any external alternative rule restraining 
his behavior and did not trade an external 
alternative rule for a division rule. No more 
such rules were established under high threat 
than under under low threat, and the penalties 
likewise failed to respond to differences in 
threat. 

The implication, then, is this: the empirical 
generalization (supported by earlier studies) 
that greater threat leads to more stringent 
normative sanctions should be qualified. Ap- 
parently, this generalization holds only when 
the norms that one can establish in the situa- 
tion provide assurance of complete disarma- 
ment—that is, that power usage will be 
definitely curtailed. Rules containing loop- 
holes or providing only for partial disarma- 
ment will not be more stringently sanctioned 
under high threat than under low. 


Indemnities and Deterrence 


Although threat had no effect on rule 
penalties in the present study, probability of 
enforcement did. Bargainers established penal- 
ties that were two to three times more severe 
under $ probability of enforcement than under 
$ probability. As indicated in the earlier theo- 
retical discussion, it had been expected that 
penalties would deter the overt use of power. 
Surprisingly, however, the data affirm that 
penalties were not effective in preventing viola- 
tions. Higher penalties did not lead to fewer 
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violations, even when the probability enforce- 
ment was statistically controlled. This is re- 
flected both in the data on actual violations 
and in the data on reported temptation to 
violate. An item on the postexperimental 
questionnaire asked, “How tempted were you 
to violate any rules that formed?” If higher 
penalties lessen the proclivity to violate, signi- 
ficant negative correlations should emerge be- 
tween magnitude of the penalties and reported 
temptation to violate, when probability of en- 
forcement is held constant. The data do not 
evince such a pattern. For BDP, the partial 
correlation between reported temptation and 
the division rule penalties, with probability of 
enforcement held constant, is .173, and that 
between temptation and the row-column rule 
penalties is .228. For BT, the partial correla- 
tion between temptation and the external 
alternative rule penalties, with probability 
held constant, is .092, and that between temp- 
tation and the row-column rule penalties is 
-168. In no case do the expected significant 
negative relationships appear. 

More impressively, the relationship between 
magnitude of penalties and the frequency of 
actual violations (summed over Trials 3-5) also 
fails to run in the expected negative direction. 
The partial correlation between BDP's actual 
violations and the division rule penalties, with 
the probability of enforcement held constant, 
is .330; and the analogous partial correlation 
between BDP's violations and the row-column 
rule penalties is .254. For BT, the partial cor- 
relation between violation and the external 
alternative rule penalties, with probability of 
enforcement constant, is .189, and that be- 
tween violations and row-column rule penalties 
is .131. Again, the expected negative correla- 
tions do not appear, indicating that stringent 
penalties were no more effective in deterring 
transgression than lenient penalties: an un- 
anticipated finding. 

But if the penalties do not palliate tempta- 
tion or deter violation, what function did the 
rules serve in the negotiations? Apparently, 
they reduced uncertainty regarding outcomes 
in the situation. The postexperimental ques- 
tionnaire asked each bargainer, “To what 

extent did the rules and penalties help reduce 
your uncertainty about the outcomes that the 
other bargainer would choose?" As might be 
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expected, the various rules h ifferential 
effects on the two bargainers. h the ex- 
ternal alternative rule and the r« umn rule 
significantly reduced BDP's r: d uncer- 
tainty regarding the consequ: of BT's 
behavior, while both the divi rule and 
the row-column rule significantly :ed BT's 
uncertainty regarding BDP's ortment. 
The more rules, the greater the iction in 
uncertainty. This was true re » of the 
probability of enforcement. | DP, the 
partial correlation between repo: duction 
of uncertainty and the number external 
alternative rules, with probability cî enforce- 
ment held constant, was .389 (p = .014), and 
the analogous partial correlation for the num- 
ber of row-column rules was .325 () = .044). 
For BT, the partial correlations bei ween reduc- 


tion of uncertainty and the number of division 


rules, with probability of enforce ment con- | 
stant, was .466 (p = .003), while that for the 
number of row-column rules wes .392 (p 
= .014). Both subjects perceived the rules— 
especially the rules constraining t+e other's 
behavior—as reducing the uncertainty of 
outcomes. 

But how can the formation of rules reduce 


uncertainty regarding outcomes if, as in the 
present instance, penalties do not deter viola- 
tions? Since stringent penalties fail to constrain 
deviance more than lenient ones, how can 
rules suppress errant behavior and diminish 
uncertainty? 

The subjects felt that rules reduced uncer- 
tainty about the outcomes not because they 
deterred the other bargainer's (incluctable) 
violations, but because they provided a form 
of insurance against the damage wrought by 
the other's aberrance. In the present study, 


B. 


the penalties assigned to the rules were” 


indemnities. If a bargainer violated a rule and 
was caught in the act, he forfeited the specified 
penalty to the other bargainer, not to the ex 
perimenter. In this sense, then, the penalties 
constituted a form of insurance. Evidently, 
the subjects assigned higher penalties under 6 
probability than under $ probability because 
this increased the expected value of the Jm 
demnifying payments that they might T€ 
ceive in reparation for the violations by the 
other bargainer under low probability ° 
enforcement. 


pretation gleans added credence 
: on the indemnities actually 
the bargainers over Trials 3-5. 
relational analysis reveals that a 
oportion of the variance in the 
f the indemnities paid is accounted 
oint effect of the size of the rules’ 
d the number of violations. The 
relation coefficient between in- 
ctually paid by BDP (the de- 
iable) and the division rule penal- 
column rule penalties, and the 
violations by BDP (the three pre- 
(sables) is .545, which is significant 
he .005 level. Similarly, the multiple 
coefficient between indemnities 
peid by BT (the dependent variable) 
external alternative rule penalties, 
‘umn rule penalties, and the num- 
ions by BT (the predictor vari- 
.156, which is significant beyond 
" Thus, the magnitude of the rule 
nd the number of violations, con- 
ilv, account for a substantial pro- 
the variance in the indemnities 
; paid, although some variance still 
T unexplained because the bargainers 
e of course not apprehended every time 
violated a rule. Clearly, a bargainer could 
ect himself to some degree against the de- 
redatory transgressions of the other bargainer 
insisting on stringent sanctions for rules 
iting the other’s behavior. The higher 
alties established under $ probability of 
forcement did not deter violations, but they 
d assure higher payments in those (infre- 
nt) occasions when violators were appre- 
ded. In this fashion, bargainers facing § 
robability endeavored to assure good out- 
mes despite adverse conditions. 
Tn conclusion, several points should be 
d. The present study has demonstrated 
anticipated probability of enforcement 
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affects the penalties assigned to negotiated 
contracts. Probability of enforcement ac- 
counted for 18% of the variance in the division 
rule penalties, 27% of the variance in the. 
external alternative rule penalties, and 22% 
of the variance in the row-column rule 
penalties. Moreover, the findings suggest that 
several issues merit further research. As dis- 
cussed earlier, the failure of the data to sup- 
port the hypothesized threat effect on rule 
penalties hints that negotiations over complete 
disarmament may differ radically from negotia- 
tions over partial disarmament. The proportion 
of power usage actually limitable by contrac- 


tual norms may be an important factor in - 


normative emergence, Second, the inability of 
the indemnities in the present study to deter 
transgression suggests that the nature of the 
penalties—indemnities versus nonindemnifying 
fines—may prove an important variable in 
negotiations. Both issues deserve additional 
study. 
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REDUCING THE INFLUENCE OF AN AGGRESSIVE MO 


THE RESTRAINING EFFECTS OF DISCREPANT 
MODELING CUES! 


ROBERT A. BARON ° 
University of South Carolina 


Sixty male undergraduates participated in an experiment designed to investiga: 

the hypothesis that the strong aggression-eliciting influence of an aggressive mode 

could be counteracted by the presence of a restrained, nonaggressive model. In ord 

to investigate this suggestion, subjects in four experimental groups were provide N 
with an opportunity to aggress against an anger instigator following, respectively 3 
exposure to the behavior of an aggressive model, a nonaggressive model, first an 

aggressive and then a nonaggressive model, or first a nonaggressive and then a: 

aggressive model, Subjects in a fifth (control) group attacked their tormentor in the 

absence of prior exposure to any social models. Results strongly support the main 


experimental hypothesis. In addition, it was found that the aggression-inh'bitin: 


The results of a relatively large number of 
recent experiments (e.g., Bandura, Ross, & 
Ross, 1963a, 1963b; Baron, 1971c; Baron & 
Kepner, 1970; Hartmann, 1969) have indicated 
that exposure to either filmed or live aggressive 
models is highly effective in raising the level of 
subsequent aggression on the part of human 
observers. Indeed, so powerful do these effects 
appear to be, that they have been reported 
even under conditions where subjects express 
strong disapproval of the model's actions 
(Bandura et al., 1963b), indicate a relatively 
low level of attraction toward this individual 
(Baron & Kepner, 1970), or anticipate im- 
mediate retaliation from the victim of their 
attacks (Baron, 1971c). Thus, the powerful 
influence of aggressive models in eliciting 
increased violence on the part of both child and 
adult observers has been well documented. 

In contrast, however, little attention has 
been directed to the question of how the 
potentially dangerous influence of such indivi- 
duals may be counteracted or reduced. Indeed, 


1 This research was supported by Grant GS-3230 
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influence of a nonaggressive model was enhanced when observe 
strained behavior before, rather than after, witnessing the violent acts 
aggressive model. These findings are discussed in terms of their implicati 
prevention and control of collective violence in naturalistic social settings, — 


En 


'rature- 


very of 


a careful search of the available 
reveals that only one factor, the dci 
negative consequences to the mode! for his 
aggressive acts, has been found to be efective 
in this regard (e.g., Bandura, 1965; k osekrans 
& Hartup, 1967). In view of the major role im 
the elicitation of tragic instances of collective 
violence often assigned to aggressive models 
(see, eg., Lieberson & Silverman, 1965; 
Momboisse, 1967), the current lack of infor- 
mation concerning this topic is somewhat 
disconcerting. Thus, the present study was 
designed to investigate the effectiveness of an 
additional means for reducing the influence of 
an aggressive model upon the subsequent 
behavior of adult observers, that of exposing 
such individuals to the restrained actions of 
one or more nonaggressive models. 

The view that such procedures would often 
be effective in counteracting the influence of an 
aggressive model is suggested by two distinct 
lines of evidence. First, informal observation 
indicates that aggressive models are not always 
successful in converting tense situations into 
overtly violent ones, and that in many caseS; 
this is due to the presence on the scene of 
individuals who both demonstrate and urge 
restraint (i.e, nonaggressive models). Second, 
the results of several recent experiments Con- 
ducted with young children (e.g, Allen & 
Liebert, 1969; Bandura, Grusec, & Menlove 
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1967; Mc sns & Liebert, 1968) suggest that 
the influc f one social model on the be- 
havior of vers may be markedly reduced 
by their s ent exposure to a second model 
who acts distinctly different manner. 
Considere: ther, these observations and 
findings su: «st that the presence of nonaggres- 
sive mode! y often serve to counteract the 
negative ‘nce of an aggressive model. 

In ord investigate this possibility in a 
systematic inner, the subjects in the present 
experime: re first angered through insult 
(eg., Geen, 1968) and then provided with an 
opportunity to aggress against their tormentor 
by means c: electric shock (Buss, 1961). Before 
aggressiny, however, subjects in four different 
experimer groups witnessed, respectively, 
the behavior of an aggressive model, a non- 
aggressive vcocel, first an aggressive and then 
a nonaggressive model, or first a nonaggressive 
and then ggressive model. Subjects in a 
fifth (control) condition attacked the anger 
instigator in the absence of prior exposure to 
any social models. 

On the basis of previous research concerned 
with the inlluence of aggressive and nonaggres- 
sive models on the subsequent behavior of 
adult observers (e.g., Baron, 1971c; Baron & 


Kepner, 1970; Wheeler & Caggiula, 1966; 
Wheeler & Smith, 1967), it was predicted that 
the highest and lowest levels of aggression 
would be shown by subjects exposed, respec- 
tively, to only an aggressive or nonaggressive 
model, while subjects in the no-model control 
condition would evidence an intermediate level 
of violence. In addition, it was also predicted 
that the presence of a nonaggressive model in 
the two multiple-model groups would be 
effective in reducing the influence of the 
aggressive model (ie., subjects in these two 
groups would show a lower level of aggression 
against the victim than subjects exposed to 
Only an aggressive model). However, in the 
absence of previous evidence, no formal pre- 
dictions concerning the possible influence of 
order of presentation of the two models in 
these conditions were offered. 


A METHOD 
Subjects 


y undergraduate males enrolled in sections of 
mentary psychology at the University of South 
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Carolina participated in the study. Students took part 
in the experiment in order to satisfy a course 
requirement. 


Apparatus 


The apparatus consisted of a modified Buss “aggres- 
sion machine" (Buss, 1961), lists of 4075-60; Glaze 
association value nonsense syllables, a Lafayette stop 
clock (Model 20225 A), and an intercom system. 

The aggression machine was identical to apparatus 
employed in previous experiments (Baron, 1971a, 
1971b, 1971c) and contained 10 switches which could 
be employed by the subjects to deliver electric shocks 
of varying intensities to another individual on occasions 
when he made “errors” in a learning task. The Lafayette 
stop clock was employed to measure the duration of 
shocks administered by subjects to the learner. The 
materials to be mastered by this individual were lists 
of 40%-60% Glaze association value nonsense syllables, 


and were presented to the learner over the intercom AM 


system. 


Procedure 


When the subjects arrived for the experiment, they 
found two confederates already present in the waiting 
room. Shortly thereafter, a third confederate arrived 
and engaged the subject and other individuals in con- 
versation. During the course of this brief discussion, he 
singled out the subject for several insulting comments 
(concerning, e.g., his choice of psychology instructor) 
and generally acted in a very obnoxious manner. 
Immediately after the completion of these remarks, the 
experimenter arrived and conducted all four individuals 
to the first experimental room, where she explained that 
the study was concerned with the effects of punishment 
on learning. She further indicated that in order to 
investigate this problem, three of the individuals pre- 
sent would play the role of teacher, while the remaining 
person would play the role of learner. The teachers 
would have the task of instructing the learner in certain 
verbal materials, rewarding him for correct answers and 
punishing him for errors with electric shock. After 
completing this brief description of the experiment, she 
asked if any of the individuals present would like to 
volunteer to play the role of learner. At this point, the 
third confederate (the one who had acted in an ob- 
noxious manner in the waiting room) offered his 
services, remarking as he did so that the other subjects 
(i.e, the real subject and the other two confederates) 
probably lacked the courage needed to play this role. 
Thus, for the second time, he acted in an insulting and 
obnoxious manner. The experimenter then conducted 
this individual to the next room, where she appeared 
to attach shock electrodes to his arms and presented 
him with detailed instructions for his task in the experi- 
ment. At the conclusion of these instructions, the con- 
federate expressed strong doubts concerning the 
ability of the subject and two remaining confederates 
to perform their role as teachers adequately, and dis- 
paraged their intelligence and maturity in a very 
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insulting manner. Thus, for the third and final time, be 
delivered strong verbal abuse to the subject’ 
Following this final outburst by the confederate, the 


each time the learner made an error, a red light on the 
aggression machine would be illuminated. This would 
serve as a signal for the teacher whose turn it was to 
first read the correct answer to this person, and then to 
shock him by pressing one of the 10 shock buttons on 
the experimental apparatus, The experimenter care- 
fully explained that the higher the number of the 
button chosen by the teacher, the stronger the shock to 
the learner, and that the longer any button was de- 
pressed, the longer the shock to this person would last. 
3 shocks from Buttons 4 


and 5 on the aggression to all three individuals 
in order to demonstrate that the us was in fact 
operational.!* These shocks were delivered by means of 


the roles of first and third teacher, respectively? Thus, 
c pee ee a a witnessed the behavior 

ore being permitted to aggress 
against the anger instigator. In the aggressive pie 
condition, the first teacher (i.e, one of the confed- 
erates) behaved in a highly aggressive manner, pushing 
Buttons 8, 9, or 10 on each occasion when the learner 
made an error, while in the nonaggressive model con- 
dition the same individual acted in a nonaggressive 
manner, choosing instead, Buttons 1, 2, or 3 for his 


? The present three-pronged verbal assault against 
the subject was employed because interest in the present 
study was focused on determining whether the presence 
of a nonaggressive model would be effective in counter- 
acting the influence of an aggressive model under just 
those conditions that would tend to maximize the 
aggression-eliciting effects of the latter. 

* The confederates were not actually shocked at this 
time, but merely feigned their reactions to these 

noxious stimuli. 

* In both of these groups and in all remaining condi- 
tions, the two confederates played all appropriate roles 
an equal number of times. 
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attacks against the learner. In both 
third teacher (Le., the second confeders 
moderately aggressive manner when it 
instruct the learner (Le., employed Bu: 
when this individual made errors). How 
subject had already completed his turn 
to the actions of this teacher, it is clea 
havior could have no influence upon the 
of the present experiment. In the two í 
model) conditions, the subject was cho 
the third teacher. Thus, in these groups 
behavior of two models before bein: 
aggress himself. In the aggressive—non 
condition, the first teacher acted in a hi 
manner (see above), and the second, in « 
manner (see above), while in the mona; 


egres- 


sive model condition, the order of these : is re- 
versed. Thus, the only difference betw: e two 
multiple-model groups lay in the relative r with 
which the two types of models were presec'c! to the 
subjects. 

In all five conditions, the learner appear ake a 
total of 20 errors while being instructed b) acher, 
but seemed to master the materials presen: um by 
each as the session progressed. At the con: of the 
third teacher's turn, the experimenter a red a 
brief postexperimental questionnaire to « e indi- 
viduals. Items on this questionnaire : d the 
subjects to rate (on 7-point scales) their ree of 
anger toward the insulting learner and the gth of 
the shocks delivered to this individual by of the 


two other teachers. 


RESULTS 
Shock Intensity 


The mean intensity of shocks delivered to the 
learner by the subjects in each of the five 
treatment conditions is presented in the first 
column of Table 1. Inspection of these means 
suggests that as expected, the strength of the 
subjects’ attacks against the learner was 
strongly influenced by the modeling cues to 
which they had been exposed. This suggestion 
is supported by the fact that an analysis of 
variance performed on these data yielded & 
highly significant treatments main effect 
(F = 16.17, df = 4/55, p < .001). j 

In order to examine more closely the in- 
fluence of the various modeling conditions on 
the subjects’ behavior, the means shown !n 
Table 1 were compared by Duncan multiple- 
range test, These comparisons indicated that 
as predicted, the influence of an aggressive 
model was substantially reduced by the pre 
sence of a nonaggressive model. More specifi- 
cally, it was found that the means of the non- 
aggressive-aggressive model and aggressive- 
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xlel groups were significantly 
ıt of the aggressive model 
í 05). Indeed, further examina- 
reveals that the mean of 
aggressive model group is 
smaller than that of the no- 
dition. Thus, it appears that 
the presence of a nonaggres- 
completely effective in counter- 
nce of the aggressive model. 
d more unexpected finding is 
nean of the nonaggressive 
el condition is significantly 
t of the aggressive-nonaggres- 
iition, This difference suggests 
cressive model may be more 
nteracting the influence of an 
cl when the subjects witness his 
avior prior to, rather than 
ure to the more yiolent acts of 
»odel. 


an duration of shocks delivered to the 
e subjects in each of the five 
ditions is presented in the second 
‘ble 1. Examination of these means 
the strength of the subjects' 
ainst the learner was strongly in- 
y the modeling cues to which they 
exposed. This suggestion is supported 
fact that an analysis of variance per- 
on these data yielded a significant 
ments main effect (F = 5.85, df = 4/55, 
-001). 
AS was the case with the intensity data, the 
$ shown in column 2 of Table 1 were 
ed by Duncan multiple-range test. 
€ comparisons indicated that as predicted, 
influence of an aggressive model was re- 
by the presence of a restrained, non- 
ive model. More specifically, as found 
€ intensity measure, the means for the 
multiple-model groups (ie., the aggres- 
honaggressive and nonaggressive-aggres- 
model conditions) were significantly 
than that of the aggressive model 
ition (p < .05). Further examination of 
1 reveals that the mean of the nonaggres- 
essive model group is actually slightly 
than that of the no-model control 
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TABLE 1 


Maran Suock Derksary axo Dusamos 
vox Five TaraTMENT 
L———————2222 22 € -———— — 


| Dependent thence 
Treatment condition mer | 
Stock pue 
| intensity | {need 
Nonaggressive model ETE S 
Nonaggressive-aggressive model | 343.5 | 49,4 
No model 38,» | Shu 
Agxressive-nonagxressive model | 5.24 | Jh 
Aggressive model 712, | LOR 


Note.—Means within each column that do 
mon subscript differ significantly at the 0S 
multiple-tange test. 


i 
g | 
Pe 

i 


condition. Thus, as was the case with the 
intensity data, the presence of a nonaggressive 
model appeared to be effective in completely 
counteracting the influence of the aggressive 
model in this condition. Finally, and again 
consistent with the findings for the intensity 
measure, the mean of the nonaggressive- 
aggressive model group is smaller than that of 
the aggressive-nonaggressive model group 
(p < 10). Thus, further evidence was ob- 
tained that the effectiveness of a nonaggressive 
model in counteracting the influence of an 
aggressive model is strongly affected by 
whether the exposure of subjects to his be- 
havior precedes or follows exposure to that of 
the aggressive model. 


Postexperimental Questionnaire 


The first item on the postexperimental ques- 
tionnaire required the subjects to rate their 
degree of anger toward the insulting learner. 
The mean rating on this item supplied by the 
subjects was 2.83, thus indicating that they 
experienced only moderate anger arousal as a 
result of the verbal abuse that they endured. 
However, it should be noted that the subjects 
made this rating only after attacking the anger 
instigator. Thus, their arousal at the time of 
their aggressive acts may have been con- 
siderably higher than is suggested by the data 
for this item. 

Two additional items on the questionnaire 
required the subjects to rate the strength of the 
shocks delivered to the learner by the other 
teachers (i.e., the two confederates). The mean 
ratings assigned to these shocks by the subjects 
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when the confederates played the roles of 
aggressive and nonaggressive models were, 
6.00 and 2.77 respectively. Thus, it appears 
that as intended, the subjects viewed the 
confederates as acting in a highly aggressive 
manner when they played the role of aggressive 
model, but as demonstrating relatively non- 
aggressive actions when they played the role of 
nonaggressive model. 


DISCUSSION 


The results of the present experiment offer 
strong support for the hypothesis that the 
aggression-eliciting influence of an aggressive 
model may be substantially reduced by the 
presence on the scene of an individual who acts 
in a restrained, nonaggressive manner (ie., a 
nonaggressive model). More specifically, it was 
found that the subjects in the two groups who 
witnessed the behavior of both an aggressive 
and a nonaggressive model before being per- 
mitted to aggress against an anger instigator 
delivered significantly less severe attacks 
against this individual than the subjects ex- 
posed to only an aggressive model. Indeed, so 
effective was the presence of a nonaggressive 
model in counteracting the influence of the 
aggressive model that under conditions where 
the subjects viewed the restrained actions of 
this individual first (i.e, the nonaggressive- 
aggressive model group), their subsequent 
attacks against the victim were actually of 
slightly lower overall magnitude than those of 
subjects not exposed to any social models 
before aggressing (i.e, the no-model control 
condition). In view of recent findings which 
suggest that the aggression-eliciting influence 
of aggressive models remains undiminished 
even under conditions where subjects express 
a low level of attraction toward the model 
(Baron & Kepner, 1970), or are threatened 
with severe punishment for emulating his 
actions (Baron, 1971c), these results are some- 
what comforting. More importantly, they are 
suggestive of a possible means for reducing 
the potentially dangerous aggression-disinhibit- 
ing effects of modeled violence upon the sub- 
sequent behavior of adult observers (see, e.g., 
Wheeler & Caggiula, 1966). 

It should be noted, however, that a degree of 
ambiguity is introduced into the interpretation 
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of these findings by the fact that 
between the final provocation to 
and their opportunity to atta 
instigator was not held constant 
tions. In view of the possibility 
arousal may dissipate relatively 
the passage of time, it is appar: 
factor may have exerted some 
fluence on the obtained patte: 
However, the fact that the inter 
was approximately equal in the g 
the highest (ie., the aggressiv 
dition) and lowest (ie. the 
model condition) levels of aggressio 
that this variable was not a major determi 
of the subjects’ performance in th 
research. Nevertheless, the possible 
of this factor should be investiga 
experiments by varying the inte 
provocation to observers and thei 
one or more social models in 
manner. 

The finding that the level of 
shown by the subjects in the no: 
aggressive model group was signilic: 
than that shown by the subjects in 
sive-nonaggressive model condition i 
siderable interest. In view of the fact the 
total pattern of modeling cues observed by) 
subjects in these two groups was identical, it 
might reasonably have been expected that they 
would show highly similar levels of aggression 
against the victim. The fact that they 
suggests quite strongly that in situations where 
observers are exposed successively to two OF 
more social models who evidence distinctly 
different patterns of behavior, the order of 
presentation of the models may sometimes be 
an extremely important factor. In particular, 
the present study provides some evidence for 
the view that subject’s behavior may be most 
markedly influenced by the actions of the 
first model they observe. Although additional 
research conducted with subject populations 
and modeled behaviors other than those of the 
present experiment is necessary to establish 
the reliability and generality of this phe 
nomenon, one possible explanation for the 
occurrence of such primacy effects may be 
tentatively suggested. Specifically, it may e| 
the case that modeling cues presented in close 
juxtaposition to the arousal of transient 
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motives (e zer arousal) are more effective 
in influenc 'bservers! subsequent behavior 
with resp: such motives than modeling 
ques present t later points in time.* Such 
elects m: e, in turn, from a stronger 
tendency « part of observers to pay close 
and careí ention to the model’s acts 
immediate! >r motive arousal than at 
other timc 

Relating uggestion to the procedures of 
the present iy, it is apparent that the first 
model obs by the subjects in the two 
multiple-: conditions was presented at a 
point in ti mewhat closer to the arousal of 
anger tha: second model. Thus, it would 
be expected that the actions of this individual 
would be e effective in influencing the 
observers' avior than that of the second 
model for reasons described above. The 
validity of these suggestions may be readily 
investigated in future research by varying both 
the interval between motive arousal and pre- 
sentation of ‘ie model and the relevance of the 
model’s actions to the motive in question in a 
systematic manner. 

Regardless of the outcome of such research, 


however, the implications of the findings of 
the present study for the prevention and 
control of collective violence in naturalistic 
social settings seem clear. Specifically, it ap- 
Pears that the presence of a nonaggressive 
model may be highly effective in counteracting 
the strong aggression-eliciting influence of 
aggressive models, but only under conditions 
Where observers (i.e, potential rioters) are 
exposed to the restrained behavior of the 
former before witnessing the violent actions of 
the latter. Thus, insofar as the control of 
human violence is concerned, it appears that 
as suggested by the old saw, "an ounce of 
Prevention” may indeed be worth “a pound 
of cure," 


Tg author is indebted to the anonymous reviewer 
of this article for this interesting suggestion. 


— 


REFERENCES 


ALLEN, M. K., & Lesert, R, M. Effects of live and 
Symbolic deviant-modeling cues on adoption of a pre- 
viously learned standard. Journal of Personality 
Social Psychology, 1969, 11, 253-260. 

ANDURA, A. Influence of models’ reinforcement con- 
lingencies on the acquisition of imitative responses. 


245 


Journal of Personality and Social Psychology, 1965, 1, 
589-595. 

BANDURA, A., Grusec, J., & Mentove, F. Some social 
determinants of self-monitoring reinforcement sys- 
stems. Journal of Personality and Social Psychology, 
1967, 5, 449-455. 

BANDURA, A., Ross, D., & Ross, S. A. Imitation of film- 
mediated aggressive models. Journal of Abnormal and 
Social Psychology, 1963, 66, 3-11. (a) 

Banpura, A., Ross, D., & Ross, S. A. Vicarious rein- 
forcement and imitative learning. Journal of Ab- 
normal and Social Psychology, 1963, 67, 601-607. (b) 

Baron, R. A. Aggression as a function of magnitude of 
victim’s pain cues, level of prior anger arousal, and 
aggressor-victim similarity. Journal of Personality and 

Social Psychology, 1971, 18, 48-54. (a) 

BARON, R. A. Exposure to an aggressive model and 
apparent probability of retaliation from the victim 
as determinants of adult aggressive behavior. Journal 
of Experimental Social Psychology, 1971, 7, 343- 
355. (b) 

Bano, R. A. Magnitude of victim's pain cues and level 
of prior anger arousal as determinants of adult aggres- 
sive behavior. Journal of Personality and Social 
Psychology, 1971, 17, 236-243. (c) 

Baron, R. A., & Kepner, C. R. Model's behavior and 
attraction toward the model as determinants of adult 
aggressive behavior. Journal of Personality and Social 
Psychology, 1970, 14, 335-344. 

Buss, A. H. The psychology of aggression. New York: 
Wiley, 1961. 

Gren, R. G. Effects of frustration, attack, and prior 
training in aggressiveness upon aggressive behavior. 
Journal of Personality and Social Psychology, 1968, 
9, 316-321. 

HARTMANN, D. P. Influence of symbolically modeled 
instrumental aggression and pain cues on aggressive 
behavior. Journal of Personality and Social Psycho- 
logy, 1969, 11, 280-288. 

LIEBERSON, S., & SILVERMAN, A. The precipitants and 
underlying conditions of race riots. American Socio- 
logical Review, 1965, 30, 887-898. 

McMarws, M. J, & LIEBERT, R. M. Influence of dis- 
crepancies between successively modeled self-reward 
criteria on the adoption of a self-imposed standard. 
Journal of Personality and Social Psychology, 1968, 
8, 166-171. 

Mommorsse, R. M. Riols, revolts, and insurrections. 
Springfield, Ill.: Charles C Thomas, 1967. 

ROSEKRANS, M. A., & HARTUP, W. W. Imitative in- 
fluences of consistent and inconsistent response con- 
sequences to a model on aggressive behavior in 
children. Journal of Personality and Social Psychology, 
1967, 7, 429-434. 

WHEELER, L., & Caccrura, A. R. The contagion of 
aggression. Journal of Ex perimental Social Psychology, 
1966, 2, 1-10. 

WHEELER, L., & Smita, S. Censure of the model in the 
contagion of aggression. Journal of Personality and 
Social Psychology, 1967, 6, 03-98. 


(Received January 11, 1971) 


core Sa RER mato 


PRIOR ATTITUDE AND LANGUAGE INTENSITY 
PREDICTORS,OF MESSAGE STYLE AND ATTITUI 
CHANGE FOLLOWING COUNTERATTITUDINAI 
ADVOCACY 
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Three experiments were conducted to test the following hypotheses: (s) Pers 
who encode à counterattitudinal message will use language of significantly low 
intensity than persons who encode a belief-congruent message; and (b) giv 
counterattitudinal advocacy by people with similar prior attitudes, the magnitu 
of attitude change varies directly with the intensity of the counterattitudinal asser 
were used in all three experiments. Languay 
intensity scores were computed by summing the values of words selected to co 
plete partially constructed messages. In the first two experimen’ 
scale anchored by words assigned weigh! 


tion. The same general 


measured by a 7-interval 


Present explanations of attitude change 
following counterattitudinal advocacy require 
the communicator to actually encode a counter- 
attitudinal message. Incentive theorists (Janis 
& Gilmore, 1965) contend that attitude change 
results from the communicator's biased scan- 
ning of arguments, from his motivation to 
think up good belief-discrepant arguments, and 
to suppress belief-congruent ones. Early dis- 
sonance theorizing (Festinger & Carlsmith, 
1959) attributes attitude change to dissonance 
produced by the discrepancy between private 
belief and public behavior, while later dis- 
sonance interpretations (Aronson, 1968) stress 
the discrepancy produced by one's perceptions 
of himself as a moral, truthful person, on the 
one hand, and on the other, his knowledge that 
he may have induced audience acceptance of an 
undesirable or untruthful position. Obviously, 
failure to encode counterattitudinally would 
enable the communicator to avoid or to resolve 
either of these sources of dissonance without 
engaging in attitude change. Finally, Bem’s 


1 Requests for reprints should be sent to Michael 
Burgoon, who is now at the Department of S Com- 
munication, San Fernando Valley State College, North- 
ridge, California 91324. 
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ts, attitudes wer 


;olds t 


(1965) self-perception viewpoin' 
attitudes are inferred from behavi 
attitude change results from utteri: 


attitudinal statements in the pr 
external cues associated with t 
Thus, for all three theoretical 1 
assumption of a counterattitudinal stance & 
necessary condition for subsequent attitud 
change. 
Despite the importance of actual counter 
attitudinal encoding behavior, there is go 
reason to believe that individuals often acce] 
counterattitudinal tasks and still devise beh 
congruent messages. Such inconsistency D 
tween assent and action may occur conscious 
because of the aversive consequences associate 
with public endorsement of a coun 
tudinal viewpoint, or unconsciously, be 
the notorious unreliability and caprici 
of human perception. Moreover, emp 
support for a tendency to avoid 
attitudinal encoding is provided in a 
study by Burgoon (1970). Subjects were 
to judge the position advocated in samp 
belief-congruent and belief-discrepant mi 
drawn from prior research by Miller | 
Bodaken (1969), Belief-congruent comm 


us problems; for 96%" of 
e subjects correctly judged 
s position. In sharp con- 
í the belief-discrepant mes- 
ied correctly. Thirty-one 
ved as adopting a neutral 
while 20% were judged as 
sed to the counterattitudinal 
cator had agreed to adopt. 
of the belief-discrepant com- 
t actually produce clearly 


Mentifiabl. rattitudinal messages. 

While a : of prior studies (Carlsmith, 
Collins, & | cich, 1966; Elms & Janis, 
1965; Festi v Carlsmith, 1959; Janis & 
Gilmore, 1% nder, Cooper, & Jones, 1967 ; 
Rosenberg have obtained measures of 
ertain dir: ns of counterattitudinal mes- 
ages, no at t has been made to study the 


Litude on intensity of message 
camine the possible relation- 
nsity of counterattitudinal en- 
quent magnitude of attitude 
ent study reports three experi- 
with these problems. Experiment 


I, a syster extension of Burgoon's prior 
Study, tested the following hypothesis: Persons 
Who encode » counterattitudinal message will 


We language of significantly lower intensity 
than will persons who encode a belief-congruent 
Message. 
The second and third experiments investi- 
fated the likelihood of a direct relationship 
Ween the intensity of a counterattitudinal 
Message and subsequent attitude change. 
Highly intense encoding satisfies McGuire's 
(1964) conditions for strong commitment: (a) 
3 private decision to encode the message, (b) a 
Public pronouncement of the position, (c) 
“tive participation, and (d) external commit- 
ment (a person telling another that he holds a 
Particular belief). As indicated above, satis- 
n of Condition d is necessary to all 
Coretical explanations of attitude change 
Wing counterattitudinal advocacy, and 
low-intense or neutral encoding would enable 
Jé Communicator to circumvent this condi- 
- Furthermore, highly intense encoding 
Should make it difficult for the communicator 
deny the counterattitudinal nature of the 
message, while low-intense or neutral encoding 
| behavior enhances the possibility of perceiving 
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that a belief. discrepant position was not taken. 
Because of these considerations, 


II and IIL tested the Hewnde sane s 4 
Given counterattitudinal advocacy by 

with similar prior attitudes, the of 
attitude change varies directly with the inten- 
sity of the counterattitudinal message. 


Exrnement I 
Method 


Subjects were 45 undergraduate students in an intros 
ductory business writing class at Michigan State 
University, Subjects were assigned randomly to either 
a beliefdiscrepant or a belief-congruent encoding 


used in a project for another class. Subjects’ attitudes 
toward five controversial issues were assessed, using 


result in a maximally persuasive message. 

the belief-congruent condition were given only words 
that would permit construction of a message opposing 
mandatory on-campus living, while the subjects in the 
belief-discrepant condition were given words that would 
result in a message favoring mandatory on-campus 


The list of words distributed were those for which 
Jones and Thurstone (1955) have developed scale 
values (Table 1). Their procedures yield scale values 
that allow estimates of the magnitude of the word's 
t con- 


Analysis of the two lists revealed that the mean inten- 
sity scores did not differ significantly. 

Subjects were told that the completed messages 
would be shown to high school students who had 
indicated previously that they had no opinion toward 
mandatory on-campus residency. It was also empha- 
sized that the subjects should create the most persuasive 
message possible, with an eye toward changing the 
attitudes of the high school students. Finally, the 


? Since the attitude measure was only used in this 
experiment to determine pretest attitudes, a detailed 
discussion of the instrument is deferred until Experi- 
ment II. 
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TABLE 1 
Wetontep Scare VaLCES or EXPERIMENTAL WORDS 
AND PuxASES 
, * wr s | Belief-congruent condition 
Word or phrase | Value Word ot phrase | Value 
Best of all | 6.15 | Fair | 348 
Favorite 4.68 | Acceptable 43 
Like extremely 4.16 | Only fair 431 
Like intensely | 4.08 | Like slightly 9 
Excellent 3.71 | Neut 02 
Wonderful 3.31 | Like not so well — 30 
Strongly like | 2.96 | Likenotsomuch | — Al 
Like very much 2.90 | Dislike tly — .59 
Mighty fine 2.88 | Mildly dislike |- .74 
ramus d 2.86 | Not i — 83 
Mighty favorable | 2.81 | Don't care for it —1.10 
Like very 2.60 | Dislike moderately | — 1.20 
Very good 2.36 | Poor —1.35 
Like quite a bit 2.31 | Dislike —1.58 
Enjoy 2.21 | Don’t like —L8I 
1.96 | Bad —2.02 
Good 1.91 | Highly unfavorable | —2.16 
Welcome 1.77 | Strongly dislike —2.37 
Tasty 1.76 | Dislike very much | —2.49 
Pleasing 1.58 | Very bad —2.53 
Like fairly well 1.51 | Terrible —3.09 
Like 1.38 | Dislike intensely 331 
Like moderately | 1.12 the —3.76 
ox ru Dislike extremely A 
verage E Despise —6. 
Mildly like .85 


experimenter instructed the subjects to sign the mes- 
sages so that they could be given credit for their ideas. 
All of these procedures were intended to heighten the 
subjects’ perceptions that they were the actual sources 
of the messages and to increase their involvement in 
the task, a step considered particularly essential since 
the subjects received partially prepared messages. 
Postexperimental comments of the subjects regarding 
attribution were mixed; some indicated that they 
attributed the messages to themselves, while others 
disassociated themselves from the messages. All sub- 
jects reported that they were at least moderately in- 
volved in the task. 

Following construction of the messages, all subjects 
completed a questionnaire aimed at tapping their 
perceptions of certain dimensions of the task. The 
actual purpose of the study was then divulged to them. 


Results 


To test the effect of the inductions, the 
subjects were asked to rate the difficulty of the 
task, the attitude of the high school audience, 
and the importance of their contribution to the 
persuasion project. Results indicated that the 
task was perceived as moderately difficult, 
moderately important, and the high school 
audience was perceived as uncommitted on 
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mandatory on-campus residen: » differ 
ences between the two groups we - found on 
these measures. Thus, the expe: il induc 
tions were generally successful 

Table 2 contains the observed s of the 
summated scale values of the vords or 


phrases chosen to complete the m 
subjects in the belief-congrue: 
discrepant conditions. In additi: 
reports the possible score valus 
groups, given the selection o! 
intense or the 10 most intense 
respective lists. 

It can be seen that as predicte 
in the belief-congruent conditio 
more intense message than did 
discrepant subjects, although 
noted that both groups encoded : 
ately intense level. The test of 
between the means was significan! 
df = 40, p < .05); thus, the findi 
Hypothesis 1. 

As Table 2 indicates, if the sub 
belief-congruent condition had sel: 
most intense words, the mean inte 
would have been 34.39. By contr 
most intense words in the belicf-ciscrepant 
condition yielded a mean of 37.66. Conse 
quently, any differences in the lists provide for 
a conservative test of Hypothesis !, since the 
potential mean intensity rating is higher for 
the group who was expected to and who 
actually encoded less intense messages. 


e by the 
| belief 
he table 
the two 
10 least 

on the 


subjects 
coded a 
belief- 
ould be 
moder- 
ierences 
(( = 2:33; 
s support 


ts in the 
ed the 10 
ty rating 
t, the 10 


EXPERIMENT II 
Method 


Subjects were 106 undergraduates in introductory 
business writing classes at Michigan State Universit): 
Subjects were assigned randomly to one of three 
experimental conditions or a cortrol condition. _ 

The same pretest procedures described in Experiment 
I were used. Because all subjects were opposed 1^ 


TABLE 2 


ACTUAL MEANS AND PossrbLE SumaaTep MEANS 
Given SELECTION OF THE 10 Least INTENSE ANP 
10 Most INTENSE Worns: EXPERIMENT I 


Daun e| OERSET [00 mont | actual 
Belief-discrepant | — 9.50 3166 | 224 
Belief-congruent 9.90 34.39 25. 


*$ «.05. 


oe 
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TABLE 3 2 


UES AND MEANS OF WORDS SELECTED POR USE IX THE EXPERIMENTAL CONDITIONS 1 


ty Moderate intensity | Low Intensity 


| Value Word ot phrase Value Word of phrase 

416 | Very good 236 | Poor 
4.05 Like quite a bit 231 Like moderately 
3.71 Enjoy 2.21 OK 
3.31 ^ Dd unfavorable 2.16 
3.09 20 | Mildly like 
2.96 Preferred 196 | Not mes 
290 Ge | 191 Fair 

k elcome 1.77 table 
2.86 ng | 1,58 Only jair 
2.81 Like fairly well iS | Like slightly 
2.53 e 138 | 
3.2 18 


pus living for undergraduates, that — intensity, the choices bad and highly unfatorable; and 
ected as the experimental issue. The low-intensity, the choices not pleasing and poor. 

| to measure attitudes toward the issue After the subjects had responded to the 10 arguments, 
scale, with the anchors excellent, good, they were instructed to reproduce the entire message in — — 
had, and terrible (Jones & Thurstone, their own handwriting so it could be shown to the high — — 
nal scale assigns negative values to school students. This addition to the inductions em- 
:chors, a constant of 4.1 was added ployed in Experiment I was intended to further heighten s 
alue, producing a 7-interval scale witha the subjects’ perceptions that the message originated 

(least favorable toward mandatory on- from them and to increase their involvement in the 


ifferential-type scale using polar adjectives After the messages were completed, all subjects com- 
d highly on Osgood’s evaluative dimension. pleted the postencoding questionnaire. The purpose of 
encoding conditions were employed: high, the experiment was then revealed to them. 

and low intensity, All subjects encoded 
titudinally, that is, in favor of mandatory on- Resulis 
i Table 3 contains the word lists, Sfr zs ! > 
and Thurstone’s scaled terms, for As in Experiment I, the subjects viewed the 


onditions, as well as the ater an task as moderately difficult and moderately 
[lo st. The three lists differ significan important. Moreover, as intended, they per- 
59478, dj = 230). The control condition com- A that the high school erate bonis 


retest st measures, but did not : 
Ee and pur uncommitted on the issue of mandatory on- 


inductions used to elicit counterattitudinal campus residency. 

ing from the subjects, while similar to those used Analysis of the pretest attitude scores 
periment I, differed in several respects. Subjects (Table 4) revealed that the four group means 
iven an envelope containing 10 strips of paper, did not differ significantly. Hypothesis 2 pre- 


ith an argument typed on it. Each argument ~, 4 ade : 
least one blank space, and the subjects were dicted a direct relationship between magni- 
those words that matched the intensity level tude of attitude change and the intensity of 


they were assigned. Following is an example of the counterattitudinal message. The overall 
ty manipulation: amount of change for the four conditions is 
ument: I think any high school student would found in Table 4, along with the summary of 
DA Aaka mistake if he did the analysis of variance of the change scores. 

not expose himself to the many ideas, Scheffé’s test was used to determine differences 
attitudes, and values one finds in uni- — between each experimental group and the 
versity housing. control group. The findings indicated that both 
argument, high-intensity-condition subjects the high- and moderate-intensity conditions 
the choices very bad and terrible; moderate- differed significantly from the control condi- 
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TABLE 4 


ATTITUDE Scones, AMOUNT or CHANGE, AND ANALYSIS 
ov VaxiANCE Or ATrrTUDE CHANGE SCORES FOR 
Sunyects IN EXPERIMENTAL AND CONTROL 
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TABLE 5 
Artrrupe Scores, Amount or Cx D ANALT- 
sis OF VARIANCE OF ATTITUDE ( Scores 


FOR SunJECTS IN EXPERIMENTA 


Groups: EXPERIMENT II | mo GROUPS: E. 
p DSi = Pretest | Posttest | Change Condition Pretest F € henge 
intensi 140 | 390 | 250, Highintensity 794 i 8.62. 

ote NR 147 | 358 | Zila Moderate intensity 7.24 i 2.94, 
Lowintensity —— 127 | 26 1.374 Low intensity 6.71 à 
Control 1.63 191 Bie Control | 6.79 

ES of variance SS 4f | MS F Source of variance | SS 
A——————ÀÁ—— —— — ——Ó——— 
Between treatments | 58.85 | 3 | 19.62 | 7.75* Betweentreatments | 699,93 | 3 1 
Within treatments | 257.73 | 102 | 2.53 | Within treatments | 1486.80 | 63 0 

Total | 316.58) 105 | | Total 


Note, — Means with a common subscript do not differ sig- 
ay = the ,05 level. 
? s 


tion. By contrast, the low-intensity and control 
conditions did not differ significantly. Finally, 
although the high- and moderate-intensity 
conditions did not differ, there was a significant 
difference in attitude change between the sub- 
jects in the high- and low-intensity conditions. 

As a further test of Hypothesis 2, a product- 
moment correlation was calculated on the 
relationship between attitude change and 
language intensity scores. This analysis yielded 
a significant r of .27. Since eta (.28) was not 
significantly greater than the obtained correla- 
tion, the observed relationship is a linear one. 
Thus, taken as a whole, the results provide 
strong support for Hypothesis 2. 


EXPERIMENT III 
Method. 


Experiment III sought to replicate the results of 
Experiment II, using a different issue and subjects 
drawn from a somewhat different parent population. 

Subjects were 67 students ina Cooperative engineering 
program at General Motors Institute, Flint, Michigan. 

All students attending the Institute are training for 
careers with General Motors. The Institute describes 
its students as relatively conservative, both economi- 
cally and politically, 

Pretesting procedures were identical to those em- 
ployed in Experiment II with two exceptions: first, the 
time between the pretest and actual experiment was 9 
weeks, rather than 2; second, instead of using 7-interval 
scales, anchored by words selected from Jones and 
Thurstone’s (1955) list, four T-interval, semantic- 
differential-type scales, bounded by the polar adjectives 

good-bad, valuable-worthless, fair-unfair, and pleasant— 


Note.—Means with a common subseri differ sig 


nificantly at the .05 level. 
*p «.01. 


was scored 
imally um 
raft defer- 
maximally 


unpleasant, were employed.’ Since cac! 
on a 1-7 basis, a score of 4 reflect 
favorable attitude toward eliminatior 
ments, while a score of 28 represen! 
favorable attitude. 


All experimental subjects wrote erattitudinal 


messages supporting elimination of draf! rments for 
college students, while the control group « ;:leted only 
the attitude measures. As in Experiin t, the sub- 
jects were asked to complete a partialiy constructed 
message by choosing between words of ü- :ppropriate 

treatment. 


intensity for their particular experimer 
Subjects were told the messages would bc used të 
convince students attending a youth conference that à 
resolution should be passed urging Congress to abolish 
student draft deferments. J 
After the messages were completed, the subject 
completed the postencoding question , and the 
purpose of the experiment was then divulged to them. 


Results 


As in the previous two experiments, the 
subjects viewed the task as moderately difficult 
and moderately important. Subjects also ač 
cepted the experimental induction, which indi- 
cated that the high school students were Un- 
committed on the issue of student deferment: 

Analysis of the pretest attitude scores (Table | 
5) revealed that the four group means did not 
differ significantly. Examination of the mean? 
indicate that they are ordered identically t° 
the means obtained Experiment II, an ordering | 
consistent with Hypothesis 2. Scheffé’s test rè- 
to avoid | 


attitude 
ment 


? A semantic-differential-type scale was used 
any possible bias resulting from anchoring the 
scale with some of the same words used in the treat 
word lists, 


high-intensity-condition mean 
antly from the control mean, 
rate- and low-intensity means 
significantly from the control. 
oderate-intensity mean to differ 

| can be attributed partially to 
nple employed in Experiment 
lly to the greater within-group 
ved in the latter experiment. 
dividual comparisons indicated 
'titude change for the high- 
bjects was significantly greater 
nge for the moderate- or low- 
ts. Thus, taken as a whole, the 
zly support the hypothesized 
ship between intensity of coun- 


did not < 
Failure of 
from the 
the small: 
MI and p 
yariance 
Finally, t! 
that mea 
intensity 
than the 
intensity su! 
findings s 
direct rel 


lerattitudi.;! encoding and subsequent atti- 
tude chang 
DISCUSSION 

While the support obtained for the two 
hypotheses of this study is not without theo- 
Tetical significance, its main import seems to 
derive from its methodological implications for 
researchers interested in the variables in- 
fluencing attitude change following counter- 
attitudinal advocacy, Broadly speaking, the 
findings indicate that the outcomes of much 


: on counterattitudinal advocacy 
may have been unintentionally influenced by 
factors associated with the actual encoding of 
founterattitudinal messages. 

Certainly, the finding that prior attitude 
affects the intensity of message encoding is 
neither surprising nor novel. But as indicated 
above, it does underscore an important con- 
| sideration for research carried out within 
the counterattitudinal advocacy paradigm. 
Granted, the dissonance position allows for the 
Possibility that self-persuasion may occur 
following public commitment to encode a 
Counterattitudinal communication, but preced- 
ing actual message encoding (Hunt & Miller, 
1968; Janis & Gilmore, 1965; Miller & 

Graw, 1969). Still, in cases where attitude 
Measures are obtained after presentation of 
Sounterattitudinal messages, all major com- 
Peting theoretical positions require the un- 
“quivocal assumption of a counterattitudinal 
Stance, The results of this study, as well as 
those of earlier research by Burgoon (1970), 
‘Indicate that persons placed in a belief- 
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discrepant encoding situation may engage in 
message behaviors that consciously or un- 
consciously circumvent such a clear public 
endorsement. 

It may be argued that even though indivi- 
duals in the belief-discrepant condition encoded 
significantly less intense messages than persons 
in the belief-congruent group, their counter- 
attitudinal messages were still of moderately 
high intensity. Several procedural charac- 
teristics of Experiment I suggest caution in 
generalizing this finding to other research deal- 
ing with counterattitudinal advocacy. First, 
the induction used pointed out in several places 
the importance of constructing the most 
persuasive message possible. Although it is 
admittedly difficult to compare the emphasis 
placed on persuasiveness in the present study 
with that of prior research, examination of the 
procedures used in several previous studies 
suggests that less stress was placed on encoding 
maximally persuasive messages. Second, al- 
though the induction sought to heighten per- 
ceptions that the subjects were the actual 
sources of the message, use of a partially 
prepared message apparently permitted some 
of them to minimize their involvement in the 
persuasive attempt. In fact, it was the tendency 
of some Experiment I subjects to disassociate 
themselves from their messages that led to the 
procedural changes in Experiments IT and III, 
changes requiring the subjects to write out the 
messages once they had made their word 
choices. To the extent that the subjects in 
Experiment I were able to engage in such 
disassociative behaviors, their willingness to 
encode rather intense counterattitudinal mes- 
sages would be enhanced. 

That intensity of counterattitudinal encod- 
ing is related to subsequent attitude change is 
attested to by the strong support for the second 
hypothesis of the study. The findings indicate 
that attitude change is related directly to the 
intensity of the counterattitudinal message. 
When individuals encoded a belief-discrepant 
message of relatively low intensity, their 
attitude change scores did not differ signifi- 
cantly from those of persons in a no-encoding 
control group. By contrast, high-intensity 
encoding producing a typical self-persuasion 
effect in both experiments, while moderate- 
intensity encoding produced the effect in one 
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experiment. 

number of prior self- may 
well have resulted from the failure of the 
subjects to assume a counterattitudinal posture 
of sufficient intensity to activate attitude 
change processes. ; 

‘The procedures used in Experiments H and 
ITI provide a promising approach toward con- 
trolling for this possibility in future research. 
By limiting the individual's choice to words of 
a particular intensity, the researcher can ensure 
that the communicator’s message will truly be 


Zi 1965), it Jor gus "em desirable to 
result in of 

ent pa ga messages 
The potential utility of the present pro- 
cedures is further underscored by the fact that 
the subjects in Experiments II and III ap- 
parently perceived a strong linkage between 
themselves, as message sources, and the 


responsibility for persuading high school stu- 
dents to form favorable attitudes toward 


ments; in fact, their resistance seemed high. 
Apparently, addition of the requirement that 
persons write out the entire message elimi- 
nated the tendency to disassociate that was 
observed in Experiment I. 


member of the pair was admittedly semanti- 
cally or syntactically awkward. It should be 
noted, however, that this was not a serious 


limitation in the present stu 
members of the pair were of the 
Presently, research is being co 
at expanding the number of y 
for use in future studies. 

To be sure, elimination of th: 
problems dealt with in this stud 
step toward resolving the many : 
procedural issues faced by tho 
research dealing with countera! 
cacy. Even so, control of inten 
attitudinal encoding and prod 
sages of equal length should en 
to interpret future self-persuasi 
equivocally. we 
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CHANGES IN CONCEPTIONS OF THE GOOD LIFE BY A! 


RICAN 


COLLEGE STUDENTS FROM 1950 TO 1970° 


CHARLES MORRIS 


Department of Philosophy, 
University of Florida 


AND 


LINWOOD SMALL * 


Counseling Center, University rida 


To investigate changes in American college students’ conceptions of the good 
life over the last 20 years, a partial replication of Morris’s study of 195 
was done. Morris’s Ways to Live scale was used to gather data from ! 
American colleges. It was found that, while students’ conceptions have change: 


little, today’s youth are less tradition oriented than the 195! 
are about equally divided as to whether present American soci: 


udents an’ 
iy is satisfac- 


tory. It was concluded that the two groups of students differ not so much ; 
their ideals as in their ways of implementing these ideals. 4 


Varieties of Human Value (Morris, 1956) 
contained a study of conceptions of the good 
life held by American college students (as 
well as students from Norway, Canada, 
China, Japan, and India), Though this book 
was published in 1956, much of the data 
used in it was collected around 1950, and 
in what follows this will be referred to as 
the 1950 study. The present study contrasts 
the conceptions of the good life of American 
college students in 1970 with those obtained 
around 1950. The same instrument for 
securing data—the Ways to Live scale 
(Morris, 1956, pp. 15-19; Morris & Jones, 
1955, pp. 524-526)—was used in both 
studies, 

The 1970 data were secured mainly from 
17 colleges spread widely over the United 
States. For the most part, the students 
sampled were in undergraduate philosophy 
courses. Hence, the sample is perhaps biased 
toward students in the humanities, psychol- 
ogy, and the social sciences and against stu- 
dents in the natural sciences and the profes- 
sions. The extent of this bias—if any—is not 
examined here. The important point for this 
comparison of 1970 and 1950 students is that 
the 1970 material was derived in the same 
way as the 1950 material. The data in both 
cases come mainly from students in under- 
graduate philosophy courses; many of the 


1 The computations on the 1970 data for Table 1 
were done by Frans Thyssen. 

2 Requests for reprints should be sent to Linwood 
Small, University Counseling Center, 311 Little 
Hall, University of Florida, Gainesville, Florida 
32601. 


colleges sampled were the s: he geo- 
graphical spread of these college : similar; 
and the Ways to Live scale was ‘se instru- 
ment used in both cases (in neither study 
did the students sign their s to the 
document). 

The 17 schools forming the » ain sample 
were as follows: De Anza College. University 
of California at Santa Barbara, Rice Univer- 
sity, Tulane University, Memphis State Uni- 
versity, University of New Mexic Southern 
Illinois University, University of South Caro- 
lina, Florida Atlantic University, Saint 
Petersburg Junior College, University of 

, Howard 


Michigan, University of Minneso j 
University, City College of the City Uni- 
versity of New York; Kingsborough College 
of the City University of New York, Uni- 
versity of Bridgeport, State University of 
New York College at Geneseo. In addition, 
data were obtained from the University of 
Calfornia at Davis and Brown University, - 
but these data were not used in many of the 
calculations.* 


3 Corresponding to the order of these institutions, 
the following persons helped in collecting the data: 
Marguerite Foster; Alexander Sesonske and Jon 
Wheatley: Konstantin Kolenda; Andrew J. Reck, 
Louise Roberts, Shannon Du Bose and nna 
Pancheri; Michael Osborn; Archie J. Bahm; 
Morris Eames and Elizabeth Eames; James ^7 
Oliver and Roger J. Sullivan; Denis O'Donovan) 
Daniel J. Hamilton; Abraham Kaplan and Donall 
M. Michael; Herbert Feigl; Winston K. McAllister: 
Abraham Edel, K. D. Irani, David Weissman, Bt 
Arthur Collins: Elsa A. Whalley; Howard Parson 
and Edward D'Angelo; Kenneth Wilkerson; Wille! 
P. Dukes and Jarvis Bastian; Richard Schm 
The help of these persons is gratefully acknowledge 
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TABLE 1 
^ RATINGS AND PERCENTAGES or First Cxorces OF THE Ways BY AMERICAN MALE 
AND FEMALE STUDENTS IN 1950 anp 1970 
Male | aoe j a SO 
Way M rating | % M rating E 

1970% | 1950* | 1970 19504 1970* 1950! 1970 
1 3.79 15.36 540 si6 | 3:33 | i525 | 397 
2 3.52 1.62 4.35 2.61 3,30 1.78 3.48 
3 4.37 | 6.06 6.15 4.43 4.55 4.55 10.63 
4 417 | 4.11 7.95 3.81 4.35 3.37 10.22 
5 3.82 | 4.72 480 | 436 3.71 5.74 471 
6 4.43 | 10.31 10.36 | 4.77 3.87 4.36 3.88 
7 5.39 | 36.19 32.73- l 5.84 5.53 47.13 39.89 
8 440 | 10.11 10.36 | 4.42 443 8.32 9,41 
9 952] 40 1.35 3.07 3.78 59 3.07 
10 | 3.46 3.84 3.30 | 3.43 3.01 | 1.78 E: 
11 3.27 88 2.85 2.74 3.55 | A0 3.68 
12 | 1 442 5.46 855 | 406 424 | £% 5.52 
13 2.24 94 182 | _ 2,53 250 | 2.38 2.04 

Note.— ices me. corresponding 1950 and 1970 means greater than .16 are highly significant (p < .005, two-tailed 

test). dard error of the mean difference = .0556.) 


(Avera 


Morris, 1956). 


Morris, 1956). 
Morris, 1956). 


Morris, 1956.) 


Basic DATA: MEANS AND 
First CHOICES 


The Ways to Live scale consists of 13 
possible conceptions of the good life. The 
respondent is asked to indicate how much 
he or she personally would like to live each 
of these alternatives. This is indicated by 
putting a numeral from 7 to 1 in the margin 
beside each alternative: 7 indicates that it 
is liked very much; 6, it is liked quite a lot; 
5, it is liked slightly; 4, the respondent is 
indifferent to it; 3, it is disliked slightly; 
2, it is disliked quite a lot; and 1, it is dis- 
liked very much, That these response cate- 
gories can for statistical purposes be treated 
as integers has been shown (Morris, 1956, 
Pp. 25-27). 

The content of each of the 13 alternative 
Ways to live is given in extended descriptive 
Paragraphs. Their emphases are suggested 
in a rough way in the following list of 
expressions: 

Io 1: preserve the best that man has attained. 
ay 2: cultivate independence of persons and 

W. things. 

Ve oe show sympathetic concern for others. 

Y 4: experience festivity and solitude in alterna- 

tion, 


act and enjoy life through group partici- 
pation. 

constantly master changing conditions. 
integrate action, enjoyment, and contem- 


plation. 

live with wholesome, carefree enjoyment. 
wait in quiet receptivity. 

control the self stoically. 

meditate on the inner life. 

chance adventuresome deeds. 

obey the cosmic purposes. 


How do 1970 American students compare 
to the 1950 students in their likings of these 
13 alternatives? The main basis of compari- 
son will be the average (or mean) liking of 
the ways in 1970 and 1950. A second basis is 
the comparison of the percentages of first 
choices of the ways from the rank orders (in 
addition to rating each way, the respondent 
is asked to rank them in order of preference). 

Concerning the material of Table 1, a num- 
ber of comments may be made: 

1. The most striking change in the ratings 
of the ways in 1970—indeed, the only really 
striking change—is the marked drop in the 
liking for Way 1 by both men and women, 
both in means and in first choices. Way 1 
stresses conservatism in the finest sense: 
“preserve the best that man has attained.” 


Way 5: 


Way 6: 
Way 7: 


Way 8: 
Way 9: 
Way 10: 
Way 11: 
Way 12: 
Way 13: 


— — °° | 7 adÀ diss 4 d - 2 
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In the Ways to Live scale, it is described as 
follows: 


In this “design for living" the individual actively 
participates in the social life of his community, not 
to change it primarily, but to understand, appreciate, 
and preserve the best that man has attained. Exces- 
sive desires should be avoided and moderation 
sought. One wants the good things of life but in an 
orderly way. Life is to bave clarity, balance, refine- 
ment, control. Vulgarity, great enthusiasm, irra- 
tional behavior, impatience, indulgence are to be 
avoided. Friendship is to be esteemed but not easy 
intimacy with many people. Life is to have disci- 
pline, intelligibility, good manners, predictability. 
Social changes are to be made slowly and carefully, 
so that what has been achieved in human culture is 
not lost. The individual should be active physically 
and socially, but not in a hectic or radical way. 
Restraint and intelligence should give order to an 
active life [Morris, 1956, p. 15; Morris & Jones, 
1955, p. 524]. 


The drop in mean rating of almost 1.50 
indicates a clear passage from active liking to 
slight disliking. Moreover, this lower rating 
for Way 1 is found in all the colleges sam- 
pled, and so is nationwide. Taken as a whole, 
the 1970 students are decidedly less tradition 
oriented than were the 1950 students. 

Since a mean rating may hide many dií- 
ferences, it is pertinent to examine, for the 
1970 group, the percentage of entries in each 
of the seven rating categories that were ap- 
plied to Way 1. 

Table 2 shows that in spite of the drop in 
mean liking from 1950, the percentage of 
likings (Categories 7, 6, and 5) is not much 
less than the percentage of dislikings (Cate- 
gories 3, 2, and 1). This holds for both men 
and women, although the difference is some- 
what greater in the women. Hence, there are 
in 1970 almost as many students who like 
the tradition orientation of Way 1 as those 
who dislike it. This fact must not be forgot- 


TABLE 2 


PERCENTAGES OF RATINGS IN EacH RATING 
CATEGORY FOR Way 1 (1970) 


Rating category 


T | 6 ees 

Mens 3.25 | 15.25 | 26.42 | 8.05 | 20.76 | 20. 
Women> | 2.92 | 12.84 | 25.87 18.48 | 2607 | 1100 
an = 703. 

by = 514. 
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TABLE 3 
Mare Factor Scores For 1950 
- 


Factor | 1950 
A 8.91 
B 8.67 
Cc 5.58 
D 5.18 
E | 8.27 


ten in recognizing that the liki 
has dropped noticeably from 1 

2. The changes in the other 
much less marked, but the 
change are of considerable int: 
of mean ratings, the 1970 stude: 
and women) are lower than the ! 
on Ways 5, 6, 7, and 10, and higher 
2, 3, 4, 9, and 11, (There is litt! 
Ways 8, 12, and 13.) 

The direction of these differences mat 
clarified by introducing the notion 
factor (or dimension). In a factor a: 
the 1950 Ways to Live data, five value 
tors were isolated (Morris, 1956, pp. 3l 
Morris & Jones, 1955). These were nam 


Factor A; social restraint and selí-conirol ( 
sented most strongly by Ways 1 and 10); Fi 
B: enjoyment and progress in action (repres 
most strongly by Ways 5 and 12); Factor Cr 
drawal and self-sufficiency (represented most stron 
by Ways 2 and 11); Factor D: receptivity and 
pathetic concern (represented most strongly by Wi 
9 and 13); Factor E: self-indulgence (or 

enjoyment) (represented most strongly by Ways 
and 8). 4 


If we add the means of Ways 1 and 1 
get a score for Factor A, and proceed 
larly for the other factors, we get, for 
subjects, the results displayed in Table - 
The 1970 group is noticeably lower tham tl 
1950 group on Factor A (social restraint 
self-control) and slightly lower on Facto! 
(enjoyment and progress through acti 
but noticeably higher on Factor C (wit 
drawal and self-sufficiency) and somi 
higher on Factor D (receptivity and syn 
thetic concern) and Factor E (self 
gence or sensuous enjoyment). 


* This is only an approximate—and rather crud 
procedure for obtaining factor scores. See 
(1956, pp. 42-43). 
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These es are not large, and the gen- 
eral ord: factor strength is much the 
same—\ , C, D in 1950 and E, B, A, 
C, D in (With very minor exceptions, 
the same true for women students.) 
Neverthe the direction of change is in- 
teresting 1970 students are less tradi- 
tion ori: than the 1950 students and 
more ori to self than to society. There is 
some in in liking for detachment and 
receptivit ille there are few first choices 
for ways in these characteristics (Ways 
2, 9, 11, , there are more than in 1950. 
The 197: lents as a whole are active and 
in-the-wo versons, but they are more fa- 
vorable t: oyment, receptivity, and detach- 
ment thar were the 1950 students. 

3. Way 7 is by far the best liked of the 13 
alternatives, in 1970 as well as in 1950, by 
both men -nd women, and throughout the 
colleges sed. The mean liking and the 
percentage first choices are somewhat less 
in 1970 for both men and women, but the 
drop is sm!!. In 1950 and 1970, Way 7 is 


y more by women than by men. 


nient of Way 7 is as follows: 
We should s: various times and in various ways ac- 
cept something from all other paths of life, but give 
no one our exclusive allegiance. At one moment one 
of them is the more appropriate; at another mo- 


Ment another is the most appropriate. Life should 
contain enjoyment and action and contemplation in 
about equal amounts. When either is carried to ex- 
tremes we lose something important for our life. So 
We must cultivate flexibility, admit diversity in our- 
selves, accept the tension which this diversity pro- 
duces, find a place for detachment in the midst of 
enjoyment and activity. The goal of life is found in 
the dynamic integration of enjoyment, action, and 
Contemplation, and so in the dynamic interaction of 
the various paths of life. One should use all of them 
in building a life, and no one alone [Morris, 1956, p. 
17; Morris & Jones, 1955, pp. 524-525]. 


Way 7 stresses the integration of action, en- 
Joyment, and contemplation, and thus favors 
à many-sided self, For this reason it does not 
have a high place, positive or negative, on any 
of the five value factors; this is not true of 
any of the other ways. In this sense, Way 7 
15a sixth dimension. A life according to Way 
Twould be a dynamic life, rich in diversity. It 
Would have a place for dominance, for recep- 
tivity and enjoyment, and for detachment— 


= ~~ + 
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but for all of these and for none of them to a 
degree which usurped the other.* 

The high favor shown to Way 7 in both 
1950 and 1970 is as striking as the drop in 
favor for Way 1 from 1950 to 1970, Some 
comments on the relation of these two im- 
portant facts are furnished by the material 
of the next section, 


STUDENT EVALUATION OF CONTEMPORARY 
AMERICAN SOCIETY 


One of the questions asked of students on 
the Ways to Live document is the following: 
“Do you feel that our society is satisfactory 
for the development and expression of your 
own particular abilities and wishes? Why or 
why not?” Those who administered the docu- 
ment to students were asked to encourage the 
students to respond to this question. Ninety 
percent of the students did. (The question 
was also asked of the 1950 students, but no 
numerical count of the replies was made. 
Hence, on this matter no direct comparison of 
the 1950 and 1970 groups can be made.) 

The replies of the 1970 group to this ques- 
tion were divided into three groups: the Yes 
group, for those who answered the question in 
a primarily positive way; the No group, for 
those who answered the question in a pri- 
marily negative way; and the Yes-No group, 
for those who gave both positive and negative 
answers in a roughly equal way.’ 

The Yes-No group contained only 8% of 
the replies; the Yes group, 43%; and the No 
group, 49%. There were practically no dif- 
ferences between men and women in these 
groups. Thus, about half of the 1970 students 
evaluated American society as unsatisfactory, 
while slightly less than half thought American 
society is satisfactory.” 


5For further discussion of Way 7, see Morris 
(1956, pp. 48-50, 104-105) and Butt and Signori 
(1965). 

ê This grouping, and the analysis of the groups 
which follows, was done by Linwood Small. 

7 The ratio between positive and negative replies 
differed considerably in various colleges, the Yeses 
ranging from 19% to 63% and the Nos from 31% 
to 69%. There were also a few wide differences 
between men and women. The problems involved 
in the interpretation of such differences are too 
complicated for analysis here. 


TABLE 4 
Mean RariNos or THE Ways ny Mates Juporso 


AMERICAN Socrety as SATISFACTORY 
OR UNSATISFACTORY 


Rating 
Way = — T 
| Ves group No group” 
1 4.05 332 
2 | 334 372 
3 4.04 4.65 
4 3.82 430 
5 3.83 3.50 
6 3.60 425 
1 5.16 | 524 
8 4.23 148 
9 | 322 3.70 
10 3.64 3.39 
"n 2.86 331 
2 4.90 420 
13 247 239 


i * Calculated from a random sample of 100 males from the 
Yes group. 

LI Calculated from a random sample of 100 males from the 
No group. 

Out of the nine main categories into which 
the favorable replies of the Yes group were 
classified, the top four reasons, and the per- 
centages of the frequency with which they 
were given, are as follows: 

1. This society offers a sufficient number of oppor- 
tunities for a person to pursue his goals: 27%. 

2. A person can achieve his goals in this society if 
he has a high level of conviction and drive: 20%. 
3. This society gives individuals a lot of freedom 
for self-expression, both in action and in speech: 


16%. 

4. The cultural diversity of this society permits many 
different life-styles: 9%. 

Thus, the replies in these four categories con- 
stituted 72% of the favorable replies. None 
of the percentages in the other five categories 
exceeded 8%. 

Out of the 12 categories into which the 
unfavorable replies of the No group were 
classified, the top 3 categories and the per- 
centages of the frequencies with which they 
were utilized are as follows: 

1. This society demands conformity to its narrow 
norms: 34%, 

2. This society focuses too much on material goods: 
9%. 

3. The government compels conformity to restrictive 
laws (such as the draft): 8%. 

The replies in these 3 categories constituted 
51% of the negative replies. In none of the 
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other 9 categories was the perc e of 
plies larger than 6%. 

It is interesting to ask how much the Yi 
and No groups differ in their r: of 
various ways. Table 4 permits a parison. 
The male No group is markedly er than 
the male Yes group on Ways 1 12, and 
somewhat lower on Ways 5, 6, and 10; but 
they are markedly higher on W 3, 4, 9, 
and 11, and somewhat higher on Ways 2, 7) 
8, and 13. Put in terms of factors, the 
group is lower than the Yes gro: Factor. 
A (social restraint and self-contr id Fac 
tor B (enjoyment and progress .ction), 
and higher on Factor C (withdra id self- 
sufficiency), Factor D (receptivity «ad sym 
pathetic concern), and Factor E f-indu 
gence or sensuous enjoyment). T^, the Ni 
group differs from the Yes group in much thi 
same way as the total 1970 sam differs 


from the 1950 sample. 
The differences between the nale 


group and the female Yes group not 

sharp as in the case of the men: ‘ic womel 
Nos (as in the case of the men) :'e lower 
than the Yeses on Factor A and higher 0 
Factor C, but the differences are very small 


on the other factors. 


Two SUPPLEMENTARY Torics 


Two interesting, though somewhat minor, 
topics will be discussed in this section. One 
concerns the relation of the lower 1970 ratini 
for Way 1 (“preserve the best that man 
attained”) to the large number of statem 
affirming the unsatisfactoriness of presen! 
American society. It is not surprising that 
relation is positive, but the closeness of 
relation does occasion surprise. 

A chi-square test for association showet 
that there was a strong tendency (p < .001) 
for the men who judged American society 
satisfactory to rate Way 1 high (Categorie 
7, 6, or 5), and for those who judged tht 
society unsatisfactory to rate Way 1 lo 
(Categories 3, 2, or 1). Way 1 was rated $ 
(indifferent) by 8% of the men, and 10% 9! 
the men gave no reply to the question abou! 
the satisfactoriness of American society. 
relation in the case of the women stude 
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was about the same as with the men, though 
pot quite se. If we assume that the 
same patter id for the 1950 students (al- 
though we e no data on this), then the 
1950 stud: with their much higher rating 
of Way 1, have regarded the American 
society of time as considerably more 
satisfactory ‘yan did the 1970 students, 

The se topic of this section centers 
around th t that the Ways to Live docu- 
ment inch he following instructions: “If 
you can fo:niulate a way to live you would 
like better n any of the 13 alternatives, 
please do Only about one-fourth of the 
students wrote something in reply to this 
request. 

Very fev, if any, genuine alternatives to 
the 13 was- were discernible in the replies 
(this was 5:-o true for the 1950 study). Some 
persons re ed objections to single facets 
within the ways; many offered combinations 
of features contained in the 13 ways. 

As regards the content of these replies, 
there was little stress on the attitude of domi- 
nance, some on the attitude of detachment, 
and a great ical on the attitude of receptivity. 
The accent was often on the attainment of a 
many-sided self (which matches the high 


favor shown to Way 7), but many students 


- emphasized that this goal of self-development 


is to be combined with a concern for other 
Persons. While there was much stress on the 
social in the sense of the interpersonal, there 
Was very little reference to “society” as such. 
And there were very few comments of a tra- 
ditionally religious nature. The mood was 
Predominantly humanistic rather than tech- 
nological, or political, or scientifically im- 
Personal. The favoring of “freedom to live as 
one wants" was strong, as was the favoring of 
Pleasure, enjoyment, and sensibility. At the 
Same time, there was little favor shown to 
"selfish" self-centeredness. Self-development 
ina framework of interpersonal sensitivity— 


this is perhaps the main impression which 


these replies convey. 

Thus, with respect to the two topics of this 
Section, the informal comments of the stu- 
dents supported the previous analysis based 
on the statistical treatment of the ratings of 

e ways, 
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SUMMARY AND PARTIAL INTERPRETATION 


The main result of this study is the finding 
that in terms of the Ways to Live document, 
the 1970 American college students did not 
differ greatly from the 1950 students (except 
for Way 1) in the order of liking or in the 
mean ratings of the 13 ways," but that, never- 
theless, almost half of those sampled in 1970 
did not believe that present-day American 
society is favorable for realizing the life they 
would like to live. 

The 1970 students do show somewhat more 
favor to detachment (Factor C) and recep- 
tivity (Factor D), and this should not be 
ignored. However, the changes are not large, 
and these attitudes are still dominated (as in 
1950) by stresses on enjoyment (Factor E) 
and action (Factor B). The way favored 
highest by the 1970 group (and the 1950 
group)—Way 7—shows that the students con- 
tinue strongly to desire the cultivation of a 
many-sided self. 

That the 1970 changes in mean ratings and 
in order of liking of the ways are not greater 
may occasion some wonder. But two consid- 
erations are relevant here. The first is that 
the 1950 students themselves no longer con- 
formed to the traditional stereotypes of 
American youth—a point that was expressly 
noted in Varieties of Human Value (Morris, 
1956, pp. 48-50). The main personal life 
ideals which appear in the 1970 group were 
already present in 1950. 

The second consideration is that the Ways 
to Live scale (as used in the present study) 
dealt only with conceived values (values as- 
serted verbally) and not with operative values 
(values manifested in conduct) and hence did 
not consider the extent to which a person 
actually lives the way he states he would like 
to liveJ? So the evident changes in college 


8 The differences were greater in a comparable 
study of Japanese students (Misumi & Ando, 1964). 

9“The situation as revealed in the analysis as a 
whole is complex, and none of the common generali- 
zations concerning American youth does it full 
justice [Morris, 1956, p. 50].” 

10The Ways to Live scale, with changes in 
instructions, can be used for other purposes, such 
as to allow a person to describe how he (or others) 
thinks he lives, or to find out how he thinks he 
ought to live. See Morris (1956) for relevant 
material. 


260 


students in 1970 are largely (though not en- 
tirely) differences in what they do (ie., in 
the values which are operative in their con- 
duct). 

Drawing on the evidence presented above, 
then, we might advance the following specu- 
lations about contemporary youth: Those 
students who believe American society as it 
exists is not favorable to the realization of 
their ideals of how to live are reacting in 
different ways: some by deciding just to 
seek their own pleasures, some by trying a 
more detached and meditative life, some by 
attempting in various ways to change the 
structure of the dominant American society— 
hence, the heightened search by various con- 
temporary students for the pleasures of the 
moment, the concern with various forms of 
Oriental meditation, the development of com- 
munal or revolutionary student groups.! 

Even if these developments take overt 
manifestation in only a portion of American 
youth (it is to be remembered that in the 
1970 sample nearly half of the students ex- 
pressed satisfaction with present American 
society), they are important. They are move- 
ments in the direction of an even more plural- 
istic America, The 1950 study (Morris, 1956) 

1 Statistical data are compatible with this differ- 
entiation. Both in 1950 and 1970, pairwise correla- 
tions of the ratings of Ways 2, 6, and 8 from 
random samples of the replies are almost zero. 
Hence, some persons might try to live just one of 


these ways, while others might combine the emphases 
of these ways according to personal taste. 
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gave evidence that the ground long been 
present for possible specializations of groups 
stressing personal pleasure, detachment, emo- 
tional dependence, and integrated self-devel- 
opment. Current movement: em from 
attempts to give concrete form io ‘hese possi- 
bilities. The dissatisfaction. o tain con- 
temporary students is not a bas epudiation 
of person-centered democratic ideals, but a 
dissatisfaction with the present social forms 
which they have attained. 

The differences between the and the 
1970 students are primarily, though not en- 
tirely, differences in strategy operative 
rather than in conceived values. 1: is as if the 
1970 students are trying actively in various 
ways to live by certain basic ides!s in a sod- 
ety which many of them feel represents à 


betrayal of these ideals. 
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select old alternatives, and the individual 
should experience some increase in the de- 
sirability of the old alternatives. The increase 
in desirability would be directly proportional 
to the prior attractiveness of these alternatives, 
so that the most attractive of the old alterna- 
tives should show the greatest increase in 
desirability, while the least attractive should 
show the least increase in desirability. 

The amount of threat to the individual's 
freedoms to select any of the old alternatives 
should be a direct function of the strength 
of his tendency to select the new alternative. 
Hence, the more attractive the new alterna- 
tive, the greater should be the magnitude of 
reactance and consequent increase in desira- 
bility of the old alternatives. Of course, once 
the new alternative has become so attractive 
that there is no longer any question of its 
being selected, additional increases in its 
attractiveness would not add to the threat 
to the freedoms to choose the old alternatives. 

In practice, other processes beside reactance 
may be involved when a new, relatively 
attractive alternative is introduced. For one 
thing, the individual's frame of reference in 
judging the attractiveness of alternatives may 
shift (Sherif & Hovland, 1961) such that the 
old alternatives appear somewhat less attrac- 
tive and more nearly equal. Specifically, the 
most attractive of the three old alternatives 
may be judged to decrease considerably in 
attractiveness when a new and relatively 
attractive alternative is introduced, while an 
old alternative that is quite low in attractive- 
ness could not be expected to decrease much 
further no matter how attractive the new 
alternative. Because of this possible negative 
shift due to factors unrelated to reactance, 
an appropriate base line for the detection of 
the predicted reactance effects is a low-freedom 
condition. In a low-freedom condition, what- 
ever nonreactance factors are at work should 
be revealed, and any differences from a high- 
freedom condition may be attributed solely to 
reactance. 

The following experiment was designed to 
show that when a person feels free to choose 
any one of a particular set of alternatives, the 
introduction of a new, relatively attractive 
alternative will arouse reactance and a conse- 
quent increase in the desirability of the old 
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alternatives. The design was intended to test 
two hypotheses: (a) When a person is free to 
have any one of several alternatives, the 
introduction of a new, attractive alternative 
arouses reactance and a consequent increase 
in desirability of the old alternatives in direct 


proportion to their prethreat tiveness; 
and (b) when a person is free to have any one 
of several alternatives and a new alternative 
in introduced, the magnitude of reactance and 
consequent increase in desirability of old 
alternatives is directly proportional to the 
attractiveness of the new alternative up ta 


the point at which the new alternative is 


unequivocally chosen. 


METHOD 


To test the hypotheses it was neces to give some 
subjects the impression that they were free to have 
any one of a set of alternatives, and to give other 
subjects the impression that they had little or ne 
freedom to have any of the same set of alternatives 
Because college students who volunteer io participatt 
in a research project in order to satisfy a course require 
ment are generally motivated to cooperate with the 
experimenter, their freedoms to reject the exper 


menter's requests are essentially none nt or of very 
little importance. It is therefore easy to place subjects 
in a condition of low freedom, but relatively difficult 
to place them in a condition of high freedom. To con 
vince the subjects that they had high freedom, Wë 
decided that it would be necessary to give them repeated 
experiences of free choice with the set of alternatives 
Low-freedom subjects were assigned to alternatives 
but in other respects were given equal experience wl 
the alternatives. Foods were used as the alternative 
so that repeated choices on successive days could be 
meaningful to the subjects, and exotic foods were u' 
to avoid strong initia] preferences. Because the freedom 
to have an alternative can be meaningful only to 
extent that the alternative can be discriminated front 
other alternatives, pretesting was done to determine: 
how many alternatives could be discriminated. It was 
found that subjects did not feel confident about their 
selections after 4 successive days of choosing unless 
there were relatively few alternatives, and so the K 
design utilized three alternatives during the prethre 
period. 


Subjects 
s wet 


H ; e x " dent 
Female introductory psychology stu sions 


recruited to sample and rate some items in six ses 
of a consumer research project. Subjects were assign 
randomly to conditions, except for the constraint d 
condition Ns were to be held approximately equal, an É 
altogether, 55 participated. Four subjects Were x a 
nated because they failed to show up for one of the 
successive sessions. 


übjects were run one at a time. When one arrived 
Pher first session, she was met by the female experi- 
inter and seated in a chair in front of a table on which 
placed the three items of exotic food. After making 
introd ; remarks to relax the subject, the 
i ined that the project was concerned 
h c rate items of various kinds, and in this 
articular instance the subject would be rating edibles. 
subject was told she was to make two ratings of 
th item, the first regarding its appearance, the second 
mrding its taste. Taste ratings were obtained merely 
fit the gencral rationale and to justify the selection 
cedure used on following days. It was carefully 
plained that the appearance ratings were to indicate 
ot only how attractive the items were but also how 
üch the subject would like to taste them. The subject 
fas told that she should first carefully inspect the items 
nd rate them on appearance, after which she would 
each item and rate its taste. It was emphasized 
ings should be as accurate as possible regard- 
h or low they were, that the items were 
>d, and that the ratings would have no 
t ng. It had been found in pretesting 
Some subjects felt that they should rate the items 
because they thought the items were to be 
Marketed and to rate them low would ruin the research. 
ny questions the subject had were answered, except 
lat a question about the nature of the foods was put off 
ntil later, and the subject was allowed to proceed with 
lie ratings. Throughout the ratings, the experimenter 
ned her back and busied herself with papers so as to 
Xert as little influence on ratings as possible. 
The food items (Argentinian desserts, namely, quince 
ste, sweet potato paste with chocolate, and sweet 
lato paste with candied fruit) were cut into 1-inch 
sand labeled A, B, and C. They were rated for 
ance in alphabetical order and then tasted and 
ated again in the same order. 
When the ratings were completed, the subject was 
Hieduled to return for 10 minutes on each of the fol- 
Owing 5 days, except for Saturdays and Sundays, 
iid she was asked not to discuss the project with other 
ple so that ratings would not be affected. 


'anipulations 


Freedom. Beginning with the second session, all 
ects were told that each day they would rate all 
aee items for appearance, would then taste one, 
ad would rate that item for taste. To create different 
ees of freedom, the subjects in the high-freedom 
dition were told they would always be able to choose 
h item to taste, and the subjects in the low-freedom 
ion were told they would be assigned an item 
? laste. In reality, the subjects in the low-freedom 


» About the first third of low-freedom subjects were 
med that they were to eat whatever another sub- 
Selected, Because these subjects might have felt 
Part in the experiment to be relatively unim- 
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condition were yoked to the subjects in the high- 
freedom condition, such that if one in the high-freedom 
condition tasted the item she had marked as most 
attractive in appearance the day before, the yoked 
low-freedom subject was assigned the one that she had 
rated most attractive the day before. In other words, 
the yoking was on the basis of whether the previous 
day's highest, middle, or least attractive item was 
tasted, More often than not, high-freedom subjects 
selected the item rated as most attractive, though 
there was some variation from tbat pattern, particu- 
larly on the fourth and fifth days. On the fifth day, for 
example, 247 of the choice subjects tasted an item other 
than the one rated highest in appearance. 

Altractiveness of new item. When the subject came in 
for her sixth and final session, she found a fourth item 
of food on the table, The experimenter explained in an 
offhand fashion that she had been asked to add another 
item to the set of items on which ratings were obtained. 
High-freedom subjects were told that they would be 
able to choose any of the four items, while low-freedom 
subjects were told that they would be assigned to one 
of the four items. The new item was either a small piece 
of pound cake, which pretesting had indicated would be 
moderately attractive to subjects, or a small piece of 
cherry cheesecake, which pretesting had indicated 
would be very attractive to subjects. While our major 
interest was in the rating of attractiveness of the old 
items in the presence of the new alternative, after these 
ratings were completed, the subjects were allowed to 
taste an item (those in the low-freedom condition were 
all assigned to the new item) and rate its taste. Three 
subjects in the high-freedom conditions (two in high 
attraction, one in low) selected an alternative other 
than the new item, but their data were not excluded 
because there would be no way to detect and exclude 
similar subjects from the low-freedom condition. The 
data of these three subjects do not appear different 
from those of other subjects in the freedom conditions. 
Upon completion of the taste rating, the subjects were 
interviewed and then fully informed about the decep- 
tions and true purpose of the study. No evidence of 
suspicion was uncovered in the interviews. 


Measure of Change in Attractiveness 


The ratings of desirability in appearance and taste 
were obtained on separate but identical rating sheets. 
The rating scale consisted of 31 points, labeled at one 
end, “Not at all desirable,” in the middle, “Moderately 
desirable," and at the other end, “Extremely desir- 
able.” Each scale was identified by a letter, A, B, 
etc., and there were five scales on each rating sheet 
in order to minimize surprise when a new item was 
introduced, 


portant, we adopted a suggestion made by Stuart 
Valins of telling low-freedom subjects that they would 
taste the foods according to a predetermined pattern. 
This variation in treatment of low-freedom subjects 
had no noticeable effect on responses, and the subjects 
were combined for the analysis of results. 


TABLE 1 


MEAN INITIAL ATTRACTIVENESS (Day 5) AND CHANGES 
IN ATTRACTIVENESS. WHEN THE NEW 
ALTERNATIVE. Was INTRODUCED 


Old alternative 


Condition 


| Akai Middle | Keeti 
Low freedom | | 
Initial score 13.62 11.54 5.83 
Change —1.96 —71 —.08 
High freedom 
Initial score 12.70 9.15 6.56 
Change T3 —.26 —.74 


Note.—n = 24 and 27 for the low- and high-freedom con- 
ditions, respectively, 


RESULTS 


On the sixth and final day, the subjects 
rated the attractiveness (appearance) of the 
new item of food as well as of the old items. 
These ratings of the new item should have 
been higher for the cherry cheesecake than 
for the pound cake, and the mean ratings did 
differ in the expected direction. However, 
an analysis of variance indicated that the 
rating of the high-attraction alternative (19.2) 
was not reliably different from the rating of 
the low-attraction alternative (17.4), and we 
may tentatively conclude that the manipula- 
tion of attractiveness of the new alternative 
Was not successful. Since analysis of the de- 
pendent variable also failed to show any 
reliable effect from the manipulated attrac- 
tiveness of the new item, the data for high 
and low attraction were collapsed in order to 
simplify the presentation of results. 

Because there was some day-to-day vari- 
ability in which of the old items was rated the 
most or least attractive, each subject's average 
ratings for the first 5 days were used to deter- 
mine which of the old items was to be con- 
sidered the highest, middle and lowest in 
attractiveness when the new item Wi 


s as intro- 
duced.4 The index of change in attractiveness 
was the difference between the appearance 


ratings given on the fifth and sixth days, and 


* Essentially the same Statistical results are obtained 
if the attractiveness scores on Day 5 are used to deter- 
mine which items are most and least attractive prior 
to the introduction of the new alternative. 


= — | Ls s Mee. 
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these mean changes, along 1 
scores of the old items on : ft} 
presented in Table 1. It : 

there were no serious discre 
conditions in regard to the 

fifth day (initial scores), and t 
attractiveness of each of t} 

less than that of the new ite: 

Changes in attractiveness ings 
old items in the low-freedom 
to be negative, as was anti 
the decrease for the initially. 
item was reliable (¢ = 3.48, df 23, p <W 
Thus, where the freedom to hive any of th 
old items was low, the introduction of a ne 
attractive item resulted in « e 
in the rating of the most attra 
items, some apparent tendenc: 
in the moderately attractive 
essentially no change in the ! 
old item. The decrease for the : 


indeterminant amount was presi: 
regression and other artifactual eec 
ever, because these same fact 
sumably at work in the high-f 
tion, the results for the low-freed 
serve as a base line for detection of the elfe 
of high freedom. 

Our first hypothesis predicted that whe 
the new item was introduced, the old item 
would increase in attractiveness more in t 
high-freedom condition than in the low i 
direct proportion to the old item’s preth 
attractiveness, As may be seen in Table] 
the mean changes in Attractiveness of the leas 
and middle attractive old items did not diffe 
in the high-freedom condition compared to 
low. Though the least attractive item d 
creased somewhat more in the high- than y 
the low-freedom condition, the difference d 
not approach statistical significance. On 
other hand, the most attractive old itel 
increased somewhat in attractiveness in E 
high-freedom condition, and this change Wi 
reliably different from its decrease in the 10 
freedom condition (F = 5.53, df = 1/4900 
< .025). Furthermore, a test of the interac io 
between changes for most and least attractiV 


EE» EE 
= —— 


dd items er low- and high-freedom condi- 


tons thro : comparison of difference scores 
revealed mificant effect (F = 7.815, df 
= 1/47, | 01). Quite clearly, in comparison 
to the h eedom condition, there was a 
tendency the high-freedom condition for 
the most active old item to increase in 
attractive relative to the least attractive 
old item. obtained interaction supported 
our first hesis. 

The tes: of the second hypothesis depended 
on the s :ful manipulation of the attrac- 
tiveness of the new item, and as noted pre- 
viously, t^^ attractiveness ratings of the new 
item did liffer reliably between high- and 
low-attra: conditions. Nevertheless, the 
general pattern of rating changes for old items 
Was consisient with the predicted interaction 
between a: unt of freedom and the attractive- 


ness of t} 
attractive 


w item, though no effects due to 
of the new item were reliable. 


The secon’ hypothesis was therefore neither 
Confirmed nor disconfirmed. 
Discussion 


The resulis supported one of our two hy- 
potheses ; nely, that when an attractive, 
new alternative is introduced, the increase in 
attractiveness of old alternatives in a condi- 
tion of high freedom is greater than that in a 
Condition of low freedom in proportion to the 
initial attractiveness of the old alternative. 
No evidence of consequence was found to 
Support the hypothesis that the more attrac- 
lve the new alternative, the greater the 
Mcrease in attractiveness of old alternatives 
When a person is free to select one. This latter 
hypothesis was apparently not well tested, 
Since ratings of the new high- and low-attrac- 
tion items failed to indicate they differed 
Substantially in attractiveness. 

The probable reason for the failure of the 
attractiveness manipulation was that in pre- 
‘esting we compared the low-attraction item 
Pound cake) with the high-attraction item 
cheesecake), but failed to compare it carefully 
With the old items. Then the old items turned 
Mt to be less attractive than we anticipated, 
"^d so the low-attraction new item turned 
Mt to be attractive relative to the old items. 
owever, it should not be overlooked that 
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there were trends in support of the second 
hypothesis even though the manipulation 
was very weak. 

While the pattern of results was largely 
consistent with our predictions, and while 
there do not seem to be obvious alternative 
interpretations, both caution and care are 
warranted in acceptance and use of our inter- 
pretation. Our essential notion holds that when 
there is freedom to select an old alternative, 
the introduction of an attractive, new alter- 
native will tend to result in the old alternative 
becoming more attractive. The results, how- 
ever, failed to show an absolute increase in 
attractiveness even for the most attractive 
of the old alternatives; rather, what they 
showed was a relative increase in attractiveness | 
of an alternative that was initially moderately 
attractive compared to the decrease in at- 
tractiveness of an alternative one was not 
free to choose. The crucial point is that when 
an attractive, new alternative is introduced, 
there seems to be an overall tendency for 
old alternatives to become less attractive, 
and this tendency may be greater for the 
relatively more attractive of the old alter- 
natives. Because this tendency for negative 
change works against the hypothesized effect 
of reactance, the two effects tend to cancel 
each other out. Whether any given alternative 
will increase or decrease in attractiveness as a 
consequence of the introduction of an attrac- 
tive, new alternative should depend upon the 
circumstantial factors that determine the 
magnitude of reactance, that is, the importance 
of the freedom of select the old alternative 
and the amount of threat to it, and upon the 
circumstantial factors that determine the 
magnitude of the negative change tendency. 
Quite clearly, all we have been able to show 
in the present experiment is evidence for the 
predicted effects of reactance relative to 
whatever negative change tendencies there 
may be. A prediction of reactance-produced 
absolule increases in attractiveness of old 
alternatives is not possible in our present state 
of knowledge. 

To the extent that the present data may be 
taken as support for our predictions, a sig- 
nificant aspect of the reactance formulation is 
highlighted. What theoretically gives rise to 
reactance is not so much a reduction in the 
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total amount of freedom one has as it is 
elimination or threatened elimination of 
specific freedoms. This "molecular" con- 
ceptualization of freedom means that when- 
ever the exercise of one freedom excludes the 
exercise of others, some freedoms must neces- 
sarily be threatened or lost, creating reactance 
and a consequent increase in desirability of the 
threatened or lost alternatives. The introduc- 
tion of an attractive, new alternative is simply 
a special case in which all of the prior freedoms 
will be threatened. What from a molar point 
of view appears to be an increase in the indi- 
vidual’s freedom can, in the molecular view, 
be a threat to prior freedoms. 

By the same token, the present study indi- 
cates a possible reason that people may not 
always be enthusiastic about acquiring new 
freedoms. Whenever the introduction of a new 
freedom implies that the individual should 
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give up one or more important freedoms al 
ready held, as would be the case when the new 
alternative is relatively attractive, the mo 
tivation to have the old alternatives will in 
crease and will tend to make a choice difficult 
and uncomfortable. Indeed, just as the old, 
less attractive alternatives are made more 
desirable, so should the new, more attractive 
alternative tend to become less desirable, 
compounding the difficulty of choosing. In 
short, what our analysis suggests is that people 


will frequently find the introduction of new 
freedoms to have unpleasant consequences, 
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and indirect semantic differentials). 


Kelly's psychology of personal constructs 
(Kelly, 1955) deals with the interpretation 
and anticipation of events by each individual 
1 terms of a system of unique bipolar con- 
cts. Within this theoretical framework, 
nister (1963, 1965) has sought to test 
e hypothesis that repeated validation of a 
t of constructs leads to a “tightening” of 
relationships or linkages between these 
nstructs for the individual. Validation is 
fined as the subjective compatibility be- 
n construct prediction and outcome; for 
fxample, a person construed as “honest” is 
und to have told the truth. Bannister had 
S subjects rank successive sets of photos on 
h of a number of constructs, He reported 
increase in the magnitude of intercorrela- 
ns among the constructs when the subjects 
Were given positive feedback (validation) 
garding the supposed accuracy of their 
Tatings, 
— Bannister used constructs with some initial 


linkage. His feedback was delayed and came 
"This study is based on a dissertation submitted 
the University of Wisconsin in partial fulfillment 
XI the requirements for the PhD degree. The author 
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tation chairman, and to Albert R. Marston, for 
JUI advice and counsel in the completion of this 
arch and to Alexander C. Rosen, Lowell H. 
ms, and William E, Broen for their assistance 
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BETWEEN PERSONAL CONSTRUCTS' 
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Within George Kelly’s theoretical framework, an attempt was made to demon- 
strate that the correlation or linkage between a pair of bipolar constructs 
could be influenced directly by appropriate validation. Eighty subjects rated 
100 photos of people on two bipolar constructs. Two groups of 20 subjects 
received feedback validating either a positive or a negative linkage, A third 
received feedback validating constructs, but random with regard to linkage. 
The fourth group received no feedback. The first two procedures yielded 
increases in the validated linkages, while the latter two groups showed no 
changes. Partial evidence suggested that this effect transferred to another set 
of elements (real, known people) and to other means of measurement (direct 


in the form of a score for the rankings on 
each construct. It may have been that the 
subjects interpreted their generally high 
scores on the set of constructs to mean that 
they were accurate in this whole area of con- 
struction; that is, the relationships which 
they posited between the constructs were 
valid. Therefore, the experimental effect may 
not have been produced indirectly by con- 
struct validation as hypothesized, but, in- 
stead, it may have been produced directly 
by what could be called linkage validation. 
The present experiment attempted to differen- 
tiate between the effects of linkage validation 
and construct validation alone. 


METHOD 
Subjects 


The subjects were undergraduates at the Uni- 
versity of California, Los Angeles, who received 
credit in introductory psychology classes for partici- 
pation in psychological experiments. Experimental 
data were collected from 104 subjects in order to 
obtained the desired 80 subjects who met the 
selection criterion explained below. 


Procedure 


On the basis of pretesting, a pair of bipolar con- 
structs were selected which had a near-zero cor- 
relation as applied by a college population. These 


were: constrained versus free and objective versus | 


subjective. Subjects began single individual sessions 


by filling out a role-title list consisting of 20 roles - 


(Kelly, 1955). Following this, they were randomly 
assigned to one of the four groups, the only con- 
straint on this randomization being that Group 3. 
subjects, who were yoked to subjects in Group 1. 
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BLOCKS 
Fic, 1. Corrected linkage scores for photo ratings: 


number of constrained-objective plus free-subjective 
responses minus operant score, 
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or 2, were run after their yoked counterparts. All 
subjects were then given the following instructions: 


The next part of the experiment concerns your 
ability to assess personality from facial appearance. 
I will be showing you a series of photos of people, 
and I will ask you to rate them on two traits, 
or personality dimensions. These dimensions are: 
“constrained” versus “free” and “objective” versus 
“subjective.” [The experimenter handed the subject 
a card with the constructs printed on them.] 
What I want you to do is look at each photo 
and try to predict, in effect, how the person would 
score on a personality test made up of these two 
dimensions. You are to decide which end of each 
dimension the person would be closest to. For 
instance, would the person be more likely to score 
in the “constrained” half, or the "free" half of 
the “constrained” versus “free” dimension? Con- 
sider each photo, and then tell me which ends of 
the two dimensions apply to each person. Any 
questions ? 


The subject was then given a stack of 100 randomly 
ordered 4X 6 inch cards, each with a 1X 1} inch 
photograph mounted on it. The photographs were 
of 50 male and 50 female college students. Each 
subjects ratings were scored by the experimenter for 
number of constrained-objective plus free-subjective 
responses given in each of five blocks of 20 photo- 
graph trials. A score above 10 indicates a greater 
tendency to link constrained with objective and free 
with subjective in applying the constructs to the 
photographs (positive linkage). A score below 10 
indicates a greater tendency to link constrained with 
subjective and free with objective in applying the 
constructs (negative linkage). Subjects were selected 
on the basis of having a tendency to apply the two 
constructs in a relatively independent manner in the 
first or operant block of 20 ratings, that is, 7-13 
constrained-objective plus free-subjective responses, 
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Group 4, a no-validation cont P, continued 
with a same rating procedure ve blocks of 7 
photos. Groups 1, 2, and 3 we ped after the 
operant block and were given t ving instruc 


tions before continuing with the 
of photos: 


ining blocks 


ome feed- 
is of these 
re "right" 
ned accu- 
ning either 


From now on, I am going to ; 
back on the accuracy of your ; 
people's test scores. I will tell y 

if you get both of the dimensi 
rately. If you are inaccurate i 
one or both of the traits, I wil! 
“wrong,” but I will not specify 
was. I want you to try to be a 
sible, to try to learn from the fec 
and to try to use the feedback t 
accuracy. This is a difficult task, 
find that people can do it at a bc 
rate and that they improve : 
feedback even though they may : 
verbalize exactly what it is about t) 
are learning to pick out. 


1 you ar 
the error 


r your 
we usually 
han chance 
ractice and 
e able to 
Photos they 


Actual feedback contingencies were determined 


differently for each of these three group Group 1 
received a positive-linkage validation cedure. For 
each five constrained-objective or fr jective re- 
sponses, four were randomly desi as right 
and one as wrong. For each five constrained- 
subjective or free-objective responses, one was ran- 
domly designated as right and four wrong. 
Group 2, a negative-linkage validation group, had 
these schedules reversed for the two response classes. 


Thus, Group 1 received validation at a differentially 
higher rate for making a positive linkage between 
the two constructs (constrained-objective or free- 
subjective) as they were applied to the phc tographs, 
and Group 2 received validation at an equally high 
rate for making a negative linkage between the two 
constructs (constrained-subjective or free—objective). 

Group 3 received a procedure validating constructs 
only. Half of these subjects were yoked to subjects 
in Group 1 and half to subjects in Group 2. Each 
Group 3 subject received the same feedback sequence 
as his yoked partner. Thus, the subject received 
feedback which was random with regard to linkage 
(zero-linkage validation). They did, however, receive 
the same overall level of positive feedback as the 
subjects in the first two groups, and this feedback 
presumably was at the same random level for any 
specific construct response, Therefore, Groups 1, 2, 
and 3 all received comparable construct validation. 

Following the photo-rating procedure, the subjects 
were asked to make the same ratings for each person 
on his previously constructed role-title list. Next, a 
set of semantic differential rating scales was admin- 
istered. Each of the four adjectives comprising the 
two experimental constructs (free, objective, con- 
strained, subjective) was rated on seven semantic 
differential scales. One of these scales consisted. of 
the experimental construct not containing the adjec- 
tive under consideration; for example, constrained 
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an "objective-subjective" scale and 
rated on a "constrained-free" scale. 
Defininz bjective and constrained ends of the 
wales as imerically “high” ends, then positive 
linkage v. licated by high ratings of the adjec- 
tives cor ed and objective and low ratings of 
the adjec iree and subjective. Negative linkage 
was indi by high ratings of the adjectives free 
and subj and low ratings of the adjectives 
constraine | objective. A single index of linkage 
was deris rom the sum of the constrained plus 
objective ngs minus the free plus subjective 
ratings. A tive value of this index indicates a 


was ratec 
objective 


positive | e, and vice versa. 

The ot ix semantic differential scales were 
composed two scales for each of Osgood, Suci, 
and Tann um's (1957) dimensions of value, activ- 
ity, and y. The scales used were: good-bad, 
kind-crue! ctive-passive, excitable-calm, strong- 
weak, ar evere-lenient. Positive linkage would 
imply th: nstrained would be rated similarly to 
objective free would be rated similarly to 
subjective gative linkage would imply that con- 
strained l be rated similarly to subjective, and 
free wou rated similarly to objective. Similarity 
scores for h of these four possible construct pole 
pairings (constrained-objective, free-subjective, con- 
strained-su jective, and free-objective) were ob- 
tained by summing the absolute differences between 
the ratings of each pairing over the six semantic 
differential scales for each subject. An index of link- 
age was tained by subtracting the constrained- 
objective plus the free-subjective similarity scores 
from the -objective plus the constrained-subjec- 
live similarity scores. The resulting index would have 
à positive value for positive linkage, and vice versa. 


After the subjects had completed these scales, the 
experimenter asked a series of questions to verify 
the subjective plausibility of the task. Following 
this, the subjects were debriefed as to the actual 
nature of the experiment. 


RrsurTs 
Photo Ratings 


| To control for the effects of base level of 
linkage, the data for each subject for each 
9f the four training blocks were transformed 
to deviations from operant block (Figure 1). 
The ANOVA of these data (Table 1) yielded 
4 significant groups effect. Using the Newman- 
Keuls method (Winer, 1962), Group 1 showed 
Sreater overall positive linkage than either 
Group 2 (q = 8.16, p < .01), Group 3 (q = 
417, p- 01), or Group 4 (g= 5.13, 
^ X 01). In addition, Group 2 also differed 
Significantly from either Group 3 (g = 3.98, 
P<.05) or Group 4 (q = 3.03, P< 05) 
the direction of greater negative linkage 


TABLE 1 


ANALYSIS OF VARIANCE: Correcrep Proto 
Ratincs Scores 


Source 4f | MS F 

Groups (A) 3 360.10 113° 
Ss within A 76 31.79 
Blocks (B) |) ^u 16.25 3.07* 
AXB 9 23.73 448** 

Linear (B) X A 3 | 4040 6.18** 
Ss X B within A 228 5.30 

Ss X Linear (B) 

within A 76 6.54 

Total | 319 | | 

*p «.05 

“p <.001 


between the constructs. Groups 3 and 4 did 
not differ significantly (q = .1.00). 

Scores for Groups 1 and 2 would be 
expected to change monotonically over blocks, 
with Group 1 increasing and Group 2 de- 
creasing. To test the linear blocks effects for 
each group individually, t tests were applied 
to the value of each linearly weighted group 
mean (used in calculating the linear Blocks 
X Groups component of the interaction). 
Group 1 yielded a positive linear blocks ef- 
fect (¢ = 3.23, df = 76, p < .005). Although 
the linear component of Group 2 was in the 
expected direction, the effect was not signifi- 
cant (t = —1.05, dj = 76, p < .20). Neither 
Group 3 (t= .503, df= 76) nor Group 4 
(t = .420, df = 76) yielded significant linear 
components over blocks. A ¢ test comparing 
the means of the first (operant) and last 
blocks of the uncorrected data for Group 2 
did demonstrate a significant increase in nega- 
tive linkage (¢ = —2.82, df = 30, p < 01). 


Role-Title List 


The mean number of constrained-objective 
plus free-subjective pairings for the 20 ratings 
from the role-title list were 11.40, 7.95, 10.02, 
and 9.00 for the four groups, respectively. A 
significant one-way ANOVA (F = 3.93, df = 
3/76, p< .05) was further investigated, 
using the Newman-Keuls method. Group 1 
differed significantly from Group 2 (q — 4.59, 
p<.01), as predicted. However, although 
Group 1 had the highest, and Group 2 the 
lowest, mean score, as predicted, neither of 
these groups differed significantly from either 
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Group 3 or Group 4. The latter two groups, 
as expected, did not differ significantly, 


Semantic Differential: Direct Ratings 


Group means for this variable were 3.85, 
=1.11, 2.15, and — 1.30, respectively, and 
an ANOVA of these data yielded a significant 
groups effect (F = 5.04, df = 3/75, p < .01).8 
Again, individual comparisons were made by 
the Newman-Keuls method. Group 1 was 
significantly different from Group 2, as pre- 
dicted (g = 4.40, p < .05). Group 1 was also 
significantly different from Group 4 (q = 4.57, 
$ < .05), but not from Group 3 (gis Hist: 
Group 2 did not differ significantly from 
either Group 3 (g — 2.91) or from Group 4 
(q = 26). Groups 3 and 4 did not differ 
significantly from one another (q = 3.06). It 
should be noted that for this set of data, the 
means were not ordered as expected. Group 4 
yielded a nonsignificantly greater negative 
linkage than Group 2. 


Semantic Differential: Indirect Ratings 


Means for the four groups on the indirect 
semantic differential variable were 9:/5,:2:59. 
4.80, 4.40, respectively. The relative order of 
the groups is as predicted, but the ANOVA 
did not yield a significant groups effect 
(F=118, df= 3/75, p < .50). However, a 
one-tailed ¢ test of the specific hypothesis 
that Group 1 would yield a more positive 
linkage score than Group 2 was significant 
(t — 1.84, df= 75, p< 05). 


Discusston 


The results indicate that the linkage be- 
tween two constructs can be modified directly 
by validation of the linkage and that the 
modification occurs in the validated direction 
as a function of amount of validation experi- 
ence. Validation of constructs alone had no 
significant effect on linkage, contrary to 
Bannister's (1963, 1965) conclusions regard- 
ing his data. There was also some evidence 
that the linkage which was modified by the 
validational procedure transferred in impor- 
fant ways. First, the positive and negative 


“Incomplete data from one subject in Group 3 
led to his elimination from these and the following 
semantic differential analyses, 


validation procedure produced differential ef -— 
fects as to the linkage or correlated applica- 
tion of the constructs to a different class of 
elements, namely, a list of re people known 
to the subject (role-title list). T transfer 
suggests that the experimental effect was not 
bound simply to the ratings of the photos, 
but, rather, the manipulations resulted in a 
change in the way subjects construed people 
in general on these construct dimensions. The 
addition of a variety of data known about 
familiar persons did not negate the experi- 
mental effect. 

Second, the differential effect of the posi- 
tive and negative validation procedures trans- 
ferred to other methods of measurement. On 
the direct semantic differential me: sure, the 
same linkage differences were measurable 
with regard to the way in which subjects 
made semantic associations among the adjec- 
tives making up the constructs. On the in- 
direct semantic differential measure, there was 
some evidence pointing to the same linkage 
in the relative distances among the construct 
poles in a more comprehensive semantic 
space. The validated pairs of adjectives be- 
came closer or more similar to one another 
in their relationships to other constructs. 
This finding tends to support the idea that 
a relationship was established between two 
constructs which was consistent with the sub- 
jects’ more stable cognitive structures. From 
a slightly different perspective, these results 
suggest that the experimental manipulations 
lead to differential “meanings” accruing to 
the pair of constructs in the context of a 
larger, more stable semantic Space. 
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COALITIONS IN THE TRIAD: 


INTRODUCING THE ELEMENT OF CHANCE 
INTO THE GAME STRUCTURE * 


MICHEL CALDWELL * 


University of California, Los Angeles 


Vinacke and Arkoif's Type V condition was replicated with two changes: 
(a) each player had a separate die; and (b) in the coalition wins condition, 
the subjects were told that, by the rules of the game, a coalition would win. 
In the other conditions, subjects were told that the first marker home would 
win. In the coalition wins conditions, the high-powered player formed signifi- 
cantly more coalitions, accepted a lower share of the prize, and earned fewer 
total points than he did in the other two conditions. Gamson's theory was 
interpreted as unable to account for these findings, while Thibaut and Kelley's 
formulation was found to be accurate and appropriate in predicting the data. 
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191, 3, 271-281 

| an experiment by Vinacke and Arkoff 
(1957), subjects played a modified form of 


a pachisi game in which they could form 
coalitions to pool their resources. When power 
appeared to be distributed among the players 
in the ratio of 4:3:2, coalitions between the 
two low-powered players occurred signifi- 
cantly more often than would be predicted 
assuming equal probabilities of coalition 
formation. This inequality of frequencies oc- 
curred despite the fact that a “rational” or 


- game theory (Von Neumann & Morgenstern, 


1953) analysis of the game reveals that any 
coalition would automatically win, thus im- 
Plying that all players were, in fact, of 
equal power. 

One reason for the Vinacke and Arkoff 
Subjects’ failure to perceive the true power 
distribution may have been Vinacke and 
Arkoff’s use of a die, In most parlor games, 
dice are used as a randomizing agent that 
Sometimes permit a lower powered player to 
Win. In the Vinacke and Arkoff experiment, 
however, the die had no function except, 
Perhaps, to determine how fast some games 
were concluded, This clearly is a change from 


‘This experiment was done as a first-year gradu- 
ate research project under the guidance of Barry E. 
Collins, to whom the author is deeply grateful. Addi- 
Moral thanks go to Michael Hoyt for his helpful 
criticisms of an earlier version of this paper. 

. Requests for reprints should be sent to Michael 
Caldwell, who is now at the Department of Psychol- 
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traditional usage and may have been a cause 
for the players’ failure to act “rationally.” 

Kelley and Arrowood (1960) implied such 
a possibility when they suggested that the 
complexity of Vinacke and Arkoff’s (1957) 
procedure may have prevented subjects from 
acquiring an adequate understanding of the 
true power relations. Using the 4:3:2 power 
ratings, they replicated Vinacke and Arkoff, 
with each group of subjects playing from 10 
to 70 games; they found that, with more 
detailed instructions, the departure from an 
even distribution for the first three games 
was reduced from Vinacke and Arkofi’s 
chi-square of 47.07 to one of only 8.12. 
Hypothesizing that training would further 
reduce this “misperception effect," they also 
calculated a chi-square for the last three 
games. This second chi-square revealed an 
essentially even distribution, as expected. 
However, the hypothesized difierences be- 
tween the first and last three games also 
fell short of statistical significance. 

Although Kelley and Arrowood raised the 
question of misperception of the game struc- 
ture, they refrained from investigating directly 
the effects of changing the structure of the 
game. If, as Kelley and Arrowood implied, the 
subjects’ actions were based on a mispercep- 
tion of what constitutes true power, changing 
the game structure should produce changes in 
the subjects’ bargaining behavior (assuming 
that in all cases the subjects are fully aware 
of the true nature of the game). In particular, 
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the high-powered player would be expected 
to bargain harder, making more concessions 
and entering into more coalitions when he is 
aware of his inability to defeat a coalition. 

Vinacke, Crowell, Dien, and Young (1966) 
challenged the Kelley and Arrowood hypothe- 
sis that the earlier results by Vinacke and 
Arkoff were due to the subjects’ misperception 
of the true power relationships and per- 
formed an experiment in which one, two, 
three, or none of the subjects were specifically 
briefed on the nature of the strategies im- 
plied by Caplow's (1956) theory and game- 
theory analysis. They presented evidence to 
indicate that those subjects who were told 
about the two strategies showed a greater 
awareness of the high-powered player's lack 
9f true power. They also stated that this fact 
was generally comprehended by the great 
majority of the subjects by the end of the 
experiment, However, in spite of their sub- 
jects’ apparent awareness that all players 
had, in fact, equal real power, significantly 
more weak coalitions, which excluded the 
high-power player, were formed. Moreover, 
no relationship could be found between the 
number of coalitions entered into by a sub- 
ject and whether or not the subject had been 
coached on the existence of the two strategies, 
(However, the authors did note a significant 
tendency for male subjects in the all-informed 
condition to learn to form coalitions which 
include the high-powered player.) 

Because Vinacke et al. (1966) based their 
conclusions in part on the effects of providing 
additional information, it may prove instruc- 
tive to examine more closely the nature of 
this information. While both a weighting- 
factor-oriented Strategy and a coalition- 
oriented strategy were discussed, it was not 
made clear to the subjects that victory was 
totally determined by the coalition pattern. 
Indeed, the use of the verbs “could win,” 
“can win,” and “try to defeat,” as opposed 
to “would win,” “will win,” and “will defeat,” 
tends to perpetuate the illusion that victory 
was not automatically assured and that a 
coalition could lose, Thus, Vinacke et al/s 
(1966) results fail to rule out the hypothesis 
that subjects in such situations act as if the 
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outcome were not totally det 
coalition pattern. What is needed i 
nipulate directly the degree | 
subjects perceived the outco 
partially due to chance. 

Chertkoff (1966) manipula 
bility of future success and í 
when high power was accompani 
probability for future success, 
for the high-powered player t. 
was greatly reduced; and ( 
powered player demanded and 
larger share of the prize when h 
of future success was high t 
was equal to that of the lo 
players. 

Applying Chertkoff's findings t 
race or pachisi game setting pr 
prediction that the high-powered player 
should be sought after and offered a high 
percentage of the prize when the ou 
are partially determined by chance 
when, therefore, it is possible for a 


excluded 
> higher 
eived a 
re »bability - 
when it 
l ;»owered. 4 


re horse 
ces the 


once formed, to lose). It should be 

that this prediction is contrary to the hy= 
pothesis set forth by this paper. If, as is 
hypothesized, Vinacke and Arkoíf's (1957). 3 
subjects did falsely assume that their out- 


comes were partially determined by the 
throw of the die, the effect of informing the 

subjects of the true deterministic nature of 
the game should have been (according to 
this paper’s interpretation of Chertkoff) to 
decrease the number of coalitions which — 
included the high-powered player rather than 
increase them. (The differences in prediction 
arise from the fact that Chertkoff’s findings 
deal with the perceptions and motivations of - 
the low-powered players, while present predic- 
tions are obtained by looking at those of 
the high-powered player.) 

However, several differences between Chert- f 
koff’s (1966) procedure and that of Vinacke 
and Arkoff (1957) and Kelley and Arrowood - 
(1960) should be noted: The most obvious 
difference lies in the game setting. Chertkoff — 
used a political convention paradigm, while - 
the other researchers employed a parlor game. - 
In addition, coalition formation was strongly 
encouraged by Chertkoff, and bargaining took 


i 


| 


the const: 
partners | 
with no 
Finally, ti 
varied in: 
ate barga 
players | 
future su 
might be 


ned form of choosing negotiation 
owed by two-person discussions 
ce-person discussions permitted. 
probability of future success was 
»endently of the players’ immedi- 
ig power, with the lower powered 
given equal probabilities of 
Thus, any generalizations that 

ide from Chertkoff’s results still 


require « rical confirmation using a parlor 
game procedure. 

Vinacke, Lichtman, and Cherulnik (1967), 
ina more direct test of this paper's hypothe- 
sis, had male or female triads play either long 
or short sames under conditions in which 
either common dice (deterministic conditions) 
or separate dice (stochastic conditions) were 
used. Whi'e they found that players tended 
to gamble more in the stochastic condition, 
relying on ‘he luck of the dice, there was no 
significan: effect of game structure on the 
number o! weak coalitions found. (A weak 
coalition was defined as one which excluded 
the high-powered player. This failure to 
observe a significant difference between the 


stochastic «nd deterministic procedures sup- 
ports Vinacke’s claim that people fail to 
realize that inclusion in a coalition is what 
is important, as well as this paper’s conten- 
tion that subjects in the deterministic condi- 
tions act as if they assume the game to be 
stochastic in nature. 

In order to properly test the hypothesis 
that subjects act “irrationally” in a determi- 
nistic setting because they assume the results 
to be paritally a function of chance, then, 
it is necessary to run an experiment in which 
the subjects are made aware of the differences 
between the two game structures. Under a 
Condition in which the subjects are told that 
any coalition would win by definition, the 
high-powered player should bargain more 
eagerly and consequently be included in more 
coalitions than under a condition in which 
his high weighting factor makes it theo- 
retically possible for him to defeat a coalition. 
Assuming that the subjects fully understand 
that personal success in such a coalition wins 
Condition depends on being accepted in a 
Coalition, they should conclude that all three 


players have, therefore, equal real power, and 
coalitions should form without regard to a 
person’s assigned weighting factor. Finally, 
the player with a large weighting factor 
should demand and receive a large share of 
the prize under the chance or short condi- 
tions (to be described later) than under 
a coalition wins condition. 


METHOD 
Subjects 


Sixty male psychology students enrolled in intro- 
ductory psychology courses at the University of 
California, Los Angeles, served in partial fulfillment 
of a course requirement. Each group consisted of 
three subjects, and as far as could be ascertained, no 
two subjects in any session knew one another, 


Procedure 


The three subjects in each group were randomly 
assigned positions around a table with the experi- 
menter facing them. On the table was a game board 
consisting of 150 spaces from START to HOME 
(see Figure 1) and three markers colored red, 
white, and green. The experimenter then explained 
the purpose of the experiment: 


This is an experiment to determine how people 
are able to get the most out of their resources in 
a bargaining situation (ie. "the most for your 
money"). What you will be doing is bargaining 
to form a coalition in a game we shall play for 
the next 2 hours. I am also interested in study- 
ing the conditions under which coalitions form or 
fail to form. 


The subjects’ attention was then directed to two 
large charts hung within easy reading distance and 
on which were printed the instructions and weight- 
ing factors. The red marker had a weighting factor 
of 4; the white, 3; and the green, 2. The instruc- 
tions read as follows: 


1. Object of the Game: To win as many points 
as possible. 

2. We will play six games in each of three con- 
ditions (18 games total). Prior to each game the 
experimenter will announce the condition and 
which of six games (see weighting factor chart) 
will be played. The players will then place their 
markers on START and may (if they choose) 
begin to bargain among themselves to form a 
coalition in an attempt to improve their chances 
of winning, 

3. Every 50 seconds the experimenter will roll 
the dice. Each player then moves his marker for- 
ward a number of spaces equal to the number of 
spots on his dice times that player’s weighting 
factor (see weighting factor chart). 
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Fic. 1, The game board, 

4. At any time two players may form a coali- the manner specified by the coalition contract. 
tion, No three-player coalitions are allowed. After The game ends when a coalition is Jormed. 
Teaching an agreement on how they want the Should, however, a player reach HOME before a 
experimenter to distribute the points in the event coalition is completed, he will receive the full 
the coalition wins, the two players will fill out a 100 points. (b) In the CHANCE condition, 
coalition contract provided by the experimenter, when the contract has been filled out the coalition 

5. After a coalition has been formed [it has members will use their most advanced marker 
been found that almost always a coalition will and will remove the other marker from the board. 
form], the winner of the game is decided in the For the rest of the game they will then have 4 
following fashion: (a) In the COALITION WINS Weighting factor which js equal to the sum of 
Condition the experimenter will divide 100 points in their individual weighting factors, The instant the 
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TABLE 1 
PERCENTAGE OF 2,000 GAwES IN Witch THE LOW-POWERED PLAYER Fintisies FIRST 
Head start for the low-powered player (or coalition) 
Weighting (spaces) 
tactor — —À Ed 

0 5 10 15 ^d 25 3 
3 versus 4 58 65 10.2 138 | 183 23.3 
2 versus 4 0 0 0 POAN LAS 2. SA 
4 versus 5 94 11.1 16.0 21.2 27.4 30.8 39.9 
3 versus 6 0 0 E. 2 6 6 14 
2 versus 3 A 6 i ae 2.6 48 | 6.1 9.6 
3 versus 4 7.0 8.1 11.7 19.0 | 21.9 30.1 39,2 
2 versus 4 0 0 0 0 E! 8 1.6 
4 versus 5 12.6 154 20.0 23.7 33.2 39.9 46.8 
3 versus 6 d 4 2 8 9 2.0 3.6 
2 versus 3 9 Ls TP, 83 124 20.9 
3 versus 4 10.0 10.5 16.9 26.5 33.2 41.7 58.8 
2 versus 4 BL E) a 1.0 21 44 10.8 
4 versus 5 12.9 14.0 24.9 31.2 37.2 45.6 60.9 
3 versus 6 oa 3 1.0 2.5 3.7 4.9 11.4 
2 versus 3 Es 3.0 6.2 9.4 19.8 26.0 38.9 
3 versus 4 10.9 15.6 23.1 38.9 43.6 59.2 73.2 
2 versus 4 E) E 24 4.7 9.8 18.9 36.0 
4 versus 5 16.6 22.8 27.9 35.1 50.9 60.4 70.0 
3 versus 6 11 14 34 8.3 10.1 20.2 34.4 
2 versus 3 3.6 6.2 14.1 21.9 37.6 52.4 69.5 


t marker (individual's or coalition’s) reaches 
ME the experimenter will distribute the 100 
ts to the winner(s). (c) The SHORT condi- 
n is similar to the chance condition except that 
first marker to reach 60 wins. 


Alter the subjects had finished reading the instruc- 
j the experimenter paraphrased them, showing 
the game was similar to a horse race or 
i game, how coalitions were formed in the 
conditions, and how the three conditions 
d. Pains were taken to show that, in the coali- 
Wins condition, the coalition would automati- 
be pronounced the winner. No mention, how- 
Was made concerning the irrelevancy of the 
hting factors in that condition, The experimenter 
described his criteria for determining when a 
ition was formed. A coalition was considered to 
been agreed upon when either (a) the excluded 
ber indicated he was willing to “go it alone" 
by not saying anything, that he had finished 
aining; or (b) when two successive offers from 
excluded person were refused by members of the 
n. In cases of doubt, the experimenter waited 

he was certain that a coalition was formed, In 

| attempt to keep each game independent, the ex- 
nenter concluded his explanation by suggesting 

at subjects pretend they were playing each game 
4 dollars worth of nickels and that, therefore, 
Was won or lost in previous games was irrele- 

- The subjects were not allowed to keep written 


track of their winnings and were discouraged from 
trying to keep track in their heads. 

Six games in each condition were then played in 
a random order, with each player playing in all 
power positions against all possible combinations of 
opponents. The order in which the chance and coali- 
tion wins conditions were played was randomized, 
with the restriction that every four groups would 
have two groups starting under cach condition. The 
short condition was always played last. Immediately 
after a coalition was agreed upon (or at the end of 
the game in the event no coalition was agreed 
upon), postcoalition questionnaires were passed out 
to each player. These questionnaires soon became 
a routine procedure and were often filled out auto- 
matically by the subjects. 

After each condition, postcondition questionnaires 
were completed; then the experimenter would review 
the rules for the next condition. At the end of the 
experiment, a postexperimental questionnaire was 
completed by the subjects. 


The Game 


While the probability of a coalition’s winning was 
defined to be 100% in the coalition wins condition, 
its value is less clear in the other two conditions, 
The number of spaces each player (or coalition) has 
remaining before he reaches HOME directly deter- 
mines the probability of an upset, and therefore the 
probability will vary as the game progresses. To 
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TABLE 2 


AwALYSIS of VARIANCE: Numer or COALITIONS 
AGREED TO By RED 


Source | a | Ms | oF 
Condition (A) — | 2|sas | 5.20* 
Order (B) I| 540 | 240 
AXB 2 | 1.55 | 95 
Error (Groups X Condition 
within Order) 36 dd 


*p«.. 


obtain estimates of the probabilities, computer simu- 
lations ® were run which varied both the number of 
spaces remaining for the high-powered player (or 
coalition) and the head start which a lower powered 
player might be presumed to have. Table 1 presents 
the results of these simulations. It is dear that while 
upsets are not uncommon in the short condition, 
the chance condition is similar to the coalition wins 
Condition in that a coalition formed early in the 
Same essentially guarantees success. 


RESULTS 
Behavioral Measures 


Differences between conditions. Table 2 
contains the results of an analysis of variance 
which tests whether changing the condition 
or structure of the game affects the number 
of coalitions that red (the high-power posi- 
tion) entered into. Red entered significantly 
more coalitions under the coalition wins con- 
dition than under the chance or short condi- 
tions. The mean number of coalitions entered 
into by red was 3.05 in the coalition wins con- 
dition, 1.90 in the chance condition, and 1.95 
in the short condition, Thus, the first hy- 
pothesis is clearly confirmed. Similar analyses 
for white and green, however, reveal no sig- 
nificant condition effects. No analysis for any 
position revealed a significant effect for order 
in which the conditions were played, or inter- 
action between condition and order, 

The fact that red entered more coalitions 
in the coalition wins condition than he did in 
the other two conditions may have resulted 


* The author wishes to thank Robert Koegel who 
wrote and ran the simulation programs. Computing 
assistance was obtained from the University of Cali- 
fornia, Los Angeles, Campus Computing Network 
and from the Health Sciences Computing Facility, 
University of California, Los Angeles, the latter 
facility being sponsored by National Institutes. of 
Health Special Research Resources Grant RR-3, 
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from his toning down his de 
age share of the prize in 
red agreed to (regardless oi 
the coalition won) is preser 
Red appears willing to accept 
when he is in the coalition 
Moreover, in the coalition wi 
share of the prize was not 
ferent from the 50% presc 
theory or an analysis based v 
Our third hypothesis is ther 
firmed. 

Differences from chance. 1n 
pare our results to the earlier o 
and Arkoff (1957) and Kelley 4 


(1960), chi-squares were com; or each 
condition (excluding groups w! > coali- 
tion formed). Whereas chi-squa of 31.7 
and 27.3 were found for the cha d short 
conditions, the coalition wins condition chi- 
square was only 5.96, which is signifi- 
cantly different from chance even ; ming an 
independence of games. A more anding 
test of our second hypothesis, howey er, in- 
volves the specific test of whether or not red 
enters into fewer coalitions than would be 
predicted by chance in the coalition wins 
condition. This was found to be significant 
(t = 3.09, df = 18, p < .01), a resuit which 
is inconsistent with the hypothesis that peo- 


ple follow a “rational” or game-theory strat- 
egy. Even in the coalition wins condition, 
there is a tendency for red to remain single. 

Number of points won. Simmel (1950) hy- 
pothesized that the low man holding the bal- 
ance of power generally profits far out of pro- 
portion to his resources by allying himself 
with one of the two stronger members of the 
triad. Was this claim borne out? In the coali- 
tion wins condition, all positions had equal 


TABLE 3 


RED's AVERAGE SHARE OF THE PRIZE 


N 


lhe aver- 
tions that 
er or not 
Table 3. 
iller share 
ondition. 
lition, his 
intly dif- 
by game 
ıl power. 
iso con- | 


to com- | 
Vinacke 
rrowood 


Item Coalition wins Chance Short 
Ns 19 16 
M 53.71 57.03 


Note.—Coalition wins versus chance ¢ = 1.79, Pr 
< .05, one-tailed) ; coalition wins versus short / = .64, 


= 33; short versus chancel = 1.53, df = 33 


" [Prd ^ " e 
aN = number of groups in which red formed at least on! 


coalition, 


TABLE 4 
ALYsIs Or Vouuasce: Rep's Desire TO 
For ALITION 
| a | ats mE 
2 | 73.73 | 14.75° 
1 4.56} 35 
2| 3.34) 6 
(Groups X Condition | 
within Order) 36 | 5.00 


al power, and so we would expect green, 
ith a weight of 2, io win about 200 of the 


ints. In the chance and short conditions, 
e low-powered player received an average 


e, however, are above the 133.33 points 
would have been received had rewards 
allocated strict!y on the basis of re- 
. Thus, while the low man does benefit 
his position oi holding the balance of 
; he does not achieve a position of equal- 


Westionnaire Measures 


To obtain a direct measure of how much 
players wanted to form coalitions when 
were in the various power positions, 
onnaires were used in addition to be- 
oral measures. Results from analyses of 
Postcoalition and postcondition question- 
es follow. 
- Postcoalition questionnaire. The behavioral 
leasures, while clearly describing what hap- 
do not necessarily reflect the desires of 
Players involved. For example, red may 
excluded from a coalition because he is not 
ted by white or green as Gamson (1961) 
implied, or because he may not choose to 
pate, feeling he can do better by “go- 
it alone," "Therefore, an attempt was made 
heasure a subject's desire to form a coali- 
by the postcoalition questionnaire. Sub- 
Were asked how much they wanted to 
1 a coalition with each of the other two 
ers, what they felt their minimum ac- 
le percentage of the prize was, and 
5 they felt was a fair share. A factor 
Sis of the within-condition-within-power 
"on correlations indicated that a single 
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factor could be used which accounted for 
more than 43% of the total variance, Using 
the within-cell factor loadings as weighting 
coefficients, a composite score which measures 
the desire to form any coalition was com- 
puted, and these composite scores were ana- 
lyzed for each power position. Table 4 pre- 
sents the results of that analysis, and Figure 
2 presents the associated graph of the com- 
posite cell means. Condition effects for the 
individual questions ranged in significance 
from p < .001 to p < .05. Red clearly wanted 
more to be in a coalition in the coalition wins 
condition than in the chance or short condi- 
tions. Moreover, there were no significant 
effects for white or green in the composite 
scores, and the only significant individual 
question effect was a tendency for low-pow- 
ered players in the chance condition (particu- 
larly when chance was played first) to con- 
sider a smaller share of the prize to be fair 
in the event a red-green coalition were 
formed (F= 3.76, df =2/36, p< .05). 
Since this tendency was not reflected in two 
other closely related questions and since the 
cell differences did not occur when the coali- 
tion wins condition was played first, this 
difference may have been the 1 in 20 that we 
would expect by chance. 

Postcondition questionnaire. To determine 
how well the subjects understood the true 
nature of the game under each set of rules, a 
postcondition questionnaire was administered 
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which asked what percentage of the time 
white or green would beat red, assuming no 
coalition, and what percentage of the time a 
coalition of green and white would beat red. 
If the instructions were understood com- 
pletely, the answer to this last question 
should have been a trivial 100% in the coali- 
tion wins condition. However, this turned out 
not to be the case. Even with detailed instruc- 
tions, a coalition was estimated to win only 
! 86.7% of the time in the coalition wins con- 
l dition. In only 2 of the 20 groups did all three 
y members indicate that a coalition would win 
i 100% of the time! Furthermore, there was no 
- significant condition effect for this question 
(F = .857, df = 2/36). 

) These findings create a paradox in that 
they imply that the experimental manipula- 
tion was ineffective, and yet the behavioral 
and questionnaire dependent variables showed 
significant manipulation condition effects. The 
finding that subjects in the coalition wins con- 
D dition indicated a substantially greater ten- 
: dency to answer “each equally” or “this is a 
7 meaningless question,” when asked who con- 
tributes more to a coalition, tends to reassure 
us that the manipulation was at least partially 
successful. We are still, however, left with the 
question as to why subjects failed to mark 
“100%” in the coalition wins condition. 

One possible explanation may be that the 
subjects were reluctant to mark such an ex- 
treme answer. Some subjects have a general- 
ized bias against extreme answers. Others may 
prefer to avoid answering “always” or “never” 
on any question, feeling that such answers 
are too simplistic and therefore probably 
wrong. Another explanation may be that the 
subjects failed to internalize the instructions, 
The employment of dice may have caused the 
subjects to believe that luck did play a part 
and therefore that nothing was guaranteed 
100% of the time, As a result, they may have 
failed to perceive that the dice were, in fact, 
irrelevant and that the game only appeared to 
be similar to other parlor games. In any event, 
the failure of the subjects to estimate a 100% 
chance of a white-green coalition victory could 
account for the nonconfirmation of our second 
hypothesis (i.e, that red would enter into 
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two-thirds of the coalitions or what would be 
predicted by chance). It should be noted that, 
in the two groups where all members indi- 
cated that a white-green coalition had a 100% 
chance of winning, red entered into slightly 
more coalitions than would be predicted by 
chance—a finding which is in accord with the 
prediction. 
Discussion 

Stryker and Psathas (1960) pointed out the 

dangers of drawing conclusions about the 


manner in which coalitions are formed solely 
on the final outcomes with no observation of 
the process. Often they found bargaining to 
be highly rational and unmotivated even 
though the number of coalitions suggested 
otherwise. For this reason, a word about the 
bargaining situation itself is appropriate. 

Each group seemed to have its own distinct 
style. Some preferred to roll the dice several 
times and reach a bargaining solution nearer 
the end when there was less uncertainty. Oth- 
ers preferred to use their sense of probability 
and make a deal before the dice were rolled. 
The pace of bargaining varied from an al- 
most bored sedateness to a hectic rapidity 
punctuated by nervous laughter. An example 
of the bargaining (particularly in the coali- 
tion wins condition) might sound like the 
following: 


: PIL go 50-50 with you, B. 
OK. 


: PI go 50-50 with you, B. 

: TII take it, 

: I'll give you 65. 

: TII take that. 

: How about ws going together 50-50, A? 


OBS sore 


Often, particularly in the short condition, 
groups would develop norms in an attempt to 
find an end to the bargaining. These took the 
form of handshakes, or saying “Closed, 
which indicated that the player would not 
listen to any offers made by the third man. 
Sometimes the norms took the form of pre- 
scribing a 50-50 split or a 60-40 split be- 
tween white and green, with red “taking his 
chances.” Whether or not these norms grew 
out of fatigue, experience with the game, or à 
realization by the low-powered players that 


PW 


had to reach an agreement rapidly can- 
je determined." 

little further reflection leads to another 
resting observation. In the coalition wins 
dition, the subjects are given, in effect, an 
juble task. All players have equal real 
and must choose who wins and who is 
jd man out.” To sit there haggling through- 
experiment would only serve, in Thi- 
tt and Kelley’s (1959) terms, to raise 
one's costs. It is, then, perhaps to the 
dit of Stryker and Psathas’ (1960) sub- 
ts (who often tossed a die to determine 
‘coalition and the division of the prize, 
ed one another as to the correct strategy, 
lved norms as to what was a fair share) 
t they often chose a rational, if unex- 
tted, approach to solve the problem. More- 
, those subjects who decided to use norms 
3 means of distribution had a few that 
ready made. They could use the experi- 
s weighting factors to determine who 
the coalition. Since who received which 
ng factor was determined in the 
e and Arkoff (1957) and Stryker and 
(1960) experiments by chance, such 
sion should have seemed fair to all con- 
(Such an explanation recalls Orne's, 
discussion of demand characteristics 
Was also pointed out by Vinacke et al., 
6.) In any event, the labeling of such 
ies as "irrational" may be misleading 
that such a description implies that sub- 
actions were not governed by cognitive 
sses. 

mson (1964) postulated an “utter-con- 
1 theory" which would be discriminated 
by the convention of reporting only 
t findings. What support for this 
can we find? In the bargaining, there 
instances where, in the rush to form a 
tion, rational behavior seemed to be lost 
nfusion, There were comments expressed 


should be noted that Vinacke et al. (1967, 
) found a tendency for subjects in their short 
lon to bargain less, to make more 50-50 deals, 
form no coalitions. Since their triads were run 
Y one condition, we may tentatively rule out 
Or experience as the most likely explanation 
t results. 
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which indicated a “What have I done?” or 
“Now I know why we lose at the peace con- 
ferences” sentiment, but, on the whole, ra- 
tional processes seemed to have had the upper 
hand. One factor which  utter-confusion 
theory might postulate to be important is 
physical position around the table. Because 
the seats were assigned randomly, this pre- 
diction may be tested by looking at the aver- 
age winnings of each physical position. Com- 
puting the average difference between each of 
the three seats failed to produce any signifi- 
cant results. Thus, it seems unlikely in this 
experiment that one's location had a signifi- 
cant effect in the bargaining. 

The results of this experiment appear to 
support Kelley and Arrowood's (1960) con- 
tention that the subjects in Vinacke and 
Arkoff (1957) type experiments are led to 
believe that the weighting factors imposed by 
the experimenter represent real indexes of 
power. Moreover, awareness that coalitions 
hold the key to success appears to be ac- 
companied by the subjects’ abandonment of 
the weighting factors as a criteria in deter- 
mining the share of the prize. On the other 
hand, the data also indicate that even when 
the dice are allowed to play a part in deter- 
mining the results of the games, many sub- 
jects persist in underestimating the full power 
of a coalition. (According to the simulation 
data, a coalition results in a 90% probability 
of success in the chance condition.) Thus, the 
subjects may be accused of behaving "irra- 
tionally" in the sense that strategies based on 
weighting factors fail to achieve optimum 
results, 

One current theory which appears unable to 
accommodate the findings of this experiment 
is Gamson's (1961) notion that the higher 
powered player will be rejected because the 
other players expect him to demand his, 
“rightful” share. The data presented here in- 
dicate that "strength is weakness" only be- 
cause it leads to an overestimation of one's 
own power and worth. In this experiment, red 
priced himself out of the market rather than 
being thrown out. Moreover, in the short con- 
dition, his high self-evaluation seems to have 
been realistic. 
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The theory of Thibaut and Kelley (1959) Gamsoy, W. A. Experimental studies of coalition 


3 3 formation. In L. Berkowitz (Ed.), Advances in 
pu ur provide ts fairly adequate UN lana- experimental social psychology. Vol. 1. New York: 
tion of the present findings. In the chance Academic Press, 1964. 
and short conditions, red had a high compari- Kerrey, H. H., & Arrowoon, A. F. Covlitions in the 
son level for alternatives (CLany). He could triad: Critique and experiment. Sociometry, 1960, 
and did reject the offers of the other two ot Pe On eda T 
play ES feeling—and SUID CHBIGS stating—that logical experiment: With “partic lar reference to 
he would rather go it alone and try for the demand characteristics and ir implications. 
full 100 points. Moreover, red had power over American Psychologist, 1962, 17, 783. 
the other two positions in that, with a weight- Simmer, G. The sociology of George Simmel. (Trans. 
ing factor of 4, he could still partially deter- _ & ed. by K. Wolff) Glencoe, Ill: fice Press, 1980. 
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George idler (1964) proposed and tested 
the hypot : that the interruption of an 
“organized response sequence” produces a 
state of “arousal” and, moreover, that the 
more organized or familiar the organized 


‘ence, the more arousal there will 
its interruption. 


Tesponse s: 
be following 


In the present study, Mandler’s hypotheses 
Were retested employing an interruption tech- 
mque and physiological measure of arousal 
different from Mandler’s. These and other 


thanges were introduced to provide a better 
Tepresentative of interruption and a suitable 
method of measuring arousal both before and 
after interruption, 

An organized response sequence, according to 
Mandler, is a unitary organization of pre- 
viously separate behaviors. He suggested that 


When discrete units of behavior are organized, the new 
Unit behaves in the same unitary fashion as its previous 
Constituents. Once initiated, organized responses run 
p of smoothly and without pause; they have an inevi- 
lability of completion which is just as determined in 
Ong sequences as it is in such short sequences as 
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LARY DILATION BEFORE AND AFTER INTERRUPTION 
OF FAMILIAR AND UNFAMILIAR SEQUENCES 
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Hypotheses regarding the physiological effects of recalling and interrupting familiar 
ind unfamiliar digit or word sequences were combined and tested using pupillary 
llation as the physiological measure. Recall was interrupted by stopping pacing 
metronome clicks. Interrupted subjects were those who halted or stopped their 
-ecall when the clicks stopped. ‘The pupillary dilation during recall was expected to 
e smaller for the familiar than for the unfamiliar sequences; conversely, dilation 
following interruption was expected to be greater for the familiar than for the un- 
familiar sequences. The predicted inverse relationship, represented by a Familiarity 
< Stages interaction, was confirmed (F 
arded as a contribution to Mandler's theory of interruption. 


= 10.11, p < .001). The results are re- 


speaking a word, pressing a button, or swallowing food 
[Mandler & Watson, 1966, p. 264]. 


The interruption of an organized response 
sequence simply implies that an organized 
response sequence which has been initiated 
has been blocked in some unanticipated way. 
As long as the situation remains essentially 
unchanged, a tendency to completion is as- 
sumed to exist. The manifestation of this 
blocked tendency to completion is visceral 
arousal, having potentially measurable effects 
in all autonomically innervated organs. 

Mandler (1964) demonstrated that the inter- 
ruption of an organized response sequence such 
as a memory-recall task results in differential 
arousal of the galvanic skin response (GSR), 
depending on the level of organization of the 
recalled material. Specifically, he found that 
the interruption of the recall of simple sen- 
tences elicited a greater GSR than the inter- 
ruption of the recall of relatively difficult 
scrambled word or digit sequences. 

Although Mandler regarded the substitution 
of unexpected words or digits simply as a 
means of inducing interruption, he may only 
have demonstrated that an unexpected word 
or digit put into a familiar sequence of words 
or digits may have more novelty value than an 
unexpected word or digit put into an unfamiliar 
sequence of words or digits. 

The notion of interruption, as distinct from 
novelty, may be preserved, however, by the 
operation used in the present experiment: the 
stopping of pacing metronome clicks. In con- 
trast to the substitution of words or digits, 


281 


" 
b 
i 


GEM ZWVUCUETPCTNTEYyw T 


E (To. 
EET 


the stopping of a metronome is equally novel, 
whether occurring simultaneously with the 
recall of a sentence or a string of digits. Conse- 
quently, it serves as an interruption in the non- 
expectancy sense that Mandler intended, pro- 
viding the possibility of a more adequate test 
of his theory. 

Pupillary dilation, rather than the GSR used 
by Mandler, was chosen as the measure of 
physiological activation for the present ex- 
periment, because it is not only sensitive to 
changes in levels of arousal (Loewenfeld, 1958) 
but has the additional advantage of sensitivity 
to the apparent levels of processing involved 
in the performance of mental tasks. 

Hess and Polt (1964) discovered that pupil- 
lary dilation varied directly with the difficulty 
of solving mental arithmetic problems, Subse- 
quently, Kahneman and Beatty (1966) demon- 
strated that pupillary dilation conforms to the 
apparent “momentary processing load" in- 
volved in doing simple digit-span recall tasks, 
with pupillary dilation increasing during the 
presentation of the digits and decreasing as the 
digits were successively recalled. These recall 
tasks were of the same type as those used in the 
present experiment. 

One important feature of the design of the 
present study is the use of two types of recall, 
Silent and aloud. Kahneman and Beatty ob- 
served that vocalization, regardless of content, 
was accompanied by pupillary dilation. Conse- 
quently, a silent recall was introduced into the 
present experiment in order to have a measure 
of mental activity unconfounded with a vocali- 
zation effect. Since the present experiment was 
designed, Kahneman, Onuska, and Wolman 
(1968) have reported finding significant pupil- 
lary response associated with silent recall. An 
aloud recall with interruption of it rather than 
the silent recall was included for other reasons: 
(a) Mandler had employed an aloud recall, and 
(b) the act of vocalization combined with recall, 
it was assumed, would foster the probability of 
successfully interrupting the subject. 

In keeping with Kahneman and Beatty's 
findings, the pupillary dilation during the two 
recall stages was expected to be smaller for the 
familiar than for the unfamiliar sequences. In 
keeping with Mandler’s findings, the pupillary 
dilation following interruption (the post- 
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interruption stage) was expected to be greater 
for the familiar than for the unfamiliar se 
quences, Statistically, this two-sided hype 
thesis was tested as a predicted interaction of 
the levels of familiarity over the recall and post- 
interruption stages. 

Using the terms of Mandler Kahneman 
and Beatty, this prediction can be restated in 
a more general fashion : /ncreases in momentary 
processing load during execution of an organized 


response sequence vary inversely wilh increases im 


arousal level following interruption. of tht 
organized response sequence. 
METHOD 
Subjects 
Subjects were 44 female nursing students from tw 
training centers in Cincinnati, Ohio. All subjects were 
between the ages of 18 and 22. Representing about hall 
of the students tested, they satisfied the criterion that 
the cessation of the metronome clicks result either im 
partial or complete stopping of their aloud recall; the 
remainder of those tested neither hesitated nor stopped 


in their aloud recall. 


Procedure 


Subjects were initially randomly assigned to one of 
the four sequences of recall stimuli. Differential inci 
dence of effective interruption among the sequences 
led to attempts to equalize the JV of first one sequence, 
then others. As a result, only the first half of the sue 
cessfully interrupted subjects can be said to have been 


randomly assigned. The final N for each sequence | 


was I1. 

Each sequence, including two practice sequences, was 
presented in the same format. Each was conducted in 
time with background metronome clicks sounding ata 
one-per-second rate. Following a few clicks, the subject 


would hear the experimenter’s recorded voice sa, | 


“Keep your eye on the mark," followed by a three-click 
pause. Then the experimenter said, “Ready,” followed 
by another three clicks. Then the experimenter recited 
the recall stimuli, saying one per click. The task of the 
subject was to wait for three clicks, then recall the 
stimuli silently to herself, one per click, wait for three 
more clicks, and then recall the stimuli aloud, one per 
click. Interruption was the cessation of the clicks follow- 
ing the recall of the second of the spoken stimuli. n 
was achieved by the experimenter's shutting off t 
recorded signal for approximately 4 seconds following 
the moment the subject recalled the second stimulus- 
Early or late recall by 1 or 2 seconds was acceptable, 
and the interruption was advanced or retard 
accordingly. 

The two practice sequences had the same recall 
stimuli. The experimenter simply counted, “One, tW 
three, four,” in time to the clicks. 


crements 


(in cm.) 


Magnified Pupillary Dilation 


Post-later 
Fic. 1. 


The four experimental sequences varied on two di- 
mensions: symbolization and familiarity. Symbolization 
of the sequences was either in words or digits. Famili- 
arity of the sequences was varied by choosing a familiar 
Sequence of words or digits and jumbling the words or 
digits for the unfamiliar order. The specific recall 
Stimuli employed were: familiar digits—1, 2, 3, 4, 5, 6, 
unfamiliar digits—5, 2, 3, 6, 7, 4, 1; familiar words— 

and Jill went up the hill; unfamiliar words—up 
and Jill the hill went Jack. 


Measurement 


d 
Pupillary changes and subjects’ verbal responses 
Were recorded by means of a television camera with a 
1 telephoto lens, a microphone, and a videotape recorder. 
1 Measurement of the pupil was made with a millimeter 
[ e on the face of a television monitor. 
The measures of pupillary response for the silent 
ind aloud recalls were determined with reference to a 
435€ period consisting of the 2-second period before the 
4 "Ready" signal was spoken. Measures during the base 
Period and the silent and aloud recall periods were made 
at the 1-second-interval points when the metronome 
Clicks could be heard on the recording. The base-period 
Measure was the mean of the two 1-second-interval 
Points described, 
he silent recall measure was the mean of the first 
eur 1-second-interval points minus the mean of the 
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Stages 


fean pupillary dilation increments at each stage for subjects who stopped when interrupted, 


base period. Although the experimental recall stimuli 
consisted of seven words or digits, taking 7 seconds to 
recall, only the initial 4 seconds of the silent recall 
period entered into the calculation of the mean for that 
period. The purpose of this limitation was to avoid the 
attenuating effects at the conclusion of a recall period 
when the pupil tends to return toward its initial level 
(Kahneman & Beatty, 1966). 

The aloud recall measure was the mean of the first 
three 1-second-interval points minus the mean of the 
base period. Since the interruption was the absence of a 
metronome click at the outset of the third second, 
there was no audio marker for the third 1-second- 
interval point. Consequently, the tape was hand 
advanced to a point approximating a 1-second interval 
or slightly less, This point served not only as the last of 
the aloud recall measurements but also as the base 
measure for the postinterruption dilation measures. 

The postinterruption measure was the maximum 
pupillary dilation during the 3 seconds following inter- 
ruption minus the base measure at interruption. 


RESULTS 


Figure 1 presents the mean pupillary dilation 
increments for the familiar and unfamiliar digit 
and word sequences at each stage. The means 
for the digit sequences are presented in the 
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TABLE 1 

E AwALvsiS OF Variance ror Sunjecrs WHo 
, Srorren Wien IxTEkRRUPTED 
Source 4f MS F 
I - | 
| Familiarity (A) 1| 2856 | 507* 
- Symbolization (B) | 1 0938 1.67 

AXB 1 2856 5.07" 
Ss within groups 40 | 0563 

Stages (C) 2 | 461 1.79 
L AXC | 2 | .2598 | 10.11** 
j BXC 2 | 0748 | 2.91 

AXBXC | 2 | 0940 | 3.66* 
4 C X Ss within groups | 80 | .0257 | 

i 


left panel, and the means for the word se- 
quences are presented in the right panel. 
Within each panel, the mean for each familiar 
. . and unfamiliar sequence is plotted, from left 
to right, for each of the three experimental 
stages. 
The left panel of Figure 1, illustrating the 
means for the digit sequences, shows the pre- 
= dicted interaction pattern. The means for the 
two recall stages were smaller for the familiar 
than for the unfamiliar digits. Conversely, the 
means for the postinterruption stage were 
... greater for the familiar than for the unfamiliar 
— digits, 
E With respect to the word sequences, shown 
in the right panel of Figure 1, the interaction 
pattern differed both in shape and magnitude 
- from the pattern for digits. As expected, the 
means for the silent recall stage were smaller 
for the familiar than for the unfamiliar words, 
but, unexpectedly, the means for the aloud 
recall stage were slightly greater for the familiar 
than for the unfamiliar words. The means for 
the postinterruption stage followed the ex- 
pected pattern, namely, greater for the familiar 
than for the unfamiliar words. 

The data were analyzed with an analysis of 
variance (see Table 1). The statistic of pri- 
.. mary interest in the analysis of variance was 
the Familiarity X Stages interaction term. 

This interaction was highly significant (F 
= 10.11, p < .001). The analysis of variance 
also revealed other significant effects: the main 
.. effect due to familiarity (F = 5.07, p < .05), 
the Familiarity X Symbolization interaction 


(F = 5.07, p < .05), and t 
Symbolization X Stages inte: 
p < .05), 

The significant Familiarit 
action indicates that the effect 
familiarity differed in the var 
finding is necessary but not s 
firmation of the experimental } 
the interaction could have be: 
to a differing familiarity effect 
stages. However, planned com; 
the means of the two recall s 
significantly different (F = 1 
while the mean of the postint« 
was quite significantly different í 
the recall stages (F = 18.62, p 

The digit sequences showed 
effects both with greater consisten: 
er magnitude than did the word 
accounting for the unpredict: 
triple interaction, the Familiarity 
zation X Stages interaction (/ 
X .05), and, more specifically, 
stages versus postinterruption-sta 
ent of the Familiarity Symbol 
Stages interaction (F = 7.30, p < .025). 

The difference in the effect of the digit ant 
word sequences also accounted for two of tht 
other unpredicted significant effects: t 
Familiarity  Symbolization interaction (l 
= 5.07, p < .05) and the recall-stages vers 
postinterruption-stage component of the Sym 
bolization X Stages interaction (F = 5, 0, 
p < .025). ) 

There were 44 subjects on whom measure 
ments were made who failed to stop the 
aloud recall when the metronome <5 
stopped. These 44 subjects were distribu ec 
unequally among the four treatments, and K 
statistical analysis was made of these meas re- 
ments. Figure 2 illustrates the mean dilation 
increment associated with each treatment K 
each stage. Comparison of the interacti 
patterns of Figure 2 (nonstoppers) with tho 
of Figure 1 (stoppers) reveals their similari! 
particularly for the unfamiliar digit 
familiar word sequences. 


DISCUSSION 


The present study provided support for the 
findings of Mandler and the findings of Kahne- 
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atty. However, this study’s chief 

ewly found relationship: increases 

ry processing load during execu- 

organized response sequence vary 

ith increases in arousal level fol- 
erruption of the organized response 
s his finding broadens Mandler's 
interruption theory, providing a more complete 
description of the physiological correlates of an 
organized response sequence. 

Neither Kahneman and Beatty’s nor Mand- 
ler's hypothes expressly anticipate this 
finding. However, Mandler (1964) discussed an 
idea which appeared in an earlier report 
(Kessen & Mandler, 1961) which can be use- 
fully applied to this finding. Kessen and 
Mandler suggested that organized responses 
function as inhibitors of arousal or anxiety. 
Mandler (1964, p. 175) added that in contrast, 
interruptions function as disinhibitors of 
arousal. 

These proposals have an obvious limitation, 
however, While under some circumstances an 
Organized response sequence probably inhibits 


Sequence. 


PUPILLARY DILATION 


Post—Interruption 
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can pupillary dilation increments at each stage for subjects not stopped when interrupted. 


arousal, the execution of an organized response 
sequence in this study was always accompanied 
by some disinhibition of arousal. Hence, it may 
be preferable to conceive both an organized 
response sequence and interruptions as princi- 
pally disinhibitors. One desirable consequence 
of this conception is that it reduces the present 
findings to the operation of a single mechanism, 
Correspondingly, the hypothesis of this study 
can be restated as follows: disinhibition of 
arousal during execution of an organized 
response sequence varies inversely with dis- 
inhibition of arousal following interruption of 
the organized response sequence. 

This hypothesis is not only simpler than the 
initial one, it also better fits Mandler's larger 
program. Mandler (1964) regards his findings 
on interruption as the first step in establishing 
the organized response sequence as the basic 
unit of behavior in an "energy-less and ex- 
pectancy-deprived psychology [p. 164].” Re- 
garding the arousal associated with an or- 
ganized response sequence as disinhibition 
this program by avoiding such 


supports 


“energy”-linked concepts as goal gradients 
and drive strength. 

This program has still wider aims. Mandler 
has reported findings to support anxiety and 
.. even complex social phenomena as examples of 
___ the effects of interruption of organized response 
sequences (Mandler & Watson, 1966). 
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F-PRESENTATION, EMBARRASSMENT, AND FACEWORK AS 


VALUATION, CONDITIONS 


OF SELF-PRESENTATION, AND FEEDBACK 
FROM OTHERS 
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Subjects who had failed or succeeded on an anagrams test (bigh or low selí- 
evaluation) described themselves to another subject when they either did or 
did not expect to meet her and when she would or would not know the sub- 
ject's name (public versus private). Subjects then received positive or negative 
feedback. There were no between-condition differences in the direction of sub- 
jects’ self-presentations, but those low in initial self-esteem (measured inde- 
pendently 2 weeks earlier) presented themselves more negatively than those 
high in it when they succeeded, and most negatively when they had failed 
privately. When they expected to meet the other, however, low-self-esteem 
subjects who failed were as positive as high-self-esteem subjects, in line with 
results of a previous study. Low-self-evaluation subjects seemed to have 
shielded themselves against embarrassment by decreasing their involvement 
in the interaction, but generally subjects were more embarrassed by negative 
than by positive feedback. Low-self-esteem subjects were most embarrassed 
when they failed publicly and received negative feedback, The positivity or 
negativity of subjects’ self-presentations was not related to subsequent embar- 
rassment. Facework generally varied with differences in embarrassment, but 
under public conditions, subjects discredited and derogated others who evaluated 
them in a manner inconsistent with their self-esteem. The results generally 
support an approval rather than a consistency theory of social interaction. 


Investigations of the processes linking selí- 
erception and self-evaluation to self-presen- 
tion and reactions to feedback from others 
lave generally relied on one of two explana- 
concepts: cognitive “consistency” or so- 
dal “approval.” The present study was 
undertaken to compare the ability of these 
explanations to account for variations in self- 
‘Presentation, embarrassment, and facework. 

_ The main assumption underlying the ap- 
- proval interpretation is that people desire 
- approval from others. According to this view, 
all persons, regardless of their level of self- 
esteem, will present a positive self-picture to 
others to solicit their approval. However, we 
‘Suggest that since those who present them- 
es may be asked to defend their claim 
approval should they ever confront the 
| Tecipient of their presentation, the anticipa- 
lion of future interaction may be crucial. 


‘The authors would like to thank Ed Krupat and 
Yan Ayers for help with the computer analyses. 

ests for reprints should be sent to Ronald 
Cohen, who is now at the Department of Psychol- 
Bennington College, Bennington, Vermont 05201. 


Thus, in a “private” situation in which future 
interaction. is not anticipated, one might 
expect people low in self-esteem, having more 
to gain by others’ approval, to risk im- 
modesty to gain it by presenting themselves 
more positively than those high in self- 
esteem. On the other hand, in a “public” 
situation in which future interaction is antici- 
pated, people low in self-esteem might worry 
about not being able to maintain their claim 
to others’ approval, and consequently present 
themselves less positively than those high in 
self-esteem who would feel confident about 
substantiating their claim to approval. This 
reasoning is consistent with Goffman's (1961) 
characterization of staking and maintaining 
claims to interaction outcomes as a major 
dilemma for participants. 

Those who employ an approval interpreta- 
tion expect embarassment to follow disap- 
proval from others (see Modigliani, 1966; 
Sattler, 1965, for summaries of this litera- 
ture). In Goffman’s terms, “face” has been 
“Jost.” Persons low in self-esteem, being more 
self-conscious and prone to expect disapproval 
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from others, should be more susceptible to 
embarrassment than persons high in self- 
esteem (Modigliani, 1966, 1968). 

Finally, the approval interpretation would 
make certain predictions concerning “face- 
work," the term Goffman (1955) used to 
describe the process of cognitive manipula- 
tion to avoid or correct a loss of face and 
the embarrassment it produces. Any or all 
of the various facework strategies (remaining 
aloof from the interaction, denying an un- 
usual occurrence, denying responsibility for 
the outcome, discrediting the reactions of 
others, or engineering a loss of face for them) 
would be employed to the extent that embar- 
rassment had been produced; that is, an 
increase in embarrassment would lead to an 
increase in facework. Thus, since it assumes 
that the face to be regained is a positive one, 
the approval conception leads one to expect 
that the person with low self-esteem who 
experiences disapproval, having farther to 
g0 to regain a positive face, will have to do 
more facework. 

The consistency explanation assumes that 
the most important characteristic of rela- 
tionships among self-related (as well as 
other) attitudes is their psychological com- 
patibility. Thus, this explanation asserts that 
a person acts to assure reactions from others 
which validate his self-conception so that 
he can avoid or reduce cognitive discrepancies 
(e.g., Secord & Backman, 1961). From such 
a view it follows that the lower (more nega- 
tive) a person’s level of self-esteem, the 
more negative is his self-presentation. Since 
a self-presentation inconsistent with one’s 
level of self-esteem is assumed to produce 
internal psychological tension and the pres- 
sure to reduce it, the presence or absence 
of expectations of future interaction with 
the person to whom a self-presentation is 
made should not affect the nature of the 
presentation, 

With regard to embarrassment, the con- 
sistency interpretation would argue that the 
amount of embarrassment a person feels after 
presenting himself varies inversely with the 
amount of similarity between his level of 
self-esteem and the reactions of others. Thus, 
someone low in self-esteem would be most 
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embarrassed by others’ app: and least 
embarrassed by their dis il, while 
someone high in self-esteem be most 
embarrassed by others’ disap; and least 
embarrassed by their appro: veutsch & 
Solomon, 1959; Heider, 195 re again, 
as long as the person kno the in- 
consistency, neither anticipa f future 
interaction nor public anno: t of the 
others’ reaction is necessa produce 
embarrassment, 


Finally, like the approva! roach, the 
consistency approach predicts increased face- 
work with increased embarrassment, but the 
embarrassment is predicted hers to follow 


inconsistent rather than disapproving evalu- 
ations. Highly embarrassed p s, either 
those low in self-esteem recei ocial ap- 
proval, or those high in self-este > receiving 
disapproval, should en; in more facework 
than persons embarra to a ie degree, 
either those low in self-esteem receiving 
disapproval, or those high sclf-esteem 
receiving approval. 

Previous results concerning presenta- 
tion, embarrassment, and facew give no 


clear indication of the relative lidity of 
the consistency and approval anations. 
Studies of self-presentation have foliowed one 
of two research traditions. In one, the sub- 
jects are typically asked to rate how often 
they have disclosed specific items of informa- 


tion about themselves, including information 
about their personalities, In three of these 
studies in which self-esteem was measured, 
the investigators assessed only the amount, 
not the positivity, of the information dis- 
closed (Fitzgerald, 1963; Mullaney, 1964; 
Shapiro, 1968). The second tradition, much 
of which is summarized in Jones (1964), has 
been concerned with the positivity of self- 
presentation, but self-evaluation and self- 
esteem have not been employed as inde- 
pendent variables. In fact, the typical 
procedure is to measure initial self-esteem 
and then discard the subjects with extreme 
scores, presumably to increase the effects of 
situationable variables. 

A recent study by Schneider (1969) pro- 
vides the single exception to this trend. 
Subjects first took a “social sensitivity” test 
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y succeeded, failed, or received 
and then were told that an 
uld attempt to assess them on 
erent aspect of social sensitivity. 
to receive his evaluation of 
f did not. As predicted, failure 
more positive in their self- 
han success subjects when feed- 
pected, while the reverse was 
iback was not expected. Within 
re-feedback condition, subjects low 
ı presented themselves more posi- 
ubjects high in self-esteem. Thus, 
ted independent effects for selj- 
essed as a relatively stable 
characteristic) and situational 
(induced by task success or 
ese results seem to support an 
'erpretation, but it is not clear 
ects anticipating feedback also an- 
future interaction. The present 
S taken without knowledge of the 
Schneider study, which at the time was an 
Unpublished doctoral dissertation, provides a 
lst of the argument by manipulating the 
Anticipation of future interaction. 
idence for the hypotheses concerning 
nent is currently inconclusive. 
Horowitz (1962) found that the number of 
embarrassing incidents remembered was 
po sitively correlated with self-esteem, and 
ince the items seemed to presume disap- 
oval, the data provide some support for 
e consistency interpretation, However, the 
z of selective recall weakens that 
Support. Modigliani (1966, 1968) has done 
the only work which investigates the effects 
Of manipulated self-evaluation and feedback 
from others on embarrassment, but since no 
Subjects received inconsistent reactions from 
; thers, the consistency interpretation cannot 
be tested. He did find that initial self-esteem 
Was negatively correlated with experimentally 
Produced embarrassment, but only one of 
four Correlations was significant. 
Finally, evidence concerning the relative 
ility of the two interpretations to account 
pr variations in facework is inconclusive. In 
ne with consistency predicts, Deutsch and 
lomon (1959) found that females (but not 
Males) who failed an anagrams test pre- 
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ferred others who evaluated them negatively 
to those who evaluated them positively. 
Weist (1965) obtained similar results for 
self-esteem, as did Harvey and Clapp (1965), 
who found that subjects low in self-esteem 
and expecting low ratings from others re- 
acted more negatively to unexpectedly un- 
favorable ratings from others, but also re- 
acted less positively to unexpected favorable 
ratings. Jones (1966) failed to replicate these 
results, but Jones and Schneider (1968) 
did replicate the results with subjects who 
were certain of their self-evaluations. On the 
other hand, and more in keeping with ap- 
proval predictions, Dickoff (1961) found 
that regardless of their initial self-evaluations, 


the subjects were simply more positive towar » 


others who were more positive toward them. k 


Finally, Walster (1965) found that subjects 
initially low in self-esteem were more positive 
toward a more positive other. 

In the present study, self-presentation, em- 
barrassment, and facework were investigated 
in a design where subjects differing in self- 
evaluation presented themselves to others 
under conditions differing in anonymity and 
the anticipation of future interaction. Sub- 
jects then actually received approval or dis- 
approval, allegedly from those to whom they 
had presented themselves. An approval inter- 
pretation leads to the prediction that subjects 
will present themselves favorably to others, 
and that subjects low in self-evaluation will 
present themselves more favorably than those 
high in self-evaluation. When subjects antici- 
pate having to interact further with others, 
however, low-self-evaluation subjects will pre- 
sent themselves more negatively than those 
high in self-evaluation. Low-self-evaluation 
subjects should be more embarrassed by dis- 
approval, embarrassment should be greater 
when anonymity and brevity are lacking, and 
low-self-evaluation subjects should engage in 
more facework. On the other hand, a consist- 
ency interpretation would predict differences 
in the direction of self-presentation to follow 
differences in self-evaluation, such that low- 
self-evaluation subjects would present them- 
selves less favorably than high-self-evaluation 
subjects, with the anonymity and brevity of 
the interaction making little difference. Low- 
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self-evaluation subjects would be more embar- 
rassed by approval and less embarrassed by 
disapproval than high-self-evaluation subjects, 
and differences in the use of facework should 
follow these differences in embarrassment. 


METHOD 
Subjects 


Ninety-six female undergraduates at the Univer- 
sity of Michigan served as subjects as part of the 
requirements of their introductory psychology 
course. Subjects participated in groups of four, five, 
or six, but were separated in individual rooms. 


Design 


The experimental design was a 2 X 2 X 2 factorial, 
the independent variables being self-evaluation (high 
or low), public versus private conditions of self- 
presentation (identity disclosed and future inter- 
action anticipated versus identity not disclosed and 
no future interaction anticipated), and nature of 
feedback from others (positive versus negative). 
Experimental rooms were assigned condition num- 
bers, the order in which conditions were run being 
random. The condition in which individual subjects 
participated was determined by which room they 
entered. The illusion that choice of room was im- 
material was necessary to avoid suspicion that the 
rooms contained different materials. 


Procedure 


Several weeks before participating in the experi- 
ment all subjects were notified by mail and asked 
to send in background information. Items relevant 
to the experiment were two Measures of self-esteem, 
both short to permit reassessment during the experi- 
ment. These two measures differed in format, how- 
ever, one being of the semantic differential type 
(Aiken, 1965), while the other simply asked the 
subjects how they felt about themselves in general 
(Rosenberg, 1965). 

When appointments for the experiment had been 
made, it was stressed that the experiment concerned 
impression formation. After the subjects had ar- 
rived for the experimental session and had been 
seated in separate rooms, however, they were halt- 
ingly told that while the experiment did indeed 
have to do with impression formation, the experi- 
menter first had an urgent request to make of 
them. The professor for whom he worked was 
conducting another study comparing the “cognitive 
abilities” of students in different colleges and uni- 
Versities across the nation, and would like them to 
take a test measuring “cognitive flexibility,” an 
ability highly related to general intelligence. While 
they could not get extra credit for taking the test, 
the sponsor had agreed to give the subjects some 

idea of how they had done by reporting to them 
both their own scores and the average score ob- 
tained by other students who had already taken the 
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test. In spite of occasional protestations, all 
agreed to take the test, which was adı 
Modigliani’s anagram task. Differential self-ey, 
tion was induced by (a) giving the subjects im 
high-self-evaluation condition easy 
grams and the subjects in the low-self-evaluati 
condition relatively difficult ones, and (5) giving th 
former subjects norms constructed to show the 
age score to be 2 or 3 points below their 
and the latter subjects norms 2 and 3 points 


relatively 


had done compared to the national average, 
satisfied they were with their performance, and h 
important the ability in question was, all on 
millimeter graphic scales. (The self-evaluation m 
nipulation was separated from the presentation ta 
so that the subjects would not feel obligated to 1 
it as a basis for their self-presentations.) 
The experimenter then apologized for the 
ruption and announced that the subjects 
finally undertake the task for which they origi 
had come, Before forming impressions of oi 
they should first be clear about their views of thet 
selves. Subjects then filled out a parallel form 
the Aiken self-esteem measure and the Rosem 
measure with items now in graphic form. 
measures, a reassessment of the general selí-e 
measures, mailed to subjects 2 we 
Served as a check on the effects oí the 
manipulation on self-esteem and as a source 
information about oneself to be presented to oth 
For the presentation task, the subjects were 
to place six adjectives from a list of those used 
the Aiken measure and four others of their 0 
choosing into a paragraph containing blanks. for. 
adjectives and their modifiers, By specifying 
number of adjectives to be used, we were able 
control the amount of information that the si 
jects disclosed. By restricting the modifiers to “ve 
“fairly,” and "only somewhat,’ we were able 
measure the positivity-negativity of the subje 
self-presentations. Instructions on the parag 
sheet either stressed (public) or omitted (privi 
the fact that the subjects would meet the other 
whom they were presenting themselves; and 
subjects in the public condition signed their 
to the paragraphs. Later in the experiment, à 
subjects indicated on graphic scales the likeliho 
of their meeting the other and oí her knowing 
subject's name. 
All subjects then received a description, alleg 
a self-description from another subject in 
session, constructed to be relatively neutral (a 
balance of positive and negative characteristics) 
Subjects gave their impressions of this other 
filling in a short paragraph similar to that. 
in their selí-descriptions, and then received a sin 
evaluation of themselves, allegedly from the pé 
who received their self-description. In the post 
feedback condition, the other's impression was 
positive; in the negative-feedback condition, it 
fairly negative. In each case, one of the adject 


ws either “bright” or “dull,” respectively, so that 
Be feedback would have some relationship to the 
gi-evaluation manipulation, 

Subjects’ reactions to the feedback and the pre- 
gious manipulations were recorded in two groups of 
andomly ordered graphic scales. One group had to 
db with their feelings about themselves. Two items 


(iked-disliked and approved of-disapproved of) were 
wed as a check on the feedback manipulation. Two 
items dealt specifically with embarrassment (embar- 
mssed~not embarrassed and ashamed-unashamed), 
while four others served with these two as a general 
index of (confident-shaken, tense-relaxed, 
at ease-self-conscious, and awkward-poised). The 
we of role distance as a facework strategy was 
measured by the adjectives indifferent-involved; the 
denial of responsibility for outcomes by unlucky- 
lucky. The second group of scales recorded their 
feelings about the persons who had described them. 
The discrediting of others’ impressions was tapped 
by careless-conscientious, joking-serious, superficial- 
profound, and not insightful-insightful, and. general 
derogation by unlikable-likable, cold-warm, untrust- 
Worthy-trustworthy, unimportant-important, weak- 
strong, unfair-íair, 


tension 


H RESULTS 
Manipulation of Self-Evaluation 


An analysis of variance of change scores 
on the two self-esteem measures indicated no 
significant differences between the positive 
and negative conditions, a result which is not 
Surprising, given the relative specificity of the 
self-evaluation manipulation task. What is 
surprising is the subjects’ unwillingness to 
admit satisfaction or dissatisfaction with their 
Performance on the anagram test (F = 1.58, 
dj = 1/34, ns), since they readily admitted 
the level of their performance relative to the 
| Mational average (F = 5.30, df=1/34, p< 
05). This may have been a function of the 
Subjects having been female (Schneider’s and 
Modigliani’s were male); they did rate the 
ability in question as being very unimpor- 
tant.? However, one can at least assume at 
this point that the manipulation affected the 
Subjects’ perceptions of having succeeded or 


"That performance on the anagram test was not 
l Sucia] for the general self-esteem of our female 
subjects was not an oversight on our part. We 

Covered that it did not change general self- 
‘steem in pretesting, but when we changed to norms 
on dating behavior, the manipulation was so threat- 
tning that we reverted to the anagram test and 


ped to settle for differences in situational self- 
evaluation, 
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failed. Whether their refusal to evaluate 
themselves accordingly represents a defensive 
reaction can only be determined in light of 
subsequent results. 


Public versus Private Manipulation 


Subjects in the public condition were 
clearly more likely than subjects in the pri- 
vate condition to think that they were likely 
to meet the person to whom they had pre- 
sented themselves (F = 23.48, df = 1/88, 
p<.01), and that this person would be 
likely to know their name (F = 15.06, 
df = 1/88, p < .01). 


Feedback Manipulation 


There were highly significant differences in 
the subjects’ feelings of being liked and ap- 
proved of between positive- and negative- 
feedback conditions (F = 143.72, df = 1/88, 
p< .01), indicating that this manipulation 
was also successful. 


Selj-Presentation 


One could justifiably argue that filling out 
a self-esteem questionnaire was as much a 
self-presentation for subjects in the present 
experiment as describing themselves to other 
subjects. For this reason, it was necessary 
to introduce supplementary evidence that 
self-presentation was not simply being 
correlated with itself. Evidence that the 
experimental manipulations had changed the 
subjects self-esteem would have been appro- 
priate, but such an effect was not obtained. 
A second method was to partial out self- 
esteem scores by analyses of covariance. This 
procedure was used here, all analyses reported 
having controlled for scores on both the 
Aiken and the Rosenberg measures. Since 
there were no differences in mean self-esteem 
scores or standard deviations between experi- 
mental conditions, interactions between selí- 
esteem and the experimental conditions can 
be investigated by the size and direction of 
intracell correlations. 

Mean self-presentation scores are given in 
Table 1. The score labeled “specific” is 
simply based on the subjects! use of adjectives 
related to intelligence in the paragraphs de- 
scribing themselves to others. (Three such 
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TABLE 1 


MEAN ADJUSTED AND ORIGINAL SELEF-PRESENTATION 
Scores BY CONDITION 


TABLE 


CORRELATIONS BETWEEN SrLr-I 


AND Optoma 
SELrF-PRESENTATION S 


|- Specie | General 


| Specific General 
Self-evaluation | —  — A — Self-evaluation | —— — = 
| Public | Private | Public | Private | Public | Priva Private 
High (success) | 103 | 100 | 128 | 161 High (succes) 24 04 | 4ó* | à$ 
(5.3) (5.3) | (25.9) | (27.8) 13 (t 38 AT* 
| 
Low (failure) 99 9.7 14.1 12.5 Low (failure) 34 X 4 om 
| (53) | (8| (26.0) | (28.1) —.22 H 2 sa" 
L Lum 4s tr emis k — 
Note.—The higher the mean, the more negative tlie score, . Note.—AÀ positive correlation indica he lower a sab- 
Original scores are in parentheses. ject's self-esteem, the more negat ntation. In thi 
and subsequent tables, correlations wit n measure are 
given first, then those for the Rosenberg 
*p5«.05. 
grat i 1. A ] ** p <0. 
adjectives were included in the list of adjec- 
tives from the Aiken self-esteem measure.) r z . ' 
An original score of 12 would indicate that tion, that failure subjects woud be more 
no such adjectives were mentioned in the cautious in the public condition, was clearly 
subject's description; a lower score would not obtained. Other eviden« irthermore, 
indicate that she presented herself positively. lends some support to Sche-- -r’s (1969) 
As can be seen in Table 1, the subjects in hypothesis. Correlations between self-esteem 
all conditions did present themselves posi- and self-presentation are given in Table 3. 
tively on this dimension. There was a very While there were no differen ; the spe 
slight tendency for subjects who failed to cific measure, on the general seli-presentation 
present themselves more positively than sub- measure, the subjects low in self-esteem pre 
jects who succeeded, but the analysis of sented themselves more negatively tham the 
variance summarized in Table 2 indicates that subjects high in self-esteem in al! conditions 
this effect is anything but reliable. The except the low-self-evaluation public. In 
"general" score is that for the entire self- other words, when they succeeded, low-self- 


presentation paragraph. Since a balance be- 
tween positive and negative adjectives would 
be represented by a score of 35, it is evident 
from Table 1 that the subjects in all condi- 
tions presented themselves positively, and the 
analysis of variance summarized in Table 2 
also indicates no significant differences what- 
soever between conditions, 

The hypothesized interaction between selí- 
evaluation and the public-private manipula- 


TABLE 2 


ANALYSIS OF VARIANCE Or ADJUSTED 
SELF-PRESENTATION SCORES 


Specific General 
Source df p d 
MS F MS F 
Self-evaluation (A) 1 | 7.45 | 1.26 | 2597 <1 
Public vs, private (B) 1 | 7.16 | 1.21 | 18.50 <1 
AXB 1 | 6.42 | 1.09 | 29.75 | 21 
Within groups 44 | 5.90 40.51 


esteem subjects were more modest than high- 
self-esteem subjects. However, while they 
were most self-deprecating when they failed 
and did not expect to interact further with 
others in the experiment (private condition), 
they were as positive about themselves as 
high-self-esteem subjects when they had failed 
and expected further interaction (public con- 


TABLE 4 


MEAN ADJUSTED RATINGS OF EMBARRASSMENT 
BY CONDITION 


Positive feedback | Negative feedback 


Self-evaluation |—— cian —— SEC 
Public | Private | Public | Private 

High (success) 

Specific 302 | 290 | 300 2 

General 191.8 195.0 288.1 5 
Low (failure) 

Specific 248 | 196 | 444 E 

General 307.2 à 


159.4 | 1384 


mente 


ss 
Note,—The higher the mean, the greater the embarra! 


TABLE 5 


sis OP VARIANCE or ApjusTED 
EMBARRASSMENT RATINGS 


_— 
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TABLE 7 
Mean Apyustep Ratinos or Rote Distance 
AND ATTRIBUTION TO LUCK 


| Specife General 
| af | 
|' [cs T] ce UR 
LI — PE HEC 
u v) | 1| 21622) <t | 2539200| 1.94 
^ m) 1| 626.70) 1.23 | 22114.00| 1.69 
k 1 | 3191.25 | 6.24% | 235566.00 | 18.00** 
1 | 1299.90 2.84 15515.00 | 1.19 
1|167546| 3.28 | 3871.00) <1 
1| 9714) «1| 1113100) <1 
1| 5213) <1 | 4421.00) <1 
13086.04 | 


| 88 | 1510,89 | 
| 


hus, the effect found by Schneider 
blic condition was obtained here 
xœ private, and not the public, 


ment 


atings of embarrassment as mea- 

by embarrassment-specific and general 
‘oms are presented in Table 4. There 
en that failure under public condi- 

f self-presentation and negative feed- 
roduced the most embarrassment, but 
lysis of variance indicates that there 
as ‘nificant three-way interaction ef- 
. As indicated in the summary in Table 5, 
ere was a main effect for the feedback ma- 
nipulation on the specific items (F = 6.24, 
df = 1/86, p < .05), a main effect for feed- 


TABLE 6 


CORRELATIONS BETWEEN SELF-ESTEEM AND ORIGINAL 
e EMBARRASSMENT SCORES 


Positive feedback | Negative feedback 
Self-evaluation 
M Public |Private| Public | Private 
High (success) 
- Specific —16 | .46 | .10 | .60* 
7 —.14 32 -03 :29 
General "ag ee este pU ada tae 
" 17 .39 —.05 19 
Low (failure) 
Specific gat* | 53 | —.25 | 42 
62* 36 —.04 AS 
General 73% | 36 | —25 | 36 
68* 37 00 | .46 


Note.—A positive correlation indicates that the lower a sub- 
lets self-esteem, the higher her embarrassment. 


I 
Positive feedback | Negative feedback 


Self-evaluation — 
| Public | Private | Public | Private 
High (success) | | 
Role distance 463 | 376 | sit | 41.8 
Attribution 373 | 388 | $2.2 | 30.0 
Low (failure) | 
Role distance 44.5 674 | 891 64.1 
Attribution | 37.9 | 373 | 619 | 51.3 
Note.—The higher the mean, the REES ade distance 


and attribution, 


back on the general index (F = 18.00, 
dj = 1/86, p < 01), and a weak interaction 
between feedback and self-evaluation (F = 
3.28, df= 1/86, p<.10). Subjects were 
more embarrassed by negative public condi- 
tions in the first instance, and by negative 
feedback in the second. 

Product-moment correlations between selí- 
esteem and embarrassment under each of the 
experimental conditions are given in Table 6. 
They are remarkably large and consistent, 
given the extremely small ns (11 or 12). The 
pattern conforms to neither of the hypotheses 
formulated earlier, however, although it seems 
to incorporate some elements of both. In line 
with consistency notions, the lower the sub- 
jects’ self-esteem, the more embarrassed they 
are by positive feedback when they have 
failed, but this is true only in the public 
condition. The greater embarrassment of 
low-self-esteem subjects who have succeeded 
and received negative feedback is contrary to 


TABLE 8 


ANALYSIS OF VARIANCE OF ApjusTeD ROLE DISTANCE 
AND ATTRIBUTION Scores 


Role distance Attribution 


Source df 
MS F MS F 

Self-evaluation (A) 1|5100.10/6,90**| 162.90| <1 
Public vs. private (B) 1| 14135| <1 209.23} <1 
Feedback (C) 1| 611.73) «1 | 6020.11 | 25.11** 
AXB 1| 3044.16 | 4.12* 162.15| <1 
AXE 1 17.30) <1 205.24 <1 
BXC 1} 50514| <1 273.96 | 1.14 
AXBXC 1| 432.09) <1 155.43 <i 

Within groups 88 | 738.84 239.76 

*p <.05, 
Dy < 01 
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TABLE 9 


MEAN Apvjustep DISCREDITING AND 
DEROGATION Scores 


Positive feedback | Negative feedback 
Self-evaluation — 


Private | Public | Private 


| Public 
High (success) 
Discrediting 188.9 | 165.6 | 222.6 | 206.6 
Derogation 237.9 | 211.6 | 3248 | 323.9 
Low (failure) 
Discrediting 1544 | 212.8 | 271.5 | 227.6 
Derogation 198.9 | 2480 | 375.3 | 345.8 


Note.— The higher the mean, the greater the discrediting and 
derogation, 


the consistency approach, but its confine- 
ment to the private condition also makes 
it difficult to interpret in terms of the 
approval approach. 


Facework 


Table 7 indicates between-condition differ- 
ences in the use of role distance (uninvolved— 
involved) and the attribution of outcomes to 
chance (unlucky-lucky) as facework strate- 
gies. The analyses of variance are summarized 
in Table 8. There was a significant main effect 
of self-evaluation on role distance: the sub- 
jects who failed claimed to be less involved 
in the interaction than those who succeeded 
(F = 6.90, df= 1/86, p<.01), especially 
when they expected the interaction to con- 
tinue (for the Self-Evaluation X Public- 
Private interaction: F-412, df= 1/86, 
$ < 05). Whereas failure seems to have in- 
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creased the use of role distance as a facework 
strategy, only feedback from others affected 
the attribution of outcomes to chance. Spe- 
cifically, the main effect for feedback in 
Table 8 indicates that subjects who received 
negative feedback were more likely than sub- 
jects who received positive feedback to feel 
unlucky (F = 25.11, df= 1/86, p< 0t). 
Initial self-esteem had no systematic effects 
on either role distance or attribution. 

Means and summaries of analyses of vari- 
ance for discrediting and derogating others 
are provided in Tables 9 and 10, There it can 
be seen that negative feedback from others 
brought more discrediting than positive feed- 
back (F = 14.10, df = 1/86, p < .01). Sup- 
porting an approval interpretation is a 
three-way interaction between self. -evaluation, 
public-private, and feedback, such that least 
discrediting occurs when subjects have failed 
under public conditions and receive positive 
feedback, and most discrediting when sub- 
jects have failed under public conditions 
and receive negative feedback (F = 4.18, 
df = 1/86, p<.05). The same ordering 
occurs with derogation, but here only the 
feedback effect is significant (F = 43.61, 
df = 1/86, p < .01). 1 

A straightforward approval interpretation 
is hampered, however, by the effects of self- 
esteem on discrediting and derogating others, 
the product-moment correlations within ex- 
perimental conditions being given in Table 11. 
These correlations indicate that the lower the 
subjects’ self-esteem, the more they discredit 
and derogate others who evaluate them nega- 


TABLE 10 
ANALYsIs OF VARIANCE or ADJUSTED DISCREDITING AND DEROGATION SCORES 


3 Discrediting Derogation 
Source df 1 

Ms F MS E. 
Self-evaluation (A) 1 s <1 
Public vs. private (B) 1 Aye n ea «1 
Feedback (C) 1 62414.06 14.109 316249.50 43.61 
AXB 1 4205.44 1.01 3433.50 <1 
AXC 1 4856.75 1.17 7594.50 1.11 
B C M 1 13307.75 3.20 5521.50 <i 
AXBXC 1 17383.75 4.18* 14614.50 2.08 

Within groups 88 4162.26 7504.55 


tively. Notice, however, that while this sup- 
ports the consistency approach, correlations 
are only significant in the public conditions. 


Selj-Presentation, Embarrassment, and 
Facewor 


The correlations in Table 12 indicate that 
the positivity or negativity of the subjects’ 
self-presentations had no effect on either em- 
barrassment or facework, as do the correla- 
tions within experimental conditions. Gen- 
erally, the greater the embarrassment that 
the subjects experienced, the more facework 
they engaged in, as would be predicted 
by both the approval and consistency ap- 
proaches. An important exception is role dis- 
tance, however, which varies inversely with 
embarrassment and hardly at all with two of 
the other three facework strategies. 


Discussion 


The failure to obtain significant between- 
condition differences in self-presentation is 
difficult to fathom. Since the subjects ac- 
cepted the differential perceptions of success 
and failure implied by the self-evaluation ma- 
nipulation, one would expect either a replica- 
tion of Schneider’s (1969) results or the hy- 
pothesized interaction between self-evaluation 
and the public-private manipulation. The 
failure to obtain either could be a function of 
“noise” in our self-presentation instrument. 
Subjects seemed to have little trouble using 
it, however, and discussions with them sug- 
Sest that it was a meaningful task. Other 
Possibilities are that success or failure on 
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TABLE 11 


CORRELATIONS BETWEEN SELF-ESTEEM. AND ORIGINAL 
DISCREDITING AND DEROGATION SCORES 


| Positive feedback | Negative feedback 
Self-evaluation — $ ———— 
Public 
High (success) 
Jiscrediting | .72* 
49 
Derogation 75** 
55 
Low (failure) 
Discrediting 59° 
52 
Derogating .67* 
.57* 


a These two miss being significant at the .05 level by .008 
and .005, 


achievement tasks is much less important for 
females than for males, or that differences 
between success and failure subjects were 
obscured by defensiveness on the part of 
those who failed. The near replication of 
Schneider’s results when initial self-esteem is 
taken into account is consistent with both. 
The greater modesty of low-self-esteem sub- 
jects who succeeded is an interesting effect 
which was not investigated in Schneider’s 
study. These subjects, being more “deprived” 
of approval, would have more to lose by 
immodesty. 

The key to an understanding of subjects’ 
reactions to failure would seem to lie with 
the tendency for the subjects who failed to 
establish greater role distance, and with the 
inverse relationship between embarrassment 


TABLE 12 
CORRELATIONS AMONG SELF-PRESENTATION, EMBARRASSMENT, AND FACEWORK STRATEGIES 


Variable 1 | 2 | | 4 | 5 | 6 | 7 
d. Presentation; — 
2. Presentations 04 — 
4 Embarrassment, 01 Fe m 
» “mbarrassment 403 . . em 
- Role distance s —.00 —.09 —.28** —.31** = 
. Attribution .00 —.05 1 40** —.05 — 
» Discrediting 08 -06 : 25% mate e -- 
» Derogation 13 .06 226 A9** 10 A2** wom 


Note.—Subscript 1 refers to specific, Subscript 2, to general. 
4b «.05 


> c 0t. 
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and role distance. Gofiman (1961) clearly 
assumed role distance to be primarily a pre- 
ventive facework strategy; that is, by re- 
maining aloof from the interaction, a person 
shields himself from others’ disapproval and 
thus prevents embarrassment. Although we 
have no way of checking on the time se- 
quence in which the various facework strate- 
gies were used, since actual feedback did not 
affect role distance, it seems reasonable to 
assume that the subjects who failed, and par- 
ticularly those who expected to have to face 
others (see the Self-Evaluation X Public- 
Private interactions in Tables 7 and 8), de- 
fensively detached themselves from the situ- 
ation so as to lessen the embarrassing effects 
of anticipated disapproval. That this strategy 
worked is suggested by the large negative 
correlations between role distance and em- 
barrassment; that is, the more role distance 
that the subjects established, the less embar- 
rassed they were by disapproval when they 
actually received it. 

In general, however, the subjects were more 
embarrassed by disapproval than approval. 
These results were not confined to conditions 
where the subjects were identifiable and an- 
ticipated interaction with their evaluator, but 
they are generally in line with an approval 
interpretation. Such an interpretation is 
strengthened by the fact that the negativity 
of self-presentation was unrelated to subse- 
quent embarrassment (Table 12), and by the 
fact that subjects were more likely to attrib- 
ute outcomes to luck, to discredit the other’s 
evaluation, and to derogate her when they 
received negative than when they received 
positive feedback. 

Self-evaluation had interesting effects on 
embarrassment and facework in the present 
study. Aside from the differences in role dis- 
tance, the only evidence for a strictly situa- 
tional effect is the greater discrediting of 
negative evaluators by low-self-evaluation 
subjects in the public condition (see the 
three-way interaction in Tables 9 and 10). 
However, other interesting results were ob- 
tained when initial self-esteem was taken into 
account (see Procedure section). The tend- 
ency for low-self-esteem subjects to be more 
embarrassed by public failure and positive 
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feedback, and private success and negative 
feedback (Table 6), is compi ed, but still 
interpretable by approval no *. It is rea- 
sonable to assume that thc ‘bjects were 
most likely to expect to a € Or retain 
others’ disapproval in these two conditions, 
Specifically, the subjects in failure/pub- 
lic/positive condition knew t while the 
other had evaluated them posi ely initially, 
she was bound to learn that : subject had 
failed and change the eyaluati Subjects in 
the success/private/negative co: ion, on the 
other hand, knew that they would not meet 
the other who had evaluated them negatively 
and hence would not be able to change the 
evaluation, even though they could have made 


a firm claim for approval beca.:< they had 
succeeded. Since the approval -»proach as- 
sumes that low-self-esteem subj are more 
concerned about others’ disapproval, it should 


follow that low-self-esteem subjects would be 
more embarrassed by these conditions than 
high-self-esteem subjects, 

That low-self-esteem subjects 
likely to discredit and derogate others who 
evaluated them positively, and less likely to 
discredit and derogate others who evaluated 
them negatively (Table 11), is more difficult 
to interpret within an approval framework. 
While these results imply a consistency in- 
terpretation, as do similar results obtained 
by Jones and Pines (1968) with manipulated 
self-esteem alone, the confinement of signifi- 
cant correlations to the public conditions sug- 
Eests that something else was involved as 
well. A plausible interpretation is that the 
subjects desired consistency between their 
self-esteem and others' initial evaluations of 
them so that they could better orient their 
behavior toward getting the other to like 
them when they actually met. In a public 
situation, the subjects derogate others when 
those others do not evaluate the subjects on 
the basis of their performance. There may be 
à perceived norm to reject praise, as well as 
criticism, that is unjustified on the basis of 
previous performance. Conformity to this 
norm may in itself be a source of public 
approval. On the other hand, in the private 
situation, the norm that requires one to reject 
unwarranted praise is less operative because 


vere more 


there is no 
acy, that 
cerning re 


one to enforce it. Thus, consist- 
is, conformity to expectations con- 
ons to evaluations, may be used 


as a tactic for approval (cf. Jones, 1964, 
who points to a similar phenomenon in the 
strategic of conformity as a tactic of 
ingratiation). 

This explanation deserves further attention 
because these particular results leave doubts 


concerning the very question that the present 
study was designed to answer: whether and 
under what conditions people want positive 
evaluations from social interactions, or evalu- 

tent with their self-conceptions. 


ations cor 
Future studies might have the subjects actu- 
ally meet their evaluators. If the present 
explanation is correct, previous inconsistency 
should be debilitating, but subsequent ap- 
proval from the evaluator should change the 
subjects’ evaluations of him from negative 
to positive. 
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one’s break-even point were optional and could be lies. The design wasa3 X 2 x 2 


by the accomplice—$30, $40, or $50; 


stated her break-even point—1 in 10 offers or 3 in 10 offers; (c) bargaining order > | 
the subject went first or last on all 20 rounds. The more extreme the accomplice's 
statement, the more favorable to the accomplice the subject's last offer, the more ] 
often the subject agreed to the accomplice's demands, and the fewer the number 

of offers in the bargaining. The frequency of statements had no effect, In those 
sessions lasting the full 20 rounds, the subject yielded more often when going last 


rather than first. 


The research on bargaining strategy has 
seldom focused on the statements that bar- 
gainers make to each other about their con- 
cession limits (i.e, the least favorable offer 
they can accept). Ikle and Leites (1962) 
maintained that such statements are an im- 
portant part of the process of political nego- 
tiation. They suggested that it would be 
beneficial if I could modify my opponent’s 
estimate of my concession limit (called “mini- 
mum disposition” by them) so as to “make 
him believe that the terms at which I would 
prefer no agreement are, in fact, less favorable 
to him than he had first thought [p. 23].” 
According to Ikle and Leites, one way this 
might be achieved is "By convincing my 
opponent that it would be disastrous or im- 
possible for me to agree to less than my 
Proposed terms [p. 24]." Convincing my 
opponent that I cannot accept less than a 
certain amount should be beneficial in eco- 
nomic as well as political bargaining. 

Often, experimenters have designed their 
studies so as to prohibit statements about one’s 


1 The research was supported by Indiana University 
Faculty Research Grant 10-441-21. The authors are 
indebted to Joan Capo, Carol Cohen, and Matie 
Flowers for their assistance in conducting the experi- 
ment, and to William Shaffer for his help with the data 
analysis, 
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concession limit. In other experimental designs, | 
the bargainers know each other's concession | 
limit, so the strategy of lying about your con- 
cession limit is not a reasonable one to use. In 
their bargaining experiment, Kelley, Beckman, 
and Fischer (1967) kept the bargainers igno- 
rant of each other's concession limit (called 
"minimum necessary share") and allowed lying 
about the concession limit. The bargainers had 
to agree on the division of 9 points, with the 
profit obtained by a bargainer being. the 
difference between his share in the negotiated 
agreement and his minimum-necessary share. 
Kelley et al. found that when the minimum 
necessary share values of the bargainers were 
unequal, the low-minimum-necessary-share 
person tended to lie more than the high- 
minimum-necessary-share person. Their data 
also revealed that the high-minimum-neces- 
Sary-share person overestimated the low- 
minimum-necessary-share person's true D) 
mum necessary share, and the low mo 
necessary-share person made the greater pro! 3 
Perhaps, then, lying about one's con" 
limit is a successful bargaining strategy. 


the other hand, it may have been something 


else the low-minimum-necessary-share person 
did which caused his success. EB 
How can I get my opponent to believe e 
my concession limit is what I say it is? In ke 
Study we tested the applicability of the e 
technique to statements about one's concess 


i AC E pna = Ste. * > 


nit. The more I exaggerate my concession 
mit statements and the more frequently I 
iterate my statements, the more successful 
Will be in bargaining. 

In addition to testing this hypothesis, we 
e interested in testing the hypothesis that 
if a bargainer had the last clear chance to 
gree or not, he would be more likely to yield 
lo the demands of his opponent than if he were 
mot in such a position. Schelling (1963) stated 
that 


In bargaining, the commitment is a device to leave the 
last clear chance to decide the outcome with the other 

y, in a manner that he fully appreciates; it is to 
inquish further initiative, having rigged the in- 
tives so that the other party must choose in one's 
or [p. 37]. 


METHOD 
Subjects 


- The subjects were 180 female students enrolled in 
introductory psychology lecture and laboratory courses 
it Indiana University. Fifteen subjects were randomly 

to each of the 12 experimental conditions. 
in experiments was a course requirement. 


The procedure is a variation of the one used in 
Experiment I of Kelley et al. (1967). The experiment 
as a bargaining session, allegedly between two subjects 
Who were to bargain over the division of a hypothetical 
$90. In all cases, however, one bargainer was actually 
& female accomplice who bargained according to one of 
‘Six predetermined strategies. 
The bargainers were seated in opposite booths, sepa- 
ited by a partition so they could not see each other. 
There was a small slot in the partition through which 
‘they could pass the bargaining sheet. 

After the bargainers had been seated, they were 


— This experiment is designed to test your bargaining 
skill. You will bargain with the other person over the 
division of $90, 
—— At the beginning of the bargaining session, each of 
= you will draw a card indicating your break-even 
point. Let me explain the significance of your break- 
© &Ven point. If you drew a card with 45 on it, you 
- would have to obtain greater than $45 in the bar- 
- Eaining in order to make a profit. Your profit will be 
determined by subtracting 45 from the amount you 
_ obtain in the bargaining. If you reached an agreement 
- of $50 for you and $40 for the other person, your 
Profit would be $50 — $45 = $5. Your break-even 
Point, then, is the amount you have lo exceed in order to 
make a profit, and the more you exceed your break- 
. Ven point, the greater the profit to you. You will 
= hever accept less than your break-even point, for 
that would mean a loss to you. 


Avrticasnury or re Bro LE TrcHWIQUE 


The break-even points of you and the other person 
will not sum to more than 90, though they may sum to 
less. In order to ensure that they sum to no more than 
90, the following procedure will be used. First one of 
you will draw from a box containing two slips for 
each possible number from 0 to 90 in increments of 5. 
In other words, 0, 5, 10, ..., 90 will each be on two 
slips. After one of you has drawn, the experimenter 
will arrange a box for the second person. Then the 
second person will draw from a box with all the 
possible numbers in increments of 5 between 0 and 
the number which when added to the number drawn 
by the first person, equals 90. For example, if the 
first person drew 45, the second person would draw 
from a box containing 0, 5, 10, ..., 45. Enough slips 
of each number will be placed in the second box to 
ensure that the second person will have no idea from 
the quantity of slips whether the other person drew 
a high or low number. The second person will know, 
just as the first does, that the sum of the two break- 
even points cannot exceed 90, though it may be 
less. 

All offers and comments will be made on a paper 
and passed between you. On the bargaining sheet, 
you will write your offer and a comment, if you wish 
to make one. The only comment you will be allowed 
to make is to tell the other person your break-even 
point. You do not have to be honest about it. Jf you 
want to tell him that your break-even point is something 
other than what is really is, you may. Likewise, if he 
tells you what his break-even point is, he may be 
telling the truth or he may not. 

You will bargain alternately, with 1 minute allowed 
each time to fill in your offer and comment. 7/ should 
be perfectly clear that you never have to tell the other 
person your break-even point if you do not want to and 
you do not have to change your offer when it is your turn. 
You may continue to offer the same amount for as 
many successive times as you wish. However, you 
may not bargain in bad faith, meaning, to lower the 
amount for the other person and raise the amount for 
yourself, For example, once you make an offer of $45 
for yourself and $45 for the other person, you can 
never make a later offer of more than $45 for yourself 
and less than $45 for the other person. That would be 
withdrawing your offer or bargaining in bad faith. 
You have only two options, then, on each turn, You 
can continue to offer the same amount or you can 
raise the amount for the other and lower your own 
amount. Remember, the total amount to be divided 
is always $90. 

The bargaining will continue for a maximum of 20 
rounds, with both of you making an offer on each 
round. Zf no agreement is reached by the end of the 
twentieth round, the bargaining is over and neither of 
you will make any profit. 

This experiment is designed to test your skill in 
bargaining. Make every effort to get the greatest 
profit for yourself that you can. 

Tf there was anything about the instructions which 
you did not understand, please take as much time as 
you want to reread them, If something is still unclear, 
please ask the experimenter about it. 


300 
TABLE | 
Apyustep Means or Susject’s Lasr OFFER 
mremrsverroint Bargaining order 
er i 2 Total 
Extremity | Frequency | Subject first | Subject last 
50 3in 10 37.72 40.26 | 39.62 
| 1in10 | 40.12 40.38 
40 3in10 43.39 42.34 42.53 
lin 10 43.10 41.27 
I 
30 | 3in10 43.72 42.16 43.44 
lin 10 43.55 44.31 
Total | 41.93 41.79 


Note.—Numbers represent the amount demanded by the 
subject for herself. The total for 3 in 10 was 41.60, and the total 
for 1 in 10 was 42,12. 


When the subject had finished reading the instruc- 
tions, the experimenter assigned each bargainer a letter, 
A or B. Half of the time, the real subject was assigned 
the letter A and began the bargaining. Half of the time, 
the accomplice was assigned the letter A and went first. 
Since the bargainers alternated offers, if the bargaining 
went the full 20 rounds (20 offers by each), the person 
not opening would ultimately be in the position of either 
accepting the other's last offer or determining that no 
agreement was reached. 

The experimenter then showed the bargainers where 
on the bargaining sheet offers and break-even-point 
statements were to be written. The bargaining sheet 
consisted of two pieces of paper with the numbers 1-10 
on the left margin of the first sheet and 11-20 on the 
second sheet and blanks for offers and break-even-point 
statements, 

The break-even points were then distributed, The 
real subject was always asked to draw first from a box 
with folded slips of paper. The subject was led to believe 
that these slips contained numbers ranging from 0 to 90. 
Actually, all slips contained the number 30, so all 
subjects had a break-even point of $30, The subject 
was then asked to show the experimenter her break-even 
point so the experimenter could make up a box of break- 
even points for the other bargainer in such a way that 
the two break-even points would not sum to more than 
90. The experimenter made up this box within sight of 
the real subject and then presented it to the accomplice, 
who was asked to select one. Subject A was then told 
to begin the bargaining. The bargaining was always 
to be in amounts divisible by the number five, (A pilot 
study had shown that in the rare instances where one 
bargainer conceded in smaller than $5 increments, the 
other usually responded in kind, making an agreement 
within 20 rounds extremely unlikely.) 

With regard to opening offer and concessions, the 


accomplice bargained with all subjects in the following 
way: 


Opening offer. The accomplice’s first offer was $70 
for herself and $20 for the real subject. According to 
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data from Experiment I in Kelle 1967), where 
9 points were to be divided, the opening offer 
ranged from about 6.7 for peo; i break-even 
point of 3 up to about 7.1 for somec: a break-even 
point of 5. A pilot study using our : lure yielded 
opening offer data similar to tha ley et al. An 
opening offer of $70-$20, then, « ot seem ex. 
tremely high or low in any of the ms regarding 
what the accomplice later maintain 15 her break- 
even point, 

Number of concessions. Kelley ct | not present 
data on number of concessions, but i lot study, we 
found an average of about 2 conces per 10 offers. 
The numbers 2-10 were placed in on: ind 11-20 in 
a second box, and two numbers we 


vn from each 
s determined 
3, 9, 11, 


box. As a result of this procedure 
that the accomplice would concede o: 
and 16. 


Magnitude of concessions. Kelley reported that 
in Experiment I the magnitude of concession from first 
offer to settlement was about 2.2 for someone with a 
minimum necessary share of 3 and yped to about 1.5 
for someone with a minimum necess: are of 5, We 
found comparable data in our pilot dy. It was 
decided, therefore, to hold constant t tal amount 
of concession at $20 in our experiment. Every concession 
was $5, since this was the usual n de of con- 
cessions by subjects in the pilot study. 1 

The accomplice employed one oí six strategies in- 
volving here break-even-point statements. She con- 
sistently maintained that her break-even point was 
either $30, $40, or $50. A cross-cuttir riable was 
the frequency with which the accompi reated her 
break-even point. She either reported it once every 10 
rounds or three times every 10 rounds. The rounds 


within each 10-round block for making the statements 
were determined by drawing numbers from two boxes, 
one with Slips 1-10 and the other with Slips 11-20. The 
rounds were 6 and 17 in the 1 in 10 conditions and 5, 8, 
10, 11, 14, and 17 in the 3 in 10 condition. 

The bargaining was finished when there was agree- 
ment on the division of the $90 or when the 20 rounds 
were over. 


Design 


The design was a 3 X 2 X 2 factorial design with the 
factors being (a) extremity of break-even-point state- 
ments by the accomplice—$30, $40, or $50; (b) fre- 
quency with which the accomplice stated her break-even 
point—1 in 10 offers, or 3 in 10 offers; (c) bargaining 
order—the subject went first or last on all rounds. 


RksurLTS 
O peniug Offer 


À3X2X2 analysis of variance was per- 
formed on the data for the subject’s opening 
offer. The only experimental variable manipu- 
lated up to this point was whether the D 
went first or last, since the accomplice ha 
made no statement about her break-even point 
before the subject’s first offer, There was n9 


* 


for the bargaining order factor, 
ant interaction was found between 
rder and extremity of break-even- 
ments (F = 520, p < .01). Since 
tv variable was not manipulated 
nificant interaction could be due 
failure in the attempt to obtain 
groups with respect to opening 
andom assignment of subjects to 
Since the subjects were randomly 
ialysis of covariance is appropriate 
statistically for opening offers. All 
analyses, therefore, were 3 X 2 
es of covariance. 


isted cell means for the subject’s 
re given in Table 1. The last offer 
‘ect could be on the twentieth round, 
id be earlier if an agreement were 
"efore the last round. The only sig- 
ect (F — 9.85, p « .001) was for 
of the break-even-point statement. 
c extreme the accomplice's break-even- 
iement, the more favorable to the 
accomplice was the subjects last offer. 


Number of Agreements 


Getting the subject down to terms more 


favorable to the accomplice would have been 


à hollow triumph if that offer were still short 
of the demands of the accomplice. If an agree- 
Ment occurred, a score of 1 was given; if no 


- agreement occurred, a score of 0 was given. 


The adjusted cell means for the number of 
agreements is given in Table 2. There was a 
Significant main effect for extremity (F = 8.55, 
b < 001). The more extreme the accomplice's 
Statement, the more often the subject yielded 
to the demands of the accomplice. 

A significant main effect for bargaining order 
Was also obtained (F = 7.08, p < .001). When 
the subject went last, there was a greater 
number of agreements than when the subject 
did not go last. This finding does not un- 
€quivocally demonstrate the pressure to yield 
resulting from being the one with the last 
Clear chance to determine whether or not an 
agreement is reached. The difference could 
have resulted from behavior prior to the last 
Tound. Collapsing data across frequency and 
Extremity of break-even-point statements, we 
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TABLE 2 
ADJUSTED MEANS OF NUMBER OF AGREEMENTS 


Break-even-point 


statement Bargaining order 


aS) z 14 Mu 
Extremity | Frequency | Subject first | Subject last 
50 | 3in10 | 8841 | 8506 
Lin 10 | -7257 ‘g174 | “S195 
| 
40 | 3in10 | .3965 | .7380 | 
| in 10 | ‘6550 | "7361 | 98 
| 
| | | 
30 3in10 | .3965 7384 
| findo | 3255 $02 | 19" 
Total | .5639 3361 | 


Note,—A score of 1 was assigned when an agreement was 
reached; a score of 0, when there was no agreement. The total 
for 3 in 10 was .6674; and the total for 1 in 10 was .6326, 


found that when the subject went first, 54 of 
the 90 negotiations lasted until the final round, 
and when the subject went last, 51 of the 90 
negotiations lasted until the final round. Ob- 
viously, bargaining order had no effect until 
the last round. The 105 cases lasting until the 
final round are critical for testing the effect of 
being the bargainer with the last clear chance. 
When the subject went last, she yielded to the 
demands of the accomplice in 28 cases and did 
not yield on 23. When the subject was not the 
last to make an offer, she yielded 15 times and 
did not yield 39 times. A fourfold contingency 
table was formed from the above data, and a 
significant chi-square was obtained (X? — 7.98, 
p < 01). When the subject had the last clear 
chance, she was more likely to yield. 


Tolal Number of Offers 


The total number of offers by both bar- 
gainers (maximum possible was 40) reflects 
the speed with which the subjects yielded. The 
only significant effect was a main effect for 
extremity (F = 5.92, p< .01). The more 
extreme the statement of the accomplice, the 
fewer the number of offers in the bargaining 
(see Table 3). 


DISCUSSION 


'This experiment was designed to test the 
applicability of the big lie technique to bar- 
gaining. The more extreme the accomplice's 
statement about her break-even point, the 
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TABLE 3 
ADJUSTED MEANS OF TOTAL NUMBER OF OFFERS 


Jerome M. CHERTKOFF AND SUZANNE L. BARD 


Break-even-point 


ini der 
statement Bargaining or 


- et A Total 
Extremity | Frequency | Subject first | Subject last 
50 3in 10 30.68 33.95 33.89 
lin 10 35.04 35.88 
40 3in 10 3741 34.83 35.49 
Lin 10 34.65 35.08 
30 3in 10 35.28 37.82 37.15 
lin 10 37.64 37.86 
"Total 35.12 35.91 


———————És1 2.4 


Note.—The total for 3 in 10 was 35,00; and the total for 1 in 
10 was 36.03, 


more successful was the accomplice. The 
opponent yielded more, the opponent was 
more likely to agree to her demands, and the 
bargaining was concluded in a fewer number 
of offers—all this in a situation where the 
opponent was told that lying was permitted 

and that statements about break-even points 
might be lies. 

Why was the extreme break-even-point 
Statement successful? Several interpretations 
are possible. Perhaps it was successful because 
the bargainer who made such a statement was 
using for his own benefit the responsiveness of 
the other bargainer to Homans? (1961) norm 
of distributive justice or Adams’ (1965) equity 
principle. According to Homans or Adams, 
rewards should be divided in proportion to 
costs or investments. Viewing the break-even 
point in this experiment as equivalent to cost 
or investment in the Homans or Adams sense 
may not be justified. The break-even point 
merely reflected a base line from which profit 
was to be calculated, and there was no attempt 
in the instructions or procedure to relate this 
base line to costs or investments. Perhaps the 
view of most subjects was that the break-even 
point was merely a base line determined by 
chance and that all bargainers were equal in 
Costs or investments. If this view prevailed, 
then the fair solution according to distributive 
Justice or equity principle would be to divide 


profits equally, not in proportion to the break- 
even points, 


Schelling (1963) has suggested that 
gaining, a fair solution may be reac] 
mainly because people want to be fi 
because the solution provides a focal poi 
situation which lacks one. 


Even in those cases in which the only distingu 
characteristic of a bargaining result is its 
"fairness," by standards that the particip 
known to appreciate, we might argue that the 


force of fairness is greatly reinforced by the po 
"fair" result to focus attention, if it fills the va 
indeterminacy that would otherwise exist [pp. 


Most subjects did not immediately agr 
either a split-the-profit solution or a 
based on rewards being proportional to 
even points once the accomplice sta 
break-even point. Since the veracity t 
statement was in doubt, perhaps this is i 
surprising. Eventually, though, enough 
jects may have placed enough credence in 
statement to let it affect their judgments 
cerning an equitable agreement. 

An alternative interpretation of the 
tiveness of the break-even-point state 
would be in accordance with the Siegel 
Fouraker (1960) conclusion that bargai 
behavior is affected by one’s level of aspir itio 
If I tell another that my break-even point 
$50, he may believe that I will be very: 
luctant to yield from a demand of, say, 
1f, on the other hand, I tell him that my b 
even point is $30, he may expect that I 
quite likely to yield further from a deman 
$55. In the latter case his level of aspirati 
would be higher, and, therefore, he would b 
less likely to yield. 

Varying the frequency of statements al 
the break-even point had no eflect, either 
or in interaction with the other varial 
Repetition is a widely accepted propa: 3 
technique, as Brown (1963) indicated: “4 
propagandist is confident that, if he repeal 
statement often enough, it will in time come 
be accepted by his audience [p. 2]." Of co 
our experiment differs widely from the si 
ation confronting the propagandist, partict 

larly if he is using the mass media. Repetit 
may be a good technique in mass n 
persuasion where the number of people expo 
to any single message presentation is usual, 
small part of the population one wants to T 
where the amount of information in a S 


message may be great enough to cause even 
| an attentive person to miss certain points, and 
where considerable forgetting may occur in the 
time interval between the message and the 
occasions where the behavior advocated in the 
message would be performed. Furthermore, 
if the repetition is by different people, a 
person’s certainty about the statement may be 
increased. Chalmers (1971) found this for 
unfavorable information, but not favorable 
information, in impression formation. 

In our experiment where the other bargainer 
is certain to see the statement, where forgetting 
is unlikely, since the sheet with the statement 
is before the other bargainer every trial, and 
where the reiteration is made by the same 
person, repetition had no effect. Perhaps, then, 
our experiment represents the conditions under 
which repetition will be ineffective, although 
it is always possible that we would have ob- 
tained an effect if repetition had been greater 
than 3 times in every 10 offers. 

This experiment was also designed to test 
the applicability of the last clear chance 
doctrine to bargaining. The pressure to yield 
on the person with the last clear chance was 
evident in the data from those negotiation 
sessions which went to the final round. How- 
ever, being last on the final round was con- 
founded with being last on every other round. 
The only way to completely eliminate such 
confounding would be to have simultaneous 
offers up until the last round. It was felt that 
such a procedure might appear peculiar to 
subjects. Furthermore, it might make ex- 
tremely salient to the subjects the hypothesis 
being tested, thereby maximizing the potential 
effects of demand characteristics. Since the 
bargaining order did not affect the magnitude 
of the subject’s opening offer or the subject’s 
likelihood of yielding before the final round, it 
seems reasonable to attribute the greater 
frequency of yielding on the last round when 
the subject went last to the order on the final 
round, 

Although the effect of being last was demon- 
Strated, in 23 of the 51 cases where the subject 
Went last on the final round, the subject 
refused to agree to the final offer of $40 for 
herself, a $10 profit, and $50 for the other. 
Only when the accomplice stated her break- 
even point at $50 did the majority of the 
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subjects, 10 out of 14, yield in the last position 
on the last round. If the accomplice's break- 
even-point statement of $50 could be believed, 
the subject would be making a $10 profit, and 
the accomplice, no profit. When the accom- 
plice’s stated break-even point was $40, 5 of 
the 11 subjects yielded. If the accomplice's 
statement could be believed, the profit division 
in this case would be $10 for each. When the 
accomplice's stated break-even point was $30, 
13 of 26 subjects yielded. In this case, the 
profit would be $20 for the accomplice and $10 
for the subject if the accomplice’s statement 
were true. 

A number of subjects, then, chose to forsake 
a $10 profit rather than agree to the accom- 
plice’s last offer. These people were probably 
concerned with their profit relative to the 
profit of the accomplice, so they would not 
settle for the $10 profit when it was certain or 
seemed likely to them that the other bargainer 
was making the greater profit. One might 
hypothesize that a person would be less willing 
to forsake the $10 if the payoff were actual 
money rather than hypothetical. On the other 
hand, it is possible that a person especially 
interested in relative gain would be even more 
concerned with relative gain if the other were 
beating him for something tangible. 
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THE ELUSIVE WEAPONS EFFECT: 


DEMAND AWARENESS, EVALUATION APPREHENSIO: 
AND SLIGHTLY SOPHISTICATED SUBJECTS: 
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It was predicted that the 1967 Berkowitz and LePage weapons effect would 


accounted for by awareness and cooperation with 


the aggression-reve 


purpose of the guns. Two replication attempts failed before a third x 


successful. The effect was obtained only with 
who were aware of the purpose of the guns. Aware subjects who wer 
ently experiencing little evaluation apprehension 


slightly sophisticated s 


cooperated with the hypot! 


sis. Those awares who did experience evaluation apprehension did not 
in accordance with the hypothesis, It was concluded that there is a com: 


interplay between sophistication, 


demand &wareness, and evaluation appreh 


sion in experiments designed to study a negative behavior. The Weapons expe 
ment is not evidence for the conditioning theory and cannot be generalized 


nonlaboratory situations, 


The development of the laboratory decep- 
tion experiment has enabled social psychology 
to make considerable progress, especially dur- 
ing the past two decades. There is growing 
evidence, however, that the experimental 
study of humans can be very complicated in- 
deed. As Silverman and Shulman (1970) 
pointed out, doing experiments with humans 
is like doing chemistry with dirty test tubes. 
The contaminants in this case are the needs, 
motives, and role expectancies of the experi- 
mental subjects themselves. Orne (1959, 1962, 
1969, 1970) has convincingly argued that 
subjects in experiments try to figure out the 
hypothesis and discover what is expected of 
them from the demand characteristics of the 
situation, Orne’s formulation of the social 
psychology of the experiment very broadly 
describes the subjects’ definition of role in 
the experimental situation, The perceptual or 
cognitive aspects of the subject’s conclusions 
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from the situation regarding 5 expected 
of him may be designated by more spe- 
cific and operationalizable con: f demand 
awareness (Page, 1968, 1969, a). Simi- 
larly, Rosenberg ( 1965, 1969 suggested 
that subjects are on their gi: uring ex- 
periments, motivated to present themselves 
in a favorable light. Demand reness (a 
largely perceptual concept ) "valuation 
apprehension (a largely motiva concept) 


may be viewed as being complementary 
processes that together describe the social na- 
ture of the psychological experiment from the 
subject's point of view. These processes, when 
allowed to operate unnoticed and uncontrolled 
by the unwary experimenter, can contaminate 
an otherwise well-designed experiment. In this 
sense, the most poorly controlled experiments, 
containing the most blatant demand cues, 
may be the most effective in obtaining pub- 
lishable results. 

While respecting what the laboratory ex- 
periment has contributed to our field and be- 
lieving that it can be very effective, we sus- 
pect that it has not yet produced as many 
facts having ecological validity as is currently 
believed. That some social-psychological phe- 
nomena can be easily brought into the labora- 
tory does not mean that all can be. All ie 
menters believe that their deceptions are et- 
fective and that their postexperimental ques- 
tionnaires are valid, but this does not neces- 
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sarily make it true. For many experiments the 
question can be legitimately asked: Who is 
fooling whom, the experimenter or the sub- 
ject? Aronson and Carlsmith (1968) sug- 
gested that some things, such as aggression, 
are extremely difficult to introduce into the 
laboratory. But, if some have come to this 
conclusion, why have others been able to 
study aggression in the laboratory with ap- 
parent ease? Could it be that they have been 
inadvertently capitalizing upon demand char- 
acteristics? 
A few years ago, Berkowitz and LePage 
(1967) brought guns into the laboratory and 
placed them right next to the shock button in 
a revenge situation. While they tried to dis- 
tract the subjects as to the true purpose of 
the guns by saying that they just happened to 
be left there by someone else, the obvious 
question is whether the subjects believed 
them. Wouldn't the presence of the guns so 
blatantly telegraph the hypothesis that the 
verbal disclaimer might be ineffective? 
Since Berkowitz and LePage (1967) got a 
significant difference between subjects in their 
revenge condition who had guns on the table 
and those who did not, they concluded that 
their classical conditioning theory (that wea- 
pons elicit aggression) was supported. Berko- 
witz (1968) reported in a popularized version 
“The finger pulls the trigger, but the trigger 
- may also be pulling the finger." If this hy- 
pothesis is true, then it is likely to be taken 
and used as "scientific evidence" by the ad- 
vocates of gun control legislation. Thus, the 
= question of whether the experiment is evi- 
dence for this conclusion becomes a socially 
relevant question as well as a scientific one. 
Since social psychology has so few social 
policy-relevant facts, it would behoove us to 
make certain that the weapons effect is a real 
effect having ecological validity. 
Did the subjects conclude that the guns on 
the table were supposed to have an effect on 
them, and did this knowledge somehow medi- 
ate the effect? If so, the experiment is not 
Valid evidence for the conditioning theory. 
Berkowitz and LePage (1967) appeared to be 
aware of this possible interpretation of their 
data as their discussion consisted mainly of 
arguments against a demand-characteristics 
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interpretation, In the popularized version, 
Berkowitz (1968) again took pains to empha- 
size that the subjects were not aware of the 
weapons. But, in the absence of acceptable 
evidence, disclaimers do not answer the ques- 
tion. Berkowitz and LePage would argue that 
their postexperimental interview and the fact 
that they dropped 39 subjects, mainly for 
being suspicious, is evidence that demand 
awareness cannot account for the data they 
report. But, because they used a brief inter- 
view schedule containing only general ques- 
tions rather than a carefully constructed ex- 
tended interview, their interview results are 
inconclusive. It has been shown elsewhere 
(Page, 1969; in press) that only when an 
extended interview employing a funnel tech- 
nique (general questions followed up by spe- 
cific ones) is used does one get a valid indi- 
cation of the extent of demand awareness. 
Vague and general questions tend to elicit 
vague answers that deal in generalities. Such 
answers are impossible to score validly regard- 
ing the specifics of a subject’s demand aware- 
ness. With such interview techniques there is 
always the strong possibility that false nega- 
tives are left in the data. 

At this point, something that unfortunately 
was not clearly noticed until after the second 
experiment should be brought in. Berkowitz 
and LePage (1967) were most concerned with 
eliminating subjects who were suspicious of 
deception (that there was a stooge and the 
subject was the only one being studied). 
Their only question relevant to demand 
awareness was couched in very general terms. 
It may have elicited reports of suspicion 
from very suspicious subjects, while not elicit- 
ing reports of knowledge of the weapons hy- 
pothesis. However, being suspicious that the 
experiment was a fake (Stricker, Messick, & 
Jackson, 1969) and being demand aware that 
the guns are supposed to have an important 
effect are not the same. Berkowitz and Le- 
Page dropped subjects only for suspicion of 
deception, not for demand awareness. In the 
published version of the experiment, Berko- 
witz stated that no subjects suspected that the 
guns were a part of the experiment. An un- 
published script of the exact procedure writ- 
ten by LePage and supplied to us by Berko- 
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witz flatly contradicts this statement. If we 
accept the report of the man closest to the 
data, they dropped subjects who suspected 
the stooge but retained those few who re- 
ported that they thought the guns were a 
part of the experiment. Also, in view of the 
nature of their interview procedure it is un- 
likely that all subjects aware of the guns re- 
ported this awareness, or that those who did 
report awareness of the guns revealed the 
full extent of their awareness. This is sug- 
gestive evidence for the demand-awareness 
hypothesis. Had the subjects who knew the 
guns were a part of the experiment been 
asked further questions about their ideas re- 
garding the purpose of the guns, perhaps they 
would have been found to have connected the 
guns with an attempt to increase their aggres- 
sion. 

The present study reports three attempts to 
replicate the “weapons effect” experiment. 
Only in the last attempt was a successful rep- 
lication of the originally reported effect ob- 
tained. In each experiment, extended funnel- 
type postexperimental interviews were con- 
ducted. The two unsuccessful studies are re- 
ported because in each case something im- 
portant about the social psychology of this 
experimental situation was learned. What be- 
gan as a rather routine investigation of the 
relation between demand awareness and the 
weapons effect became a complicated problem 
involving evaluation apprehension and sub- 
ject sophistication as well. 


: EXPERIMENT I 
Experimental Design 


Because the Berkowitz and LePage (1967) 
study obtained the greatest effect in the wea- 
pons-associated condition (ownership of the 
guns was attributed to the stooge), and also 
because they reported more suspicion in that 
condition, the weapons-associated condition 
was chosen for replication, Also, the aggres- 
sion variable was replicated (1 versus 7 shocks 
received by the subject from the stooge) in 
order to assess the possibility that a good 
deal of the “aggression effect” was due to a 
deviation from objectivity on the part of 
subjects in the 1-shock condition, In other 
words, the no-weapons subjects in the 7-shock 
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condition could be giving an objective rating 
(M = 4.67, where 5.0 is the midpoint of the 
scale) of an actually average paper written by 
the stooge. The subjects in the 1-shock condi- 
tion, on the other hand, may have felt obli- 
gated to give the stooge fewer shocks because 
he had not been objective by giving only one 
shock. Is the construct of aggression really 
necessary, or do the data fit better an expla- 
nation which emphasizes a norm oí reciprocity 
(“He didn't shock me more than he had to 
so I felt obligated to repay him") operating 
in the 1-shock condition? This norm-of-reci- 
procity idea was to be tested by extensive 
postexperimental interviews, as was the hy- 
pothesis that the weapons effect would be as- 
sociated with demand awareness and coopera- 
tion, 


Method 
Subjects 


The subjects were 62 male undergraduates from 
the introductory psychology course at the Univer- 
sity of Nebraska. They were required to participate 
in experiments but could choose the experiments they 
participated in, This particular experiment was rep- 
resented to them as a study in physiological reactions 
to stress. Two subjects were dropped from the 
analysis because this was not their first psychology 
course. These two subjects had taken a course in 
the College of Business Administration entitled 
“Human Behavior” which included discussion of a 
number of deception experiments, including Mil- 
gram's (1963) obedience study which also involves 
deceptive use of shock. This would not have been 
discovered without the extended interview. One of 
these subjects was in the weapons 7-shock condi- 
tion. (It is important to note that he had the ex- 
periment figured out and was the only subject to 
give 8 shocks) The other subject, though he saw 
no weapons, was more suspicious than any of the 
other subjects. The data from this study would not 
have differed significantly by including these sub- 
jects, but in view of previous research on subject 
sophistication (Page, 1968, 1969, 1970a), it was 
decided to replace these subjects. The fact that the 
one subject who reported previous knowledge of 
deception was the only one to show a weapons 
effect was crucial in the reasoning that led to Ex- 
periments II and III. The 60 retained subjects were 
assigned in equal numbers to one of the four cells 
of the 2X 2 analysis of variance design described 
above. 


Procedure 


The procedure followed was a step-by-step (a 
sofar as was possible to reconstruct) replication ^ 
the procedure reported in the published version 


original weapons effect experiment for the 
ons-associated condition (Berkowitz & LePage, 
). In the present study, the only difference from 
dition to condition was the presence or absence 
shotgun and a pistol on the shock-button table 
the number of shocks (1 or 7) that the subject 
wived from the stooge. What is reported here is 
ly a brief description. 

sign-up sheets, posted on a bulletin board 
ide the introductory psychology room, contained 
stitious names signed in different hand writing for 
ach of the time slots available. Subjects could then 
ign up on the blank lines and see that another 
erson was to be there at the same time. 
When the subject arrived at the laboratory, the 
sperimenter (both authors ran approximately equal 
umbers of subjects in all studies) took him to an 
perimental room and left him there alone to wait 
for the other subject. At the end of 5 minutes, the 
experimenter returned and said he was going to try 
9 find an undergraduate over in the North lab who 
ht substitute for the missing student. In a few 
utes he returned with the stooge, who was in- 
ced as an undergraduate major who had been 
another experiment. Instructions were then 
n to the effect that each subject would have 5 
inutes to give a written answer to a problem, then 
ey would be separated and would judge the qual- 
of each other’s work by giving from 1 to 10 
electric shocks. 
Following the problem solving, the stooge was 
hosen by the experimenter to give the shocks first, 
"and was sent into the room containing the shock 
button, The subject was then taken into the room 

aining the shock and galvanic skin response 

ctrodes (they were separated, supposedly, so that 
they would not influence each other's physiological 
tions), The subject then received shocks (1 or 
supposedly as his partner’s rating of his essay. 
n the subject and the stooge switched rooms. 
"They were allowed to see each other as they passed 
in the hall. How this was handled in the original 
‘study was not reported in the published article 
Berkowitz & LePage, 1967). As will be reported 
T, the demand characteristics associated with the 
in which subjects were not allowed to see each 


inning 


“study. Upon entering the second room with the 
Subject (if the weapons were next to the shock 
tton), the experimenter told the subject to disre- 
Ward the weapons and that they belonged to his 
artner who was conducting some sort of experi- 
“ment with them. Leaving the stooge's paper (mem- 
f orized and written exactly the same each time) with 
the subject to read, the experimenter went to “wire 
“up the other subject" and then returned to instruct 
“the subject to give the shocks. The subject was left 
“alone while he gave the shocks. A count of the 
Dumber of shocks given by the subject was the de- 
‘Pendent variable in the study. As in the original 
study, mood ratings were taken, but they were not 
| SCored, 
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The experimenter then entered the room and began 
the postexperimental interview which was conducted 
orally while the experimenter took extensive notes. 
Since the argument of this paper depends upon 
interview procedure, the entire questionnaire is re- 
produced following the orienting instructions. It is 
argued and supported by data elsewhere (Page, in 
press) that an interview this extensive must be used 
in order to obtain a valid assessment of what sub- 
jects knew or did not know during an experiment. 
The following orienting instructions preceded the 
interview: 


The experiment proper is now over and this is 
the postexperimental interview. In order to de- 
termine what the results of the experiment actually 
mean it is very important to find out what your 
thoughts were during the experiment. The only 
way I can think of to find out what your thoughts 
were is to just have you open up and to tell me 
what you thought. I'm interested in any ideas you 
had about the purpose of the experiment and 
any suspicions you may have had about any of 
the procedures. Keep in mind that there are no 
right or wrong answers at this point. What I am 
interested in is whatever you actually thought 
during the experiment and before this question- 
naire was introduced. 


PosTEXPERIMENTAL QUESTIONNAIRE 


1, What did you think the experiment was 
about? 

2, What did you think my hypothesis was (i€. 
what did you think I was looking for, trying 
to study, etc.) ? 

3. How did you feel toward your partner? 

4. What was your partner role in the experi- 
ment? 

5. During the experiment and before this ques- 
tionnaire was given, what suspicions did you 
have, if any? 

6. If you were suspicious, when did you become 
suspicious and what things made you sus- 
picious? Please answer in detail if you haven't 
already. 

7. What were your reactions to giving the 
shocks? 

8. Did you ever come to believe or suspect dur- 
ing the actual experiment that I was control- 
ling the number of shocks that you received, 
or that I told him how many to give you? 
Mese No 
If yes, how suspicious were you of this? 

Only slightly suspicious 


Quite suspicious 


Please explain if necessary. 

9. Did you ever suspect that your partner was 
in on the experiment during the actual ex- 
periment? Yes . No. — 
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li yes, how suspicious were you of this? 
Only slightly suspicious 


Quite suspicious 


Please explain if necessary. 
10. Did you ever suspect that the guns had some- 
thing to do with the experiment? 
Nes Nea 
If yes, how suspicious were you of this? 
Only slightly suspicious 


Please explain if necessary. 

1i. If you thought the guns had something to do 
with it, what were your ideas about their 
purpose? 

12. My hypothesis was that the gun on the tablè 
would increase your aggression toward your 
partner. Did you have this approximate idea? 
Yan Na 
lí yes, how certain were you of this? 

Only slightly certain 


Quite certain 
Please explain if necessary, 

13. If you thought you knew what I probably 
predicted about the number of shocks you 
would give, how cooperative were you with 
what I was looking for? 

Noncooperative 
Cooperative 
Please explain if necessary. 

14. If you thought you knew what I probably 
predicted about the effects of the guns on 
you, how cooperative were you with what I 
was looking for? 

Noncooperative 
Cooperative 
Please explain if necessary, 


Questions regarding weapons awareness were not 
included for subjects not in the weapons groups. 
Following the interview, subjects were further de- 
briefed and sworn to secrecy. The questionnaire was 
scored on each of four variables listed below, along 
with the hierarchies of scoring rules, 

Suspicion of the stooge. Was the subject suspicious 
of the stooge’s role in the experiment? 

1. Absolutely no indication of suspicion. Reported 
consistently that he believed everything, 

2. Possible indication of suspicion but unclear be- 
cause of any combination of vagueness, contradic- 
tions, or low certainty ratings. 

3. Was clearly suspicious at some point, but rates 
himself near the middle on certainty or is vague 
before the more direct questions, 

4. Reports before the direct questions that the 
stooge was a fake or that the experimenter actually 

gave the shocks and rates himself a 5 or higher on 
certainty. 


Monte M. Pace AND Rick J. SCHEIDT 


Revenge hypothesis. Did th lieve that 
the main purpose of the ex vas to elidi 
revenge or aggression, or its oj he 1-shock 
condition? 

1. Absolutely no indication ti subject had 
the revenge hypothesis. 

2. Possible indication but unclear because of any 
combination of vagueness, cor ictions, or low 
certainty ratings. 

3, Report contains words lik ger," "aggres- 
sion,” or “revenge” in describing t ypothesis, but 
still not absolutely clear that he w for certain 
that the experimenter expected n r less) shocks 
to be given because more (or k vere received; 
that is, there is no evidence to suggest that the 
subject did not believe this but he does not express 


himself clearly. 

4. Clearly aware that revenge 
the main purpose of the study 

Weapons hypothesis. Was th: 
the purpose of the weapons wa 
aggression? 

1. No mention of weapons, inclu 
sponse to Question 10. 


zgression was 


ect aware that 
cit revenge or 


ag a “no” re- 


2. Believed that weapons were a part of the study, 
but incorrect or unclear indication of what they 
were for. 

3. Weapons are mentioned along wiih ideas such 
as revenge or aggression, but nol mentioned before 
the specific questions and a certainty rating on 


Question 12 of 5 or below. 
4. Clearly aware that weapons \ 
cause more shocks including a certainty 


re supposed to 
rating above 


Negativism or resistance in an aware subject. Did 


the subject know what was expected but tried to 
act either independently or in the opposite direction 
from the hypothesis (scored for revenge awares 


when no weapons were present and for weapons 
awares when they were) ? 

1. Clear evidence of resistance. Three or lower 
rating on one of the cooperation scales and other 
supporting information, that is, saying “I tried to 
be objective,” “I didn’t want to appear to be an 
aggressive and vengeful person,” or “It was a stupid 
experiment.” 

2. Some evidence of resistance such as 3 or lower 
cooperation ratings or mention of objectivity motive 
but other evidence absent or unclear. p 

3. No evidence either way or conflicting evidence. 

4. Some evidence of cooperation including coopera- 
tion ratings of 5 or higher. 

5. Clear evidence of cooperation including coopera- 
tion ratings above 5 and clear explanations, that is, 
“I compared mine to his and because mine was bet- 
ter I had to give him more,” or “I gave him : 
couple more than his paper deserved because © 
What he gave me and the guns." 


Results 


Data on the number of shocks given by 
subjects under the four conditions of the ex- 


Ament are sun 
e seen that a st 


ed in Table 1. It may 
replication of the “ay- 


effect” (1 is V shocks received) 
red (F = 14 dj = 1/56, p < .001). 
, the inte n (weapons effect) re- 
ed by Berkowitz and LePage (1967) was 
replicated (F 59, df = 1/56, ns). In 


ithe original study, t 
column under th: 
4.67, respectively 
ere 4.67 and 4.80 
M a situation appar 
Study, they had 
in the case of 
for reasons 
ed earlier. Wh 
Was included in 


means in the 7-shock 
ditions were 6.07 and 
this study, the means 
hile the weapons were 
ly similar to the origi- 
ibsolutely no effect, ex- 
e subject who was re- 
sophistication as ex- 
the sophisticated sub- 
* data, the means were 
ly not significant. This 
akes it impossible to 
cen demand awareness 
will be argued later, 

o replicate when cer- 
lain critical demand-characteristics cues were 
hadvertently absent may be interpreted as 
support for the demand-awareness hypothe- 
iis. In other words, when demand awareness 
absent (reported in the next paragraph), 
e weapons effect is absent. While proving 
Rothing, this is consistent with the demand- 
Awareness hypothesis. 
The results of the postexperimental inter- 
ew were as follows. Only one subject in the 
weapons condition was scored as 

aware of the purpose of the weapons (3 or 
ligher), and he was given a score of 3 rather 
làn 4 because he appeared uncertain and 
id not mention the guns until asked on Ques- 
on 10. He also reported an attempt to be 
bjective in his rating of the stooge's paper 
ther than to cooperate. In other words, not 
ne subject was both weapons aware and co- 
perative. Twenty of the 60 subjects were 
Cored as aware (3 or 4) on suspicion of the 
tooge, Only 3 of these were in the 7-shock 
Weapons group. Data for suspicious subjects 
Nere no different than for nonsuspicious sub- 


the relation be 
nd weapons effect 
however, the failur: 


On awareness of the revenge hypothesis, 24 

ibjects were scored as aware (rating of 3 or 
1). Of these subjects, 17 were in the 7-shock 
‘ondition, with 9 in the weapons group and 8 
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TABLE 1 


Mean Nuuner or Suocks Given IN. Eactt 
Conprrion or Exrermest I 


Shocks received 


Condition | = HD. 
| 1 | 7 
Weapons 1.93 | 4.67 
No weapons 2.13 | 4.80 


in the no-weapons group. Only 1 of these 17 
subjects reported cooperation. The other 16 
subjects emphasized that their reaction was 
to be objective rather than to get revenge, 
for example, “I tried to evaluate his answer 
objectively—I had it in my mind not to let 
his shocks influence my evaluation.” These 
data were taken at face value as indicating 
that subjects were trying to be objective. Af- 
ter all, an objective rating of an “average 
paper” would be 5 on a 10-point scale. 

The low number of subjects in the 1-shock 
condition (7 of 30) who saw a relation be- 
tween the number of shocks received and the 
number they were “supposed” to give is evi- 
dence that demand awareness cannot account 
for the large difference between the 1- and 
7-shock groups. The shocking behavior of the 
aware and unaware subjects was the same 
(consistently low). In contrast to the other 
group, subjects in this group all reported that 
they were not objective in their ratings. The 
number they decided to give was not based on 
the worth of the answer but on the fact that 
the stooge had given only one shock, as the 
following examples illustrate: (a) “I gave 
him two because he gave me one”; (5) “He 
gave one shock, I gave him three and I felt 
guilty—compared my answer to his”; (c) “I 
was surprised that he gave me only one— 
thought my answers were worse. I didn’t 
know how many to give—thought he could 
get mad, I’d have an enemy if I gave him 
three or four shocks”; (d) “His answer 
could have been better but I didn’t want to 
give him more than two shocks—I felt reluc- 
tant, he was a nice guy"; (e) “I was expect- 
ing three or four—when he gave me one I 
felt obligated not to give him many no mat- 
ter what." In summary, a norm of reciprocity 
operated strongly in this condition. Most of 
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the subjects were not aware of what the ex- 
perimenter hypothesized—but they were fo- 
cusing on what would be fair play considering 
the circumstances. 

Discussion 

The results regarding the originally hy- 
pothesized relation between awareness of the 
purpose of the weapons and the weapons 
effect are inconclusive. Since neither factor 
appeared in the absence of the other, the 
hypothesis remains plausible, but further in- 
vestigation is required. In attempting to ac- 
count for the failure to replicate, two consid- 
erations were particularly salient. First, while 
the replication attempt followed the published 
description of the procedure (Berkowitz & 
LePage, 1967), there was no way of knowing 
how close our procedure matched that of the 
original study. Second, while speaking at the 
Nebraska Symposium on Motivation, Berko- 
witz (1970) stated that subjects at the Uni- 
versity of Wisconsin participate in many ex- 
periments during the introductory psychology 
course. Thus, the possibility that the weapons 
effect could be obtained only with sophisti- 
cated subjects (either because of previous 
deception or use of a different textbook) had 
to be considered as a. possible explanation for 
the failure to replicate—especially since the 
only sophisticated subject run under the wea- 
pons condition in this study was the only one 
to give 8 shocks and was the most clearly 
aware. 

Considering the possibility that the proce- 
dure of this study could have been subtly 
different from the original study, Berkowitz 
was asked for a copy of the original script 
which was subsequently supplied. When this 
was studied, it became apparent that the two 
experiments actually differed in many subtle 
ways. Basically, the present experiment was 
inadvertently covered better for demand cues 
than was the original study. For example: 
(a) When subjects arrived at the lab, the 
fact that two subjects would participate was 
revealed in the original study by saying “It’s 
just that I didn't want two friends signing 
up together, so I had two sign-up sheets" 
(demand cue to the subject, something about 
this study would be affected by friendship). 
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(6) When subjects changed rooms after the 
real subject had just been shocked, “I am 
going to get him to go into a third room while 
you change rooms because I don’t want you 
two to meet when you cross over” (demand 
cue, meeting between the shock exchange 
would affect the experiment). (c) In the 
instructions just before the subject was to 
give his shocks he was told “This is the last 
time shocks will be given in this experiment. 
There is something I want him to do when 
this is over so you won't see him when you 
leave" (demand cues, if you did not know 
he was not going to have a chance to get 
you back, this might inhibit you, and if you 
thought he was going to see you right after- 
ward, this might inhibit you). 

In other words, these cues could have been 
subtly saying “It’s alright to give a lot of 
shocks, in fact it's expected; you're strangers; 
you don't have to look at each other or ever 
see each other again and he won't have a 
chance to get you back if you go ahead an 
pay him back." This kind of subtle deman 
usually does not get retained in a condensa 
tion for publication. But from a subject 
point of view, such may be al 
important in determining his reaction. In the 
present study, since the "get him? demand. 
cues were not present, perhaps even subjects 
who had otherwise devined the revenge hy- 
pothesis were not as certain of it or not as 
sure that they would not reveal something 
bad about themselves if they did get revenge 
Revenge demand awareness without coopera- 
tion was the obtained result in both the 
weapons and no-weapons groups. Perhaps n 
the original experiment, weapons 2 
and these additional demand cues com ; 
to convince at least some subjects that ke 
only did the experimenter want to see ift a 
would get revenge or not, but he es 
that they would, and they might even 00 
bad if they did not. Ec. 

"Finally, the finding of a norm of recipio 
in the 1-shock group deserves further at ^ 
tion. Considering just the basic revenge 4 
aggression effect and disregarding the Mur 
effect, the group receiving only ! shoc za 
the active group. They deviated gi i 
jective rating, and it was not an artita 


cues 
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emand awareness. This effect is not easily 
explained by a theory that considers the 
I-shock group the “control” group and the 
7-shock group the “aggression” group. In 
what sense is the aggression group getting 
‘revenge by giving an average of more than 
,2 shocks less than they received? Many of 
‘these subjects knew that they were supposed 
fo get revenge, but they were not willing 
^o do so because they believed it would re- 
lect negatively upon them. In the 1-shock 
Fondition, aware or not, the natural and so- 
cially acceptable response to someone who has 
lone a favor is to return it as these subjects 
id. This is obviously not the proper base-line 
ontrol for assessing the aggression effect. 


EXPERIMENT IT 
Experimental Design 


Considering the initial failure to obtain the 
weapons effect, it was decided to simply try 
and replicate by changing everything thought 
to be wrong with the first attempt. It was 

‘reasoned that if the original subjects had 

- been previously deceived, then subjects who 
had knowledge of the use of deception should 
show the effect strongly with a replication 
of the original script. Therefore, students 
from an introductory social psychology course 
were selected as subjects. The shock in- 
density was reduced to a very mild level 
(as suggested by Berkowitz, personal com- 
munication, July 27, 1970), and relay noises 
present in the first study were eliminated. 
It was decided to run only the 7-shock condi- 
tion, since this was the only condition show- 
ng a weapons elfect in the original study. 

these changes, two groups Were 
both having the "weapons-associated" 
troduction of the stooge, one with and one 
without weapons on the table. 


Method 
ub jects 


The subjects were 34 undergraduate students from 
an introductory course in social psychology. This 
Was the second psychology course for most of the 
Students. They were run during the first tbird of 
he semester. All of the subjects had seen à film 
depicting four famous experiments (Focus on Bekav- 
ior: The Social Animal), three of which involved 
deception. Also, several deception experiments had 
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been discussed in class. Six subjects were eliminated 
from the analysis; two because of equipment mal- 
function, two for misperceiving the number of shocks 
they had been given, and two because they con- 
fessed to having heard about the experiment from 
a friend prior to participation. These confessions 
would not have been obtained without the extended 
interview. 


Procedure 

The procedure was a word-for-word replication 
of the original script, with the exception of the 
initial contact between experimenter and subject. 
Subjects had signed an appointment sheet passed 
around in class (under the inducement of a small 
amount of extra credit). When the subjects arrived 
at the lab they were met by the experimenter who 
asked them if they were from the introductory or 
social class. The experimenter then explained that 
he had signed up subjects from different classes 
because he did not want two friends to sign up 
together. From this point on, the replication was 
exact. The orienting instructions for the postexperi- 
mental interview were given orally, but the inter- 
view itself was given in written form. Each ques- 
tion appeared on a separate page, and the subjects 
were instructed to go through the booklet one page 
at a time. 


Results 


The mean number of shocks given by sub- 
jects in the weapons condition was 3.86, and 
the mean for the no-weapons condition was 
5.14. This difference was significant (t = 2.21, 
df = 26, p < 05, two-tailed) in the opposite 
direction from what was expected. These 
highly sophisticated subjects reacted by 
giving fewer, not more, shocks when they had 
weapons present. 

On the postexperimental interview, 7 of 14 
subjects were aware of the purpose of the 
weapons. This is in contrast to only 1 such 
subject in the first experiment. Twenty-three 
of 28 (82%) were suspicious of the stooge. 
This is in contrast to 30% in the comparable 
group of the first experiment. Twenty-five of 
28 (89%) were aware of the revenge or 
aggression hypothesis, compared to 57% in 
the first experiment. All but 1 subject (he 
was in the no-weapons group) was scored as 
3 or lower on negativism or resistance versus 
cooperation. This 1 subject gave eight shocks. 
He was suspicious of the stooge and aware 
of the revenge hypothesis—he said: “I thought 
it might be to see if the subject would show 
revenge to a certain extent. It may have been 


> 


E 
$ 


312 


a setup to see how I would react. I gave 
more than I would have if I had been first, 
but I didn't have many apprehensions be- 
cause I knew the shocks weren't painful and 
I did personally believe my paper was better 
than his.” Apparently he did not feel evalua- 
tion apprehension at cooperating with the 
revenge hypothesis because of what he told 
himself about his own paper really being 
better. 


Discussion 


Subjects in this study were highly suspi- 
cious and often seemed hostile toward the 
experimenter but not the stooge. Apparently 
this is a clear demonstration of what Masling 
(1966) has called the “screw you" effect. 
Most of them knew what they were 
“supposed” to do, but especially with the 
weapons present, they bent over backward 
trying not to comply. 

It was observed that most of the subjects 
had not taken the introductory course the 
immediately preceding semester. Also, it was 
deduced that most of the subjects had never 
experienced deception and debriefing first- 
hand as not many deception experiments were 
run at the University of Nebraska during the 
time period that they had taken introductory.* 
At this point it was concluded that a high 
degree of sophistication from secondhand 
sources, such as films and lectures, might not 
be equivalent to an introductory student’s 
slight sophistication after having been per- 
sonally deceived in an experiment a short 
time previously, The only possible sophistica- 
tion that subjects in the original study could 
have had was the latter, not the former. 
Perhaps what students learn from films and 
lectures on deception experiments is that the 
subjects appear stupid. Perhaps what a re- 
cently deceived subject learns is that experi- 
menters have hypotheses about what they 
study, and perhaps he does not want to get 
caught again not doing what he is “supposed” 
to do. In other words, the difference between 
our sophisticated group in this study and the 
slightly sophisticated group proposed for the 
following study may not lie in demand- 
awareness differences but in differences in 
evaluation apprehension aroused by the situa- 
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tion. Another plausible explanation at this, 


point was that both suspicion and demand! 
awareness turn subjects off, rather than on,! 
in studies of negative be r such as ag- 


gression. It could then be argued that what 
is needed are completely e and unsuspi- 
cous subjects. But then iy of the sub- 
jects in Experiment I were both naive and 
unsuspicious. 

At this point the differe 
cion of deception and d 
made in the introductior, became clearl 
salient. These subjects were certainly demand 
aware (especially of the revenge hypothesis) 
but they were also quite picious of th 
whole experiment, includi: : reality of t 
stooge. Perhaps it was this feeling that th 
whole experiment was a fac 
salient by the weapons) an‘ ‘hat the subject 
was the only one being stu 
portant in arousing evalua! 
Demand awareness is not 
cion of deception. A dei 
may believe the experimen! 
the experimenter is sincere 
size that “these guns ars 
something to me.” A dem 
highly suspicious subject 
himself into role playing ‘he expected hy 
pothesis confirming behavior without expeti 
encing too much evaluation apprehension. A 


between suspi- 
ind awarenes: 


: apprehension. 
e same as suspi- 
d-aware subject 
s real and that 
but also hypothe 

ipposed to d? 
id-aware but not 
then, may 


ExPERIMENT II 

Experimental Design 

Considering the two previous failures to 
obtain the weapons effect and wondering if it 
was possible to replicate the effect (perhap 
because of some uncontrolled differences 
tween subject pools at the University í 
Nebraska versus the University of Wi 
consin), it was decided to return to the intro 
ductory subject pool. This time previous eX 
perience with deception was included as "d 
independent variable. Again, only the 7-shoc 
condition was included, It was decided to "e 
twice as many subjects in the weapons as a 
the no-weapons condition, Thus, if a d 
effect were obtained, there would be a und 
sample for purposes of examining the Te’? A 
between weapons awareness and the T 
sion effect at an individual difference evel. 
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Since it was reasoned that suspicion of 
deception might arouse evaluation apprehen- 
sion while demand awareness alone might 
induce cooperative role behavior, the proce- 
ure was slightly redesigned. An all-out at- 
tempt was made to render the stooge and 
other -:pects of the procedure, such as the 
GSR, credible while not changing the demand 
for revenge cues. In addition, the way in 
which the presence of the guns was excused 
was le less explicit. In short, the proce- 


tightened insofar as suspicion was 
d and loosened in regard to de- 
varacteristics. In view of this goal, 
a change was made from the weapons- 
d condition of the original study to 


the weapons-unassociated condition. 
Method 

Subjects 

The bjects were 75 introductory psychology 
studen en were dropped from the analysis. Seven 
of these misperceived the number of shocks they 
received. and the other 3 confessed to prior knowl- 
edge oi the experiment from a friend. Of the 65 
retained, 33 had participated less than a month pre- 
viously in a deception experiment involving physio- 
logical arousal, a placebo pill, and attraction ratings 
of a very beautiful female lab assistant. In that 
study, none of the subjects was demand aware, and 


there was some suspicion of the placebo. All subjects 
were thoroughly debriefed afterward. Perhaps this 
was just the situation to motivate some of the 
subjects to try and figure out and confirm the 
hypothesis of their next experiment. The present 
experiment was the first participation for the 
32 naive subjects. 


Procedure 


The naive group was recruited by means of the 
Sign-up sheet procedure described in Experiment I. 
lt was specified that this must be the subject's 
first experimental participation. The sophisticated 
group was recruited by a telephone call. It was 
explained that they had been randomly selected 
from the course roster, and they were asked if they 
Needed any more experimental credit. At this point, 
12 subjects said they had already completed the 
requirement, and the conversation was terminated. 
All but 2 subjects who needed credit agreed to par- 
ticipate. They were told that the nature of the 
experiment could not be revealed beforehand, but 
that it involved making interpersonal judgments and 
Tequired two people. Strangers were needed for the 
Study, and since friends tend to sign up together on 
the bulletin board, this was the reason for the tele- 
Dhone calls. In this way, the real reason for the 
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telephone calls was covered, and the friendship de- 
mand cue mentioned earlier was introduced before 
the subject arrived at the lab. 

The stooge arrived 2 or 3 minutes prior to the 
appointed time; usually the other subject was al- 
ready there. The stooge appeared to be in a strange 
surrounding—said “hi” and sat down. He did not 
initiate any conversation, but if spoken to, he at- 
tempted to maintain his cover by making comments 
appropriate for an introductory student. If asked, 
he gave the false name that he had been assigned 
for that particular hour. The experimenter entered 
the waiting room, asked each their names, and 
escorted them to an experimental room. 

It should be noted that the acting in this study 
was upgraded over the last one. Careful attention 
was paid to such things as eye contact and expres- 
sive movements. The experimenters tried to encour- 
age a sense of morale in themselves and the stooges 
to the extent that each session became a challenge 
to see if the stooge could be “sold” to the subject. 
It is difficult to describe in writing the subtle dif- 
ferences between this procedure and that of the 
previous study. One specific thing was that some of 
the stooge’s lines were eliminated because they did 
not seem to ring true. In general, the stooge brought 
his behavior more in line with a typical subject by 
becoming less verbal and attempted to appear credi- 
ble by nonverbal means. Another specific change 
was the way in which it was decided who would 
receive shocks first. After the problem solution and 
“explanation” of the physiological and shock rating 
aspects of the study, a “random” drawing was held. 
The subject always won this because the number 
“1” was on both slips of paper. Then the subjects 
were separated and taken to two adjoining rooms. 
In the previous studies, the arbitrary choice of the 
stooge to give the shocks first had been a source of 
suspicion. The experimenter’s expressive and non- 
verbal behavior was also modified so as to come 
across as more forceful and confident. Since some 
previous subjects had not believed the explanation 
of the galvanic skin response, instructions were 
modified at that point, and the experimenter delivered 
them with a good deal of eye contact and confidence. 

After the subject had been shocked and had filled 
out his mood rating scales, he came into the other 
experimental room as instructed. There the experi- 
menter, attempting to be very forceful, confident, 
and sincere, seated the subject at the shock button 
and weapons (if they were present) and gave the 
instructions and the stooge’s paper. No mention was 
made of the weapons unless the subject asked; then 
he was told “Don’t worry about these.” It was 
hoped that this would make the weapons more 
salient. Typically, subjects glanced furtively at the 
weapons but did not say anything. It had been 
noted that all but one naive subject in Experiment I 
had believed that the weapons really were for 
another experiment. However, in that study the 
weapons were inadvertently covered better than in 
the original study because of following the wording 
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of the published summary rather than the actual 
script. In the second experiment, the wording of 
the actual script was followed, and half of the 
subjects were weapons aware, Several unawares 
reported that they had dismissed the guns when 
told that the weapons were for another study. 
Therefore, it was decided that perhaps the present 
procedure would uncover the weapons demand for 
more subjects. It might be argued that the exact 
procedure should have been followed here. However, 
in view of the previous failures, it was considered 
justified to attempt to obtain the effect by reducing 
the impact of variables believed to work against 
the effect and by increasing the impact of those 
variables believed to work for it. 


Results 


The means for the four groups of this 
experiment along with the ns in each group 
are presented in Table 2. Considering only 
the sophisticated group, at last the elusive 
weapons effect was obtained. These means 
were almost exactly the same as those of 
comparable groups in the original study, and 
they were significantly different (4 = 2.35, 
df =31, p < .05). A surprising finding was 
the nonsignificant reversal of the weapons ef- 
fect for naive subjects which in itself argues 
that some subjects in the original study may 
have been previously deceived. When the 
data in Table 2 were analyzed in an unequal 
"s analysis of variance (Winer, 1962), only 
the interaction was significant (F = 6.74, 
df = 1/61, p < .02). 

An explanation for this interaction was 
sought in the analysis of the interview ques- 
tionnaire. Considering first the weapons- 
sophisticated group, only 5 of 22 were scored 
as unaware of the weapons hypothesis. Few 
of the 17 awares would have been detected 
by the single general question used in the 


TABLE 2 


Mean NUMBER or SHOCKS Given IN Eacn 
CONDITION OF ExPERIMENT III 


Sophistication 
Condition 


Sophisticated 


5.86” 
4.64" 


Naive 


5.40* 
6.33* 


Weapons 
No weapons 


sn = 20, 
by = 22, 
on 12, 
an om Dn. 
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original study. Of the 17 awares, 8 were 
scored as resisters. The combined mean of 
the 13 subjects scored as unawares and aware 
resisters was 5.08, while the mean shocks 
given by the 9 cooperating awares was 7,00. 
The difference between these two groups was 
highly significant (f= 3.35, df= 20, p< 
.01). Only these 9 subjects showed a revenge 
or aggression response (giving 7 shocks for 
an average answer after having themselves 
received 7), but they all knew that the study 
was about revenge, and they all thought the 
weapons were there to make them aggressive. 

Next, the level of revenge hypothesis aware- 
ness and suspicion of the stooge for all so- 
phisticated subjects were considered. Twenty- 
seven of 33 subjects (81%) were scored as 
aware of the revenge hypothesis. Eighteen of 
these were in the weapons group. Since 17 
had the weapons hypothesis, only 1 subject 
who had the revenge hypothesis did not have 
the weapons hypothesis. Concerning suspicion 
of the stooge, only 1 subject suspected that 
the stooge had prior knowledge and was 4 : 
fake. This was in sharp contrast to Experi- 
ment II where subjects frequently used words 
like "setup," *fake," and "stooge." Thirteen 
subjects (39%) were suspicious that the ex- 
perimenter actually controlled the number of 
shocks given to see how they would react; 
for example, (a) “I thought that if your 
purpose was to determine any aggression I 
might hold for him after having been shocked 
myself, you would have to be the one con- 
trolling how many shocks I received”; (6) 
"T was suspicious that we never really gave 
each other the shocks and that the main 
testing was not our judging or operating 
under stress but the amount of stress pro- 
duced by first receiving then giving punish- 
ment and the revenge motive." Comparison 
of these comments with those obtained in | 
Experiment II suggested a clear difference 
between suspicion that the experimenter was 
in control of his variable and the suspicion 
that the entire experiment was fake. It was 
concluded that the combination of changing 
the cover on the stooge and changing the | 
operational definition of a sophisticated sub- 
ject resulted in less suspicion and suspicion 
of a more benign kind in this experiment. 


THE ELUSIVE WEAPONS EFFECT 


Thus, while demand awareness was found to 
be greater, evaluation apprehension was 
inferred to be lower in this experiment. 

Nine of 11 no-weapons sophisticated sub- 
jects had the revenge hypothesis, but not one 
of them claimed cooperation with that hy- 
pothesis. Taking another approach, the ques- 
tionnaires of all sophisticated subjects were 
content analyzed and classified according to 
whether they mentioned being objective as a 
motive in rating the stooge’s paper. This was 
only a rough variable because nowhere was 
a subject specifically asked if this were his 
motive; he had to volunteer it on the un- 
structured questions. The data, however, were 
significant ( x?— 6.74, df = 1, p < .01), with 
only 6 of 22 weapons subjects mentioning the 
objectivity motive, while 9 of 11 no-weapons 
subjects did. In other words, the no-weapons 
revenge-aware subjects were more motivated 
to make sure that the experimenter knew 
that while they knew the revenge hypothesis, 
they had tried to ignore it. Perhaps this had 
something to do with concern over a favor- 
able self-presentation. On the other hand, 
perhaps weapons on the table made the hy- 
pothesis so much more salient that the co- 
operating subjects failed to view their own 
compliance as possibly reflecting negatively 
upon themselves. 

Considering the questionnaire data for the 
naive weapons group, fewer subjects were 
aware of the weapons hypothesis (11 of 20), 
and fewer awares (3 of 11) were scored as 
cooperative. Without these 3 aware coopera- 
tors, the mean for the remaining 17 unaware 
and uncooperative subjects dropped to 5.00, 
very similar to comparable subjects in the 
sophisticated group. Considering all naive 
subjects, 26 (81%) had the revenge hypothe- 
sis, and 14 (4496) suspected that the experi- 
. menter had actually given the shocks. On 

these two variables, naive and sophisticated 
subjects were very similar. 

The most puzzling data were those of the 
haive no-weapons group—9 of 12 gave 6 or 
more shocks. Two of these (each giving 8 
shocks) did not report awareness or suspicion 
of anything. They both indicated that their 
main concern had been to give their partner's 
answer an objective evaluation as instructed. 
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Another subject who gave 7 shocks wrote the 
following statement late in his questionnaire: 
“I think the experiment won't be too reliable 
because there might be some sense of revenge 
in treating the opponent.” This suggested 
that perhaps he was actually angered and 
revengeful during the experiment. Ordinarily 
the subjects indicated that they were not 
really angry because it was just an experi- 
ment. The other 6 high-shocking subjects all 
reported revenge-hypothesis awareness, and 
3 claimed to have cooperated (something that 
the subjects in the no-weapons sophisticated 
group did not do); the other 3 subjects were 
scored 3 or neutral on cooperation. 


Discussion 


There is clear evidence in the unpublished 
script and procedure of the original study 
that some of the subjects had previous experi- 
mental experience. In discussing the results 
of his postexperimental interview, LePage 
(1966) stated: “Some realized as soon as the 
stooge entered the room because of their pre- 
vious experience with psychology experi- 
ments [p. 7]." There is also clear evidence 
that the subjects dropped from the study were 
dropped for suspicion of the stooge but not 
for connecting the weapons to the experiment. 
This study reduced suspicion of the stooge 
by modifying the procedure rather than by 
dropping subjects, increased the saliency of 
the weapons, and used previously deceived 
subjects and replicated the original effect. 
Though further research on the details of the 
situation is called for, the present results ap- 
pear to seriously question the results of this 
experimental situation as support for the 
original classical conditioning theory. It 
seems impossible to generalize the results of 
this experiment to the effects of weapons 
outside the lab as Berkowitz (1968, 1970) 
has done. 


GENERAL DISCUSSION 


What appeared to be aggressive behavior 
to the original investigators seems to have 
been a sham or an artifact. Demand awareness 
and cooperation give a better account of the 
data. In general, few subjects got angry (per- 
haps surprised or disappointed) because it 
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was only an experiment. Some of the slightly 
sophisticated subjects were so fully convinced 
that “these weapons are supposed to do some- 
thing to me” that they role played or talked 
themselves into rating the stooge’s paper by 
comparing it to their own and considering the 
number of shocks he had supposedly given. 
But they readily admitted that they were not 
really angry, it was just a part of the experi- 
ment. Again, we refer to Aronson and 
Carlsmith (1968): 


The experienced social psychologist knows that in 
the laboratory it is difficult either to convince sub- 
jects to believe that someone is angry at them or 
to get subjects to express aggressive behavior. That 
is, most subjects believe that such behavior is in- 
appropriate within the confines of an experimental 
situation [p. 24]. 


After running nearly 200 subjects in this 
so-called aggression experiment, this state- 
ment certainly seems correct. 

More generally, the data from this study 
provide empirical evidence for the recent 
theoretical integration of the concepts of de- 
mand characteristics and evaluation. appre- 
hension by Silverman and Shulman (1970). 
Cooperation with demand characteristics is 
not the only reaction available to a demand- 
aware subject, particularly when a behavior 
with potentially negative implications is being 
studied. In all three studies, there was a good 
deal of awareness of the revenge hypothesis 
with or without weapons on the table. How- 
ever, in most cases the subjects made an 
attempt to give honest and objective ratings. 
The primary reaction to knowledge of the 
revenge hypothesis was to attempt to ignore 
it. The experimenter had told the subjects 
to give 5 shocks for an average answer, and 
this was used as a reference point for their 
judgment. If we assume that evaluation ap- 
prehension was aroused at the prospect of 
presenting oneself to the experimenter as an 
aggressive and revengeful person, then the re- 
fusal to cooperate with the revenge hypothesis 
makes sense. 

Having explained the refusal to cooperate 
among revenge-hypothesis-aware subjects as 
reflecting high evaluation apprehension, one 
then has to explain why in the third experi- 
ment nine slightly sophisticated subjects with 
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the weapons present did cooperate. It is a 
much different experience to walk into a room 
thinking that the experimenter probably is 
looking for a revenge reaction and to find 
two guns extremely close to the shock button 
than it is to walk into the same situation 
without the guns. It may have been that for 
these particular subjects the guns were espe- 
cially compelling demand cue: 
so obvious in our culture that : 


the instrument of aggression, perhaps cogni- 
tions about the guns and the fact that they 
were “supposed to automatically do some- 
thing to the subject" blocked the usual 
arousal of evaluation apprehension. One does 
not have to worry about personally looking 
deviant or unsocialized if he absolutely 
convinced of the effect that the independent 
variable is supposed to have and is convinced 


that everyone else will respond in the same 


way. Perhaps these subjects role played be- 
havior that would confirm the hypothesis be- 
cause they felt that this would present them 
as typical or normal individuals, a conclusion 


they would not have come to without the 
weapons. There seems to be a complex inter- 
play between subjects' sophistication, demand 
awareness, and evaluation apprehension in 
experiments designed to study a negative 
behavior. 

Recently, Sigall, Aronson, and VanHoose 
(1970) have questioned the “myth” of the 
cooperative subject. They assert that when 
there is a conflict between favorable self- 
presentation and cooperating with the hy- 
pothesis, *looking good" will prove to be the 
stronger motive. In the present study, €O- 
operation at the expense of self-image was 
not observed. It was just that some subjects 
came to different conclusions about what 
would make them look good. Perhaps this 
same statement can be applied in retrospect 
to previously reported studies (Page, 1968, 
1970a, 1970b; Page & Lumia, 1968) where 
individual differences in cooperation-resist- 
ance among demand-aware subjects were ob- 
served. In those studies no explanation was 
offered for aware resisters. Following Sigall et 
al. (1970) and Silverman and Shulman (1970); 
aware resisters are experiencing more evalua- 
tion apprehension, They know what they are 


to do, but they have concluded that 
appear stupid if they do. Coopera- 
becomes a description of the sub- 
'avior from the experimenter’s per- 
spective. Evaluation apprehension is the 
explanatory construct for individual differ- 
ences in cooperation. 

It should also be noted that when the 
weapons effect was found, the data distribu- 
tion for the weapons-present sophisticated 
group bimodal in shape; that is, some 
subjects showed strong hypothesis-confirming 
behavior. All of these subjects were aware of 
and claimed to have cooperated with the 
hypothesis. The data for the other subjects 
in that «oup were more symmetrically dis- 
tribute! and appeared similar to the distribu- 
tion of ‘be control group. This is similar to 
the pai 2 found in several previous studies 
(Page, 1968, 1969, 1970a, 1970b; Page & 
Lumia 6 68). This clear partitioning of the 
two modes on the basis of the postexperi- 
mental measures of demand awareness and 
cooperation is strong evidence for the valid- 
ity of the postexperimental measures. It is 
also strong evidence for the demand-aware- 
ness explanation of these studies. The original 
interpretations of all of these studies pre- 
dicted only differences between groups of 
subjects. The social psychology of the 
experimental-situation explanation predicts 
individual differences within groups as well 
as the group differences. 

Finally, the relationship between the type 
of interview used and the results of this 
study must be discussed. There seems to be 
a reticence among some groups of social 
Psychologists to use or accept the results of 
extensive postexperimental interviews. This is 
apparently due to two factors: (2) a mis- 
trust of all introspective or verbal report 
data growing out of the behavioristic tradi- 
tion (no further discussion of this will be 
attempted here), and (5) a fear among those 
who do use interviews that extensive inter- 
views will somehow suggest awareness after 
the fact. While this is an empirically investi- 
Sable possibility, it is not thought to be a 
Very realistic one with the present carefully 
Constructed and controlled funnel-type inter- 
Views. One has his alternatives in postinter- 
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viewing for awareness. He can use brief and 
vague questions, thus hopefully avoiding 
throwing out subjects who were not really 
aware without suggestion, but he risks keep- 
ing subjects who were equally aware but less 
articulate or eager to reveal awareness. Or 
he can use extended and increasingly specific 
questions, thus hopefully avoiding keeping 
subjects in the data who were not truly aware, 
but he may risk suggesting awareness to 
some. One way of hopefully controlling for 
suggestion is to use saliency and certainty 
questions along with awareness questions. As 
has been argued elsewhere (Page, in press), 
it is scientifically more rigorous and defen- 
sible to throw out false positives (suggested 
awares) than it is to keep false negatives 
(aware but missed by an insensitive inter- 
view). This argument alone should be suffi- 
cient to require extended postexperimental 
interviews with every deception experiment 
conducted in social psychology. Without an 
extended interview (as opposed to an open- 
ended “opportunity to talk” which many 
subjects do not take), we really have no 
idea if our deceptions were effective or if our 
hypothesis was known. 

In our interviews, it usually took a few 
questions before the full extent of a sub- 
ject’s awareness was uncovered. Revenge 
awareness (the general hypothesis) usually 
came out by Question 2, but weapons aware- 
ness (a specific cue to the general hypothesis) 
seldom came out until Questions 5 and 6. 
Even then the guns were often just men- 
tioned (“the guns made me suspicious") 
without a clearly scorable explanation. It 
was usually at Questions 10 and 11 that one 
was able to judge whether the subject thought 
the guns were a part of the experiment but 
failed to connect them to aggression (scored 
as unaware), or had connected them to an 
increase in aggression (scored as aware). 
When this was discussed with them later, it 
was clear that either the subjects were aware 
of the purpose of the guns during the experi- 
ment or they were skilled and consistent liars. 
Their failure to mention them on the first 
two questions was the result of a combination 
of reticence to reveal perhaps unwanted sus- 
picions and the general wording of the first 
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two questions. A general question about the 
hypothesis elicits a general answer regarding 
revenge or aggression. The specific basis of 
that awareness is not revealed without spe- 
cific questions. Subjects usually do not reveal 
more than is asked for, particularly with an 
oral interview. Perhaps this related to a 
combination of evaluation apprehension and 
fear that they have spoiled the experiment. 
These kinds of problems reveal the fallacy of 
using only brief and general postexperimental 
interviews. 

While it would be naive to take everything 
subjects have to say at face value, it is also 
naive to reject everything they say, or to fail 
to get a valid report of their thoughts during 
the experiment because of nonoptimum inter- 
view techniques. When subjects are inter- 
viewed using a carefully planned procedure, 
what they have to say is often fascinating 
or disconcerting, depending on one’s point 
of view. And, most importantly, what they 
report about their feelings, thoughts, and 
hypotheses during an experiment often cor- 
relates very strongly with their experimental 
performance. 
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the interrupter’s, was artificially raised. 


A particularly salient process that requires 
predictability in order for social interaction 
fo proceed smoothly is the distribution or 
alternation of individual acts over time. In 
larger groups it is often necessary to provide 

al means for ensuring predictability, such 
parliamentary rules of procedure or the 
"smaller set of rules that apply to presiden- 
tial press conferences. Such means of ordering 
acts are, however, inappropriate in small, in- 
formal groups. 
- How, then, is predictable alternation 
achieved in small, informal groups? Kendon 
(1967) has made a contribution to the analy- 
is of this problem by examining the role of 

specific variety of nonverbal behavior, 
Namely, “eye contact.” Studying dyads, he 
found that at the end of an utterance the 
Speaker directs his gaze at the listener while 
the listener, in turn, fixes on the speaker and 
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Three hypotheses dealing with the effects of vocal amplitude on interruption 


outcomes and the effect of those interruption outcomes on group process were 
tested in an experiment that involved computer-controlled “boosting” of 
subjects’ voices. As predicted, vocal amplitude had a significant effect (p < 01) 
on interruption outcomes but only when the defender's voice, as opposed to 


The hypothesis that “retaliation” would 


ensue was also supported: Subjects who were frequently thwarted in their 
interruption attempts responded by (a) invigorating their own vocal response 
to being interrupted (p < .05), which led to (b) a significant improvement 
(p< 01) in the success of their own defenses over time. The results are 
viewed as demonstrating the importance of social structure and norms of 
appropriateness at the level of microanalysis. 


looks away as he (ie. the “new” speaker) 
begins. 


Such changes in behavior which precede the utter- 
ance itself clearly make it possible for each partici- 
pant to anticipate how the other is going to deal 
with an actual point of change in speaker role, 
perhaps facilitating the achievement of smoother, or 
more "adjusted" changeovers than might otherwise 
occur [Kendon, 1967, p. 33]. 


Weisbrod (1965) has drawn similar conclu- 
sions about the function of eye contact in 
larger groups. 

Eye contact is not the only mode that fa- 
cilitates smooth transitions from speaker to 
speaker. Postural changes are associated with 
the beginning and end of communication units 
(Birdwhistell, 1960;  Scheflen, 1964), al- 
though it is not yet clear that the informa- 
tion available in such postural “markers” is 
utilized by interactants. The pitch of a voice 
can also provide reliable cues as to a speaker's 
“intentions” in the form of “junctions” of 

. which there are three types in English: a rise 
in pitch denotes a question, a fall in pitch 
denotes a completion, and a holding of pitch 
indicates that the speaker will continue 
(Gleason, 1955). 

In spite of these various nonverbal and 
paralinguistic cues, which are available ior 
regulating dyadic alternation of acts, the 
process is rarely completely fluent. There are 
two obvious types of failure that can occur: 
(a) extended joint silence and (5) extended 
joint talk. The focus of this study is on the 
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regulatory process that determines the out- 
comes of joint talk or interruptions. 

Simultaneous talk by both parties in an 
interaction presents an obvious obstacle block- 
ing effective communication. Furthermore, 
while some interruptions are, no doubt, due 
to “miscalculation” or bad timing on the part 
of an interrupter, many are also due to the 
fact that one person cannot or will not wait 
for the other to finish what he has to say. To 
the extent that this occurs frequently, it may 
well threaten the stability of the interaction 
since the continually interrupted party may 
attribute the behavior of his partner to rude- 
ness, inattention, or attempts to dominate. 
Whatever the attribution, it would seem that 
the frequency and manner of interrupting, as 
well as the pattern of outcomes of the inter- 
ruptions, might well affect interpersonal rela- 
tionships. This seems especially true when one 
considers the intuitively close relationship be- 
tween the outcomes of interruptions, that is, 
who “wins” or “loses,” and the formation or 
expression of dominance-submissiveness rela- 
tionships.* Wiens, Thompson, Matarazzo, and 
Saslow (1965) have, in fact, empirically dem- 
onstrated this relationship. Their 
showed that supervisory and head nurses were 
more successful in interruption outcomes 
than were staff nurses. 

The discussion to this point has suggested 
the possible importance of interruptions and 
their outcomes for group process, but it has 
ignored the question of what determines those 
outcomes. A study by Meltzer, Morris, and 
Hayes (1971) provided a starting point for 
the consideration of that question. In that 
study, an exploratory correlational analysis 
showed that good predictions can be made of 
interruption outcomes knowing only certain 
amplitude variables. More specifically, the 
results indicated that interruptions were won 
by (a) whoever had, on the average, the 
louder voice during the period of simultane- 
ous talk and (b) defenders who raised the 
amplitude of their voices when interrupted. 

A second finding of interest from the 


3 This is not to say that each individual interrup- 
tion event can be profitably viewed as a miniature 
battle for ascendance, but, rather, that consistent 
success or failure over time in interruption outcomes 
is likely to have implications for interpersonal rela- 
tionships. 
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Meltzer et al. (1971) data w at interrup- 
tion outcomes are more high: ciated with 
amplitude variation of the d: r than they 
are with amplitude variati the inter- 
rupter. This finding of *as: ry" in who 
determines the outcome oí nterruption 
was also noted in Kendon’s 3) correla- 
tional analysis of interruptic ome data, “ 
RESEARCH HYPOTHESI AIMS 

Hypothesis I 

Amplitude is causally rel ; interrup- 
tion outcomes such that aj raises in 
voice by a defender, conting: pon his be- 
ing interrupted, will significa: increase his 
ability to successfully defend floor. 

Since the results in the tzer et al. 
(1971) study were entire! relational, 


there is a possibility that the ‘erences im 


~ 


amplitude that were observed e so highly - 
predictive of interruption omes were 
epiphenomenal. There may be other behav- 
iors associated with such ampiitude differ- 
ences that are causally related :o interruption 
outcomes, while amplitude «differences are 
merely covarying with outcomes. For example, 
it may be that interruptions are actually de 
cided on the basis of verbal semantic content. 
Such an explanation requires that interrup- 


tions are won by people who are saying more 
"important" things during the simultaneous 
talk period and that important or high-in- 
formation words and/or phrases are voiced 
with more amplitude (intensity) than are 
low-information words and phrases. : 
In the light of this, and other possibilities, 
the validity of the hypothesis needs to be 
checked by experimental manipulation of 
speaker amplitudes during interruptions. This 
will be accomplished by utilizing a laboratory 
computer programmed to recognize interrup. 
tions (operationally defined as a period of 
simultaneous talk immediately following @ 
period of solo talk by either subject) and, 0" 
some occasions, to operate an amplifier SU 
that the apparent voice of the defender is 1- 
creased in amplitude when he is interrupted. 


Hypothesis II 


There is an asymmetry in who dem 
the outcomes of interruptions such e. 
plitude increments by the defender W! 


e as stated in Hypothesis I, but ampli- 
crements by the interrupter will not 

the interrupter's success in attacks, 
as compared to an appropriate control. 

This hypothesis is based on the correla- 
fiona! indings from Meltzer et al. (1971) and 
Kendon (1963). Such an asymmetry in who 
the outcomes of interruptions 
: highly functional in informal dyadic 
itions since it supplies the “legiti- 
right for breaking impasses that might 
f voice intensity alone were decisive. 

intensity alone were decisive and 
` to either party in the interaction, 
there would be nothing to prevent both 

rss of the dyad from using it simul- 
sly. The result would be that certain 
ions would have to be decided by a 
z escalation of shouting, an event that 
ious disruptive possibilities. 
prediction of an asymmetry favoring 
the de‘cnder makes good intuitive sense since 
‘all interruptions involve, by definition, an 
‘interference with the ongoing vocalizations of 
the defender. It stands to reason that he, the 
"offended" party, should have the preroga- 
tive for deciding the outcome. 


determines 
would 


conv: 
mized 
result 


Hypothesis IHI 


It is predicted that those subjects who are 
frequently thwarted in their interruption at- 
tempts will, over time, (2) invigorate their 
responding during their own defenses and 
thereby (5) improve their own success in de- 
fending over time. 
It is assumed that those subjects who are 
Confronted by a partner who consistently (ap- 
parently) raises his voice when interrupted 
and thereby wards off interruption attempts 
(Hypothesis I) will become frustrated. 
Is there a remedy to persons so frustrated? 
Over time, such a person may contest longer 
and more vigorously in his interruption at- 
tempts, but such behavior is patently rude 
and, if the previous notions dealing with asym- 
metry in the power to decide outcomes (Hy- 
thesis TI) are correct, it is unlikely to be 
Successful. The asymmetry notion suggests 
that the only efficacious remedy is to retaliate 
in kind, that is, to make his defenses more 
igorous rather than his attacks. Hence, Hy- 
Pothesis TIT states that an unsuccessful at- 
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tacker will retaliate by strengthening his de- 
fenses. 


METHOD 
Overview of Procedure 


Subjects arrived at the laboratory in pairs and 
were randomly assigned to one of two experimental 
rooms. Each subject was individually outfitted with 
a microphone and earphones for the purpose of 
communicating with his partner. Subjects were in- 
structed to carry on a 20-minute conversation about 
campus unrest, 

There were three different conditions, two experi- 
mental conditions and one control condition. The 
control group (N = 11 dyads) carried on the dis- 
cussion with no experimental intervention. In both 
experimental conditions, the apparent amplitude of 
one speaker was increased during certain moments 
of the discussion. A laboratory computer recognized 
the existence of certain events and signaled a 
multiplier to boost the sound of one voice as trans- 
mitted to the other person. More specifically, in the 
“boost defender" condition (N — 11 dyads) during 
approximately 50% of the instances (instances se- 
lected on a random basis) when Subject 1 inter- 
rupted Subject 2, Subject 2's voice, as heard by Sub- 
ject 1, was made louder. In the “boost interrupter" 
condition (N — 11. dyads) during approximately 
50% of the instances (instances selected on a ran- 
dom basis) when Subject 1 interrupted Subject 2, 
Subject l's voice, as heard by Subject 2, was made 
louder. Thus, in the boost defender group the effect 
was to produce an apparently "vigorous" defense 
against the interruptions of Subject 1, and in the 
boost interrupter condition the effect was to produce 
apparently “vigorous” interruptions by Subject 1. At 
the conclusion of the 20-minute period, subjects were 
debriefed and paid for their participation. 


Subjects 


Subjects were solicited by a campus advertisement, 
which described the experiment they were to par- 
ticipate in as a “computer-assisted study of com- 
munication.” Subjects consisted of a cross-section 
from the summer school population including ad- 
vanced-placement high school students, undergradu- 
ates, and graduate students. All pairing was made 
such that peer dyads were formed, for instance, two 
high school students or two graduate students. Sub- 
jects were paid $3.50 per hour for their participation 
in the experiment. 


t The designations Subject 1 and Subject 2 were 
randomly made prior to the experimental session. 
These identifications were necessary since the ma- 
nipulation was asymmetrical in the sense that manip- 
ulations subsequently only occurred when Subject 1 
interrupted Subject 2 and never when Subject 2 
interrupted Subject 1. This is further elaborated in 
the Design section. 
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The apparatus included a pair of bone-conductance 
microphones, an Analog to Digital Converter, a 
DigiData computer-compatible tape recorder, an 
IBM 1800 laboratory computer, multipliers, a La- 
fayette 10-watt stereo amplifier, and a pair of Telex 
ST-20 earphones. (Figure 1 depicts the flow of the 
analog and digital signals through the apparatus.) 

The bone-conductance microphones used as a 
transducer a Sonotone 700-3001 bone oscillator hav- 
ing a 1,000 cycles-per-second (cps) impedence of 
1,000 ohms. The microphones were held in place on 
the forehead by a 1-inch elastic headband. Such 
microphones are advantageous for the proposed re- 
search because they are excellent in their rejection 
of noises in the environment other than those pro- 
duced by the speaker. A fuller description of the 


5 Computation was done at the New York State 
Veterinary College Physical Biology Computer Facil- 
ity, which is supported by National Institutes of 
Health Grant FR 00326. Grateful acknowledgement 
is made for their assistance, 


available in Hayes and Meltzer (1967). 

The speech signal was routed to a seven fas s 
Analog to Digital converter which performes ; 
number of basic functions including sampling, in a 
gration, quantizing, and multiplexing. In ud 
periment each subject's voice was sampled 10 be 
per second and converted to a digital numi c 
1 to 63 representing the intensity of the signal. E 
is a rough correspondence between these log e 
tude units and decibels, but the function is unkno i 

The data are multiplexed onto a tape m hich 
DigiData computer-compatible tape recorder, e 
provides a permanent record of the entire interac! 


Computer Control: Creating the Amplitude 
Boosts i 
01 
An IBM 1800 laboratory computer was used 


ise in voice 
experimental control. The apparent raise ee 
was accomplished by a computer-controlle 


nnel 


: dures 
9 A more complete explication of these proce 


can be found in Morris (1970). 


tion of a pair of multipliers housed in the A-D unit. 
The multipliers’ action was determined before each 
experimental session by a set of values typed into 
the c er. These values included a control volt- 
age, a sensitivity constant, and a modification value. 
These values, in conjunction, determined the "size" 
of the amplitude boost. For example, at the begin- 
ning oí each session a control voltage of 100 was 
typed into the computer. The effect was to instruct 
the multipliers to reduce the actual voltage by an 
arbitrary amount. (The highest possible amount of 
voltage would be represented by a control voltage 
of 511, which would let the "original" speech signal 
through.) Thus, the effect was to attenuate the 
speech signals as they passed from microphone to 
earphone. When an interruption of Subject 2 by 
Subject 1 occurs, the computer randomly selects 
Írom among these events and, with probability at 

‘el, multiplies the control voltage (100) 
sitivity constant (1.1) to the power of 
the modification value. The modification value 
throughout the experiment was 9, As a result a new 
control voltage (100 X 1.1") is calculated and trans- 
mitted to the multipliers, with the effect of letting 
more of the original signal through. This has the 
effect of raising the apparent loudness of the manipu- 
lated voice. The manipulation continues until the 
dual speech period terminates by virtue of having 
either Subject 1’s voice, Subject 2's voice, or both 
drop below the threshold, at which time the inter- 
ruption is considered to be completed. The control 
voltage then returns to 100. A threshold was adopted 
so that certain nonspeech sounds resulting from 
movement, minor throat clearing, etc., would not be 
considered as speech. The threshold was set so that 
samples with an intensity of 8 log amplitude units 
or more were considered as speech. This threshold 
Was selected on the basis of previous pilot study. 
The modification value was selected on the basis 
of pilot testing, which showed it to be a value that 
produced a raise in voice throughout the voice 
range that sounded “reasonable and natural” to the 
author and several of his colleagues. It is recognized 
that the “size” of the amplitude boost is an impor- 
tant variable in its own right and deserves to be 
studied in future research. The particular modifica- 
tion value used in this research had the effect of 
raising a subject’s voice an average of 8-9 log ampli- 
tude units throughout the normal voice range. 

The actual delivery of speech signal was com- 
completed by means of a Lafayette 10-watt stereo 
amplifier which drove Telex St-20 earphones worn 
by the subjects. The signal to the earphones was 
Sampled in parallel by the Analog to Digital unit 
in the same way that microphone inputs were; 
that is, it was sampled, integrated, quantized, and 
Multiplexed. Thus, the data recorded on the tape 
included a complete record of microphone inputs and 
earphone outputs. 
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One-Instant Events 


Due to technical problems resulting from the inter- 
face between the computer and the Analog to Digital 
Converter, interruptions that were of a duration of 
. second or less could not reliably be determined 
to have been "boosted" or not. (For a fuller de- 
scription of the difficulties involved and the methods 
used to circumvent them, see Morris, 1970). Conse- 
quently, it was decided to eliminate all one-instant 
(ie, .1 second or less) interruptions from con- 
sideration in evaluating the hypotheses (I and II) 
having to do with the influence of amplitude on 
interruption outcomes. These one-instant events were, 


however, retained for all other analyses where 
detection of a computer-produced “boost” was 
unnecessary. 
Procedure 


Subjects arrived at the laboratory in pairs and 
were randomly designated as Subject 1 or Subject 2 
(see Design section below). The following written 
instructions were given to both subjects: 


This is an experiment in which we are interested 
in comparing the way people commununicate in 
face-to-face situations as opposed to “on the 
telephone” situations. You are in a simulated 
telephone situation, The discussion topic I would 
like you to pursue is as follows: In light of 
recent campus upheavals I would like you to 
exchange your views as to what you feel are 
the basic conditions underlying all the turmoil. 
What do you think will come of the protest 
for both the short and long term. You may 
deal with this topic in generalities or specifically 
with respect to such issues as ROTC, Black 
demands, democratization within the University, 
academic freedom, etc. This discussion period 
will be approximately 20 minutes long. I will 
break in when it is time to stop. Are there 
any questions? 


The cover story having to do with communication 
over the “telephone” provided an excuse to separate 
subjects without arousing undue suspicion. This 
separation, in turn, justified the use of microphones 
and earphones. All subjects were told that the com- 
puter, which they saw upon their entry into the 
laboratory, would be monitoring their conversation 
and recording data on "certain aspects of their 
interaction." 

Following the instructions, subjects were led to 
separate rooms by the experimenter and outfitted 
with microphones and earphones. The microphones 
were affixed to subjects’ foreheads by means of an 
elastic headband. All subjects were cautioned against 
engaging in excessive movements on the grounds 
that the cables might become fouled, and also to 
refrain from gum or pencil chewing and sneezing 
(insofar as that was possible). Only one side of 
the set of earphones was operative (in order to 
be more like a “telephone” situation), and subjects 


were tobi to we whichever car they used when 
speaking on the telephone 

Following this, the audio system was tumed on, 
and subjects were instructed to count aloud to 10 
s0 a» to ascertain. whether their partner could hear 
adequately. The system was preset to provide 
equivalent outpats (tested with a signal generator), 
and if any subject his partner was 
aked to speak up a little. When both subjects were 


was able to report any correlation between this 
happening and interruptions. 
Experimental Design 


experimental design is presented in Table 1. 
To understand the design it is necessary to keep in 
a dear distinction between the two con- 


TABLE 1 
EXPERIMENTAL DESIGN 
TT TRMAM———————— 
Type of interruption event 
>. a 
Subject 1 interrupts Subject 2 
interrupts 
Slee Subject 1 


Boost defender | Subject 1 never Neith 
boosted; Subject 2 pes 


festa 30% of the 
Boost inter- — | Subject 1 boosted Nei 
rupter ‘of the time; Ed feo y 
2 never boosted. 
Control Neither ever boosted, | Neither ever 
boosted. 


Note.—In each condition of the experiment, any of 
and Subject 2) 
terru. 


the two 
will be 
pter," de- 
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-hypothesis is supported. The median 


rupt) which, in turn, permits ties 
hypothesis (Hypothesis TII) th: ts 
outcomes for these two domai ote. 


Resutrts 

Tied Events 

While the majority M 
were "won" or "lost," | uten 
some interruptions were | the 
that both voices dropped | hr 
the same instant. Such eve ere 
for all analyses except th ealing 
proportion of success. In su nalyses, 
inclusion of tied events wo have 
sitated the adoption of an a: ary & 
as to classification; for inst one 
arbitrarily say that a tie succe 
defense inasmuch as the int: ter 
vented from gaining the foor con 
one could consider a tie to 2 suco 
interruption since it causec lefend 
stop talking. The use of su: finitio 
not seem consistent with the it of 
ses dealing with the propor of si 
and so these events were not -onsi 


such analyses. 


Evaluation of Hypotheses 


Hypothesis 1. Amplitude is causally 
to interruption outcomes such that à 
raises in voice by a defender, contingen 
his being interrupted, will significantly 
crease his ability to successfully defend | 
floor. ? 

The results bearing on this hypothesis 
pear in Table 2. This table presents 
come of events in which Subject 1 in 
Subject 2 in the boost defender, 
interrupter, and control conditions. 

It may be seen from Table 2 


rate (Table 2, bottom row) of Subj 
defenses when his voice was boosted is Bi 
as compared to a median of 45% ont 
boosted defenses. Significance of this f 


7 Since it was not thought justified to assum 
mality, all analyses were conducted by mea 
appropriate nonparametric tests. All rank-o! E 
relations were tested for significance against | 
distribution as suggested by Siegel (1956, p. 212 
N >10. All significance tests were two-tailed 
as noted. 
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amber of interruption events te in. 
eraction of one dyad in the boost Iaterruptet 
4. This reduced the number of dyads to N = 


oot, therefore, appear in this table 
uated by means of the sign test 

to be significant* (p < .01). The 

erall success rate of Subject 2 in 
ses in the control condition is 42%. 

of success was compared to the 
«te of success of Subject 2 (in the 
ender group) in his boosted defenses 
(median = 80%) by means of a Mann- 
Whitney U test which produced U = 13.5, 
which is significant at the .01 level. 

Hypothesis H. There is an asymmetry in 
who determines the outcomes of interrup- 
tions such that amplitude increments heard 
to come from the defender will be effective 
as stated in Hypothesis I, but amplitude 
increments heard to come from the inter- 
rupter will not increase his success in attacks 
as compared to an appropriate control. 

In contrast to the results just cited, it may 
be seen from the two right-hand columns 
of Table 2 that when the boost is instituted 
in behalf of the interrupter, there is no dis- 
cernible impact. The median success rate of 
Subject 2 in the boost interrupter condition 
when defending against boosted interruptions 


* As mentioned above, all one-instant events were 
excluded from this analysis. Had one-instant events 
been included in this analysis, the medians would 
have been, respectively, 69% and 38%. These medi- 
ans are significantly different (p< 01) when tested 
by the Wilcoxon matched-pairs signed-ranks test. 


media 
his d: 
This 

media 
boost 


» these dyad retained vince ed interruptions 
oe tiered Ape e apran ai ap j eh m i L 


there were mo imierraptine by either member and. comm 


(549%) is similar to his median rate of success 
in defending against nonboosted interruptions 
(50%). These medians do not significantly 
differ from each other by means of the 
Wilcoxon matched-pairs signed-ranks test 
(T = 12, ns) nor does the median of 54% 
differ significantly from the median success 
of Subject 2 (42%) in the control condition 
in his defenses (U = 36, ms). 

Hypothesis HI. It is predicted that those 
subjects who are frequently thwarted in their 
interruption attempts will, over time, (a) in- 
vigorate their responding during their own 
dejenses and thereby (b) improve their own 
success in defending over time. 

The “thwarted” subjects, here, are those 
subjects whose success rates in interrupting 
have been adversely affected by the experi- 
mental manipulation, namely, Subject 1’s in 
the boost defender condition. Hence, in test- 
ing Hypothesis TII, the boost defender and 
control conditions were compared. 

The data bearing on Hypothesis IIIa are 
presented in Tables 3 and 4, which show the 
trend over time of Subject 1’s vocal ampli- 
tude response to being interrupted by Sub- 
ject 2 in the boost defender and control 
groups, respectively. A difficulty in evaluating 
the results in Tables 3 and 4 is that there 
are numerous cells with no entry, that is, 


" 
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TABLE 3 


Vocat Resronsx or SUBJECT | To INTERRUPTIONS BY 
Susyect 2 over Time iN tHe Boost DEFENDER 
CONDITION AS INDEXED BY THE CHANGE IN 
Intensity (Loo AwrLrrvDE Units) oF 
Scnjecrs l's VOICE FROM THE INSTANT 
PRECEDING THE INTERRUPTION TO THE 
FIRST INSTANT OF THE 
INTERRUPTION 


Time in S-minute blocks 


| 6-10 | 1-15 | 16-20 
1 | 30 | -10| -1.2 | -23 
2 X | -20| -25 | -50 
3 3.0 30| X | x 
4 —6.0 | —90 L5 0 
5 Oe Ee en x 
6 X 50| X 3.0 
7 -n0| x —80 | —10.0 
8 01x 0 0 
9 —7.5| —50| 8.0 2.0 
10 X |-120| X 20 
Hu —6.0 30| -10 | —15 
Median change | —60| —20| -141 0 


Note.—Cell entries represent medians for all interruptions 
that occurred in a S-minute block. An X indicates that no in- 
terruptions of Subject 1 by Subject 2 occurred in that block. 


for which no interruptions of Subject 1 by 
Subject 2 occurred in that 5-minute block. 
Consequently, median changes in vocal re- 
sponses were calculated by subtracting the 
median vocal response in the last 5-minute 


TABLE 4 


VOCAL RESPONSE OF SUBJECT 1 TO INTERRUPTIONS BY 
SUBJECT 2 over Time IN THE CONTROL CONDITION 
AS INDEXED BY THE CHANGE IN INTENSITY (Loc 
AMPLITUDE Units) or SUBJECT l's VOICE FROM 


THE INSTANT ING THE 'UPTION 
TO THE FIRST INSTANT OF THE 
INTERRUPTION 


T————————————————————————— 


Time in 5-minute blocks 
Dyads 

1-5 6-10 11-15 16-20 

1 —6.0 0| —7.0 —40 

2 —6.0 —7.0 x —12.0 

3 X —2.0 6.0 —8.0 

4 =1.0 0| —3.0 —6.0 

5 —9.0 35 0 —20 

6 0 -10| -3.5 5.0 

7 2.0 5.0 | —3.0 —10 

8 x x 4.0 2.0 

9 1.0 | —10.0 0 0 
10 0 —.5| —60 —40 
11 1.0 ue AU 15 5 
Median change RU — 8| —15 —2.0 


Note.—Cell entries represent medians for all i 
that occurred in a 5-minute block, An X indicates js d 
terruptions of Subject 1 by Subject 2 occurred in that block. 
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block for which there were data for a 
Subject 1 from the median vocal 
in the first 5-minute block for which tl 
were data from the same Subject 1. This 
cedure gives a rough index of the trend | 
time of vocal amplitude by Subject 1 
sponse to Subject 2’s interruptions and 
the advantage of retaining all of the dy: 
the analysis. 

The resulting difference scores 
in the boost defender and control g 
submitted to a Mann-Whitney U 
produced U = 34, which was sign 
the .05 level (one-tailed). 1 

The data bearing on Hypthesis Ib a 
presented in Figure 2. It can be seen ti 
Subject 1s in the boost defender group sh 
a dramatic improvement over time 
success of their defenses as compared to $ 
ject Is in the control group. In order to 
these results for significance, the pel 
success of the defenses of each Subj 
(in the boost defender and control gr 
in the first half (10 minutes) of the 
mental session was subtracted from his 
centage success in the last half of the 
sion. The median change from the first | 
the last half of the session in the boost de 
fender group was an increase of 48% in t 
success of Subject 1s’ defenses, while 
comparable control group incr 
success in defending by 7%. These diff 
scores were submitted to a Mann-Whi 
test, which showed the result to be signifi 
(U — 15, $< .01): 
—Similar'analyses were performed on. 
ject 1’s interruption attempts in order | 
determine if the invigoration in his V 
amplitude and increasing success were 
stricted, as predicted, to his defenses 
whether, contrary to the hypothesis, his 
invigoration and increasing success gen 
ized to his interruption attempts. Vo 1 
vigoration of Subject 1’s interruption 9 
tempts was indexed by changes over timi 
the initial (first-instant) amplitude level. 
his interruptions. The difference DE 
boost defender and control subjects did 
approach significance (U = 54, ns). 
larly, there was no significant differen! 
tween the trend of success of Subject 
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Fic. 2. Median success of Subject 1 in his defenses 
against interruptions by Subject 2 in the boost de- 
fender and control groups as a function of time. 


interruption attempts in the boost defender 
and control groups (U — 49.5, ms). Thus, 
Hypothesis ITI may be considered as fully 
supported, 


Discussion AND FURTHER ANALYSES 


The major hypotheses of the experiment 
were confirmed. First, the importance of a 
"physicalistic" cue, vocal amplitude, has 
proved to be a decisive cue in interpersonal 
interaction, at least insofar as interruptions 
are concerned, This statement, however, must 
be qualified by noting that the separation of 
subjects may have had the effect of increasing 
their reliance on cues such as amplitude. In 
face-to-face situations, where more cues are 
available, there might be less reliance on 
amplitude. On the other hand, good predic- 
tions of interruption outcomes were obtained 
by Meltzer et al. (1971) using amplitude 
predictors in a face-to-face interaction. 

Since verbal semantic content varied ran- 
domly across boosted and nonboosted de- 
fenses, it is untenable to maintain that verbal 
semantic content was having the decisive im- 
Pact on interruption outcomes. This is not 
to say that verbal semantic content plays no 
tole in influencing the outcomes of interrup- 
tions. Such an assertion would require an 
experiment in which amplitude was varying 
tandomly, while verbal semantic content was 
being systematically manipulated. Still, the 
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results from this experiment, in conjunction 
with a theoretical analysis of the communica- 
tion requirements for resolving interruptions, 
puts a semantic interpretation on the defen- 
sive. To assume verbal semantic determina- 
tion of interruption outcomes forces one to 
the conclusion that a relatively complex cue, 
such as information content, is decoded, 
evaluated, and responded to in less than 4 
second (which is the typical time span of an 
interruption). Far more congenial with the 
facts is an interpretation that rests on a fairly 
simple perceptual discrimination: “Did he 
raise his voice?” 

Of equal importance from a social psycho- 
logical point of view is the confirmation of 
the predicted role asymmetry, that is, that 
increments in amplitude coming from the 
defender are efficacious, whereas similar incre- 
ments whose source is the interrupter are not. 
This finding, as the ensuing discussion brings 
out, suggests some bridges between the micro- 
analytic level of analysis represented by inter- 
ruption outcomes and more traditional con- 
cerns of social psychologists, such as the 
relation between norms, roles, and behavior. 

Let us retrace the arguments used to 
generate the prediction of asymmetry. First, 
it has been proposed that there must be, 
from a functional point of view, some kind 
of asymmetry of power since, otherwise, 
amplitude increments could be used by both 
interrupter and defender, resulting in shout- 
ing matches that would not only be disrup- 
tive but also potentially indecisive. A pre- 
diction of asymmetry on functional grounds 
does not, however, help us in predicting who 
will hold the power advantage. The predic- 
tion of an asymmetry favoring the defender 
is based on two separate conceptual 
considerations. 

First, amplitude variations in the defender's 
voice carry more information about intent 
than do variations in the interrupter's voice, 
since the interrupter, by his very act, has 
indicated a desire to talk. The defender's 
intentions, though, are ambiguous since his 
talk may be nonessential (from his own point 
of view) in the sense that he may just be 
finishing a verbalization out of stylistic con- 
siderations, having already conveyed all the 
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information he had intended to. A variation 
in his amplitude can reveal to his interlocutor 
that he is indeed finished (lowered amplitude) 
or that he wishes to continue (increased 
amplitude). In short, it may be said that 
an increment in loudness by the interrupter 
is redundant and therefore less likely to have 
an independent effect than is the case with 
the defender. 

Second, a defender, at the point of inter- 
ruption, does not only have more informa- 
tion about intent to offer but, it would seem, 
there is greater legitimacy to his desire than 
there is to the interrupter’s. It is a matter of 
folk knowledge in this culture that interrup- 
tions are rude, even if they result from a 
mistaken perception on the part of the inter- 
rupter. Thus, it is to be expected that the 
interrupter would be hesitant to press on, 
when faced with information from the de- 
fender that he wishes to continue, as this 
would only compound the felony. 

In short, there is a coherent argument predic- 
tive of an asymmetry favoring the defender, 
That such an asymmetry exists is attested 
to by the data. The question then becomes, 
What are the additional implications of this 
asymmetry? The demonstration that the 
potential for wielding influence lies primarily 
with the defender says little about the effect 
of continued utilization of such a potential. 
To have a weapon is one thing, to use it is 
another. The impact of continued utilization 
of power advantage, as occurred in the boost 
defender group, should have implications for 
the ongoing group process involving a wider 
range of events. 

The data indicate that there is no evidence 
of such an impact over time in the domain 
of events in which Subject 1 interrupts Sub- 
ject 2 (where computer-controlled boosts are 
occurring approximately 50% of the time), 
but that there is such an impact over time 
when Subject 2 interrupts Subject 1. Subject 1, 
who is being frequently thwarted in his inter- 
ruption attempts by Subject 2 (who is appar- 
ently raising his voice) does not “contest” 
directly against this apparent defensive be- 
havior of his interlocutor. That is, over time, 
Subject 1 does not intensify (increase the 

amplitude) his interruption attempts as com- 
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pared to control subjects, ar ject 2's rate 
of success in defending not change 
significantly over time. Rat t is in that 
domain of events in whic! 'ct 2 inter- 
rupts Subject 1 (where bo ever occur) 
where dramatic changes oc ver time, in 
his role of defender, Subj: significantly 
invigorates his vocal res to inter- 
ruptions by Subject 2 an so doing, 
significantly reduces Subje: s success im 
interrupting him. 

These findings substantiate the following 
version of group process: The production of 
an increase in amplitude by a defender is a 
fairly unambiguous and powerlul cue. It is 
reliably followed by submission on the part 
of the interrupter. But, conti: occurrences - 
of this type of event are pre d to create 
an inequity with conseque creases in 
frustration or arousal on art of the 
consistently thwarted interru; The mani- 
festation of this frustration or arousal, how- 
ever, is guided by the normative patterning 
of interaction suggested by | notion of 


! interrupter 
the inequity 


asymmetry. In short, the thw: 
(here Subject 1) acts to reduc ; 
in that domain of events in which he is 
"entitled" to exercise maximum influence, 
namely, during his own defenses. He does s0 
by raising his voice more over time in re 
sponse to interruptions and, in so doing, 
reduces the inequity by increasing the success 
of his own defenses. 

An explanation that utilizes the concept of 
“modeling” is equally viable, however, given 
the results of this experiment. The invigo- 
rated defenses of Subject 1 could be viewed 
as being due to “imitation” of the computer- 
instituted invigoration of Subject 2’s defenses. 
This “imitative” invigoration of defenses ! 
conjunction with the already established va 
lidity of Hypothesis I would predict the 
observed pattern of results. ; 

While this experiment was not designed to 
test between the frustration and modeling 
explanations, there are some data that favor 
the frustration explanation. For examp g 
while it is true that every Subject 2 in the 
boost defender group experienced at 
change or a diminution in success over M 
in his interruption attempts, there is 2 W! 


! 
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taria in the overall (entire session) per- 
centa success in interruption attempts 
amon; ‘he 11 Subject 2s. Moreover, the ex- 
tent Subject 2s’ success in their inter- 
ruptio: attempts is significantly negatively 
correla ^1 with their own frequency of inter- 
ruption attempts (p = —.68, t = 2.82, df = 9, 
p< That is, those Subject 2s who inter- 
rupted ‘requently tended to have lower rates 
of suc in those interruptions. That there 
is no 'ssary relationship between the fre- 
quency of interruptions and success in them 
is indicated by the lack of significant relation- 
ships i: comparable correlations. For example, 
the cor: elation between frequency of interrup- 
tions success in them for Subject 1 in the 
contro! «roup is p = —.17; for Subject 2 in the 
contro! zroup, p — —.25; and for Subject 1 
in the boost defender group, p — .47. The 
compaiale correlations in the boost inter- 
rupter «oup are p = .03 and .07 for Subjects 
1 and 2, respectively. 

Thus, it can be seen that it is only in the 
contex: of a situation where a person engages 
in unsuaily vigorous defenses of the floor 
when he is interrupted that his own success 


in interruption attempts is significantly nega- 
tively correlated with the frequency of those 
interrupiion attempts. These data are con- 
sistent with a frustration explanation, which 
would hold that experiencing vigorous de- 
lenses and being thereby thwarted when one 
interrupts coupled with being frequently in- 
terrupted yourself summates and produces 
greater frustration, hence, more vigorous re- 
taliation. In further support of this view is 
the fact that those Subject 2s who inter- 
rupted most frequently tended to elicit the 
greatest amount of invigoration over time in 
the vocal amplitude response of Subject 1 to 
their interruptions (p= .47, p < 20). In 
short, the above findings are predictable from 
a frustration view, but not from a modeling 
Position. 

The final candidate for explaining the re- 
sults supporting Hypothesis IIb is the notion 
of “interruption credit,” which has been sug- 
gested by Leo Meltzer (personal communica- 
tion, July 1969) (cf. Hollander’s, 1958, 
notion of “idiosyncrasy credit”). By this in- 
lerpretation, the inflated success in the de- 
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fenses of Subject 2s in the boost defender 
group contributes to the "privilege" of Sub- 
ject Is to have a similarly high rate of 
success at a latter time; this is in order to re- 
store equity. This notion, while not ruled out, 
is weakened by the fact that there is an ob- 
servable behavior, the increase over time in 
the vocal amplitude response of Subject 1 to 
interruptions by Subject 2, which can explain 
Subject 1’s increasing success in his defenses. 
By analogy, when we observe someone using 
force (increase amplitude) to obtain some- 
thing (a successful defense of the floor), we 
are unlikely to presume that there was a norm 
(“interruption credit") that gave him legiti- 
mate claim ("privilege") to the object. This 
is not to say that no such norm exists but 
only that it was not apparently operating in 
this experiment. 


Post Hoc Analyses 


There is a further set of findings that 
deserves some mention here. There is a strong 
relationship between the number of times the 
two discussants interrupt each other in the 
boost defender condition (p = 82, t = 427, 
dj — 9, p < 01). The corresponding correla- 
tion? in the control group is p — .05. Let us 
consider the possible significance of this dif- 
ference insofar as it may shed light on the 
impact that the manipulation had on the 
quality of the interaction. 

We begin by noting that in both the 
boost defender and control groups, subjects 
begin with an implicit “working consensus" 
(Goffman, 1961), that is, a set of rules that 
specify what is and is not appropriate in 
regard to the organization of their perform- 
ances. If one assumes that blatant attempts 
at domination, for instance, frequent vigor- 
ous defenses against interruption attempts, 
are, to a certain extent, proscribed by the 
working consensus, then it may be said that 


9? This correlation does not attain significance in 
the boost interrupter group (p= .41, p > .20). The 
import and meaning of this and other findings from 
the boost interrupter group are difficult to interpret 
because the manipulation had no significant effect on 
the major dependent variable, percentage of success. 
Consequently, such findings are not generally dis- 
cussed here, as they would only serve to confuse the 
major issues. 
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the working consensus was violated in the 
boost defender group. Goffman (1961) sug- 
gested that violation of the working consensus 
causes the members of the dyad to shift 
from the “official” or task-oriented focus of 
attention to socioemotional aspects of the 
interaction, such as how well they are faring 
as actors, or normative aspects, such as what 
set of rules should apply. It is claimed, on 
this basis, that the effect of the manipulation 
in the boost defender group is to draw the 
attention of the participants to one another 
as individuals. Violation of the working con- 
sensus causes the concern of subjects to shift 
from the content of the conversation toward 
the concern of maintaining a satisfactory rela- 
tionship and set of rules within which both 
members can function. In the context of 
concern over rules and relationships, inter- 
ruptions are more likely to be responded to 
as having personal significance. The result of 
such a concern, in equal status dyads, might 
be that the interaction becomes a competi- 
tion. “If you interrupt me, TIl interrupt you." 
The result: a high correlation between the 
frequency of interruptions of Subject 1 and 
Subject 2. In the control group, the working 
consensus is not tampered with, and, there- 
fore, interruptions per se do not acquire 
personal significance. The result: one inter- 
ruption does not beget another, with the re- 
sult that the correlation between interruption 
frequencies is low. 

If there is a competition in the boost de- 
fender group then it seems appropriate to 
ask if there is a recognizable outcome of that 
competition in terms of dominance-submis- 
siveness (which is, after all, what the pro- 
posed competition is all about). If the dyads 
in the boost defender group are more likely 
than control dyads to develop a dominant- 
submissive relationship, then the data might 
be expected to show this. More specifically, 
one might expect that overall success rates 
in interruptions (attacks plus defenses) would 
be more extremely distributed in the boost 
defender group as compared to the control 
group. 

Since the prediction is one of an outcome 
from an ongoing process initiated at some 
indeterminate time during the 20-minute 
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session, it was decided that the data from the 
last 10 minutes of the session should be used 
for the test of the hypothesis. The appropri- 
ate nonparametric statistical test for com- 
paring the distributions is the Moses Test 
of Extreme Reactions (Siegel, 1956, p. 245). 
This test confirms the fact that success rates 


are more extremely distributed in the boost < 
defender than in the control group (S, = I1, 
N,=11, N,—11, A= 1, p< 03) during 


the last half of the session. 


Theoretical Implications 


This research is seen as having two major 
theoretical implications. The first has to do 
with the explanatory status within social 
psychology of that class of stimuli is phys- 
ical in nature. In our social interaction with 
others, we both send and receive a great 
deal of stimuli. It is true, but only trivially 
so as far as most social psychologists are 
concerned, that such stimuli impinge as pat- 
terns of light to the eye and air vibrations 
to the ear. The zeitgeist in social psychology 
would have us pay minimal attention to these 
stimuli in their physical form, devoting most 
of our efforts instead to the "constructions 
or interpretations which such stimuli give 
rise to. . 

Such as emphasis has been productive. 
There is no doubt that theorizing about 
cognitive process has given social psychology 
an empirical and theoretical "shot in the 
arm.” The danger is that the “construction 
ist" point of view can lead to an inattention 
to careful measurement and analysis 9 
physical parameters. 

The interest in physical stimuli, however 
is not to be construed as solely remedial in 
nature. In a more positive sense, it is impor- 
tant to inquire about those conditions zm 
which physical stimuli are likely to be SI 
nificant in social interaction. More spec" 
cally, it is in the burgeoning literature F 
nonverbal communication where attention 1 
“physicalistic” stimuli is beginning to hes 
fruit. This literature depicts people in inier 
action as exchanging information via ee 
different modes other than the vel 
instance, posture (Mehrabian, 1969; Schellem 


), eye contact (Kendon, 1967), and 
Merper:sonal distance (Sommer, 1967). 

This ability or propensity to use modes 
communication other than the verbal can 
esent a significant functional advantage. 
instance, (a) Kendon's (1967) analysis 
Of the function of eye contact patterns in 
dyads indicates that they facilitate smooth 
itions from speaker to speaker; (b) the 
exchange of “meaningful glances” can have 
a functional advantage in group situations 
where one wishes to communicate “in private" 
Since gaze can be “funneled” in a way that 
the verbal channel cannot, Similarly, a func- 
tional analysis of the communication require- 
ments during an interruption event suggests 
there is the need for a quick resolution in 
der to prevent a protracted period of simul- 
laneous talk which, at best, prevents com- 
ication and, at worst, can disrupt inter- 


light of this need, has the dual virtues of 
being easily modulated by the speaker, and 
providing “on-off” information, that is, “I’m 


th a minimum of delay. 
- The second major implication of this re- 
Search derives from the strong support the 


The results clearly show that the speaker- 
stener role set is an important factor in medi- 
ng the effect of amplitude in social inter- 
ion. This mediation is apparent both with 
Tespect to outcomes (the defender has the 


t is not undifferentiated or “mechanical.” 
ther, vocal behavior during interruptions 
forms to what has been called the “rule- 
following” model of behavior by Peters 
(1958). By rule following, Peters is referring 
to the guidance of behavior by “norms or 
standards of social appropriateness.” 

The theoretical importance of such findings 
ves from the fact that the “typical” 
lerruption is extremely brief (less than 3 
Second when speech is sampled 10 times per 
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second). Conversants, therefore, are differ- 
entially producing and responding to stimuli 
with latencies under | second according to 
whether their role is speaker or listener. To 
show role asymmetry in this context, there- 
fore, is to extend the explanatory value of 
concepts such as group structure and norms 
to the level of microanalysis. 
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THE “WEAPONS EFFECT,” DEMAND CHARACTERIST! 


, AND 


The Page and Scheidt 


THE MYTH OF THE COMPLIANT SUBJECT 


LEONARD BERKOWITZ?! 


University of Wisconsin 


In discussing the 1971 Page and Scheidt critique of the 1967 Berkowitz a: 

LePage “weapons effect” experiment, it is noted that (a) awareness of th 
experimental variables does not in itself prove that the subjects knew th 
study’s hypothesis or were motivated to confirm it; and (b) consistent viti 
Berkowitz’s classical conditioning model, there are indications in this papt 

that the presence of weapons had automatically influenced the subjects’ beha\ 

ior. The Page and Scheidt argument is inconsistent in a number of importar 

respects. A major problem lies in the correlational nature of their evidence: 
subjects may have developed the notion that the guns had affected thei 
behavior after they had acted, in part at least because the intensive po: 

experimental interview had put this idea in their mind. The demand 
characteristic notion is examined, and ii pointed out that there is ver 
little evidence that subjects are motivated to confirm the experimentei 

hypothesis if they are free of evaluation apprehension and do not fear th 
experimenter’s disapproval. Findings from a number of Wisconsin experimen 

are reported, indicating that many subjects react against the experimenter 
“demands,” possibly because of evaluation apprehension but apparently also 
because of reactance processes. The image of the compliant subject inherent 
in the demand-characteristic concept seems exaggerated. 


(1971) article sent himself in a favorable liz 


(to seem 


raised a number of important issues for my 
own research as well as for social psychology 
generally. Because of these widespread impli- 
cations, I would like to consider their criti- 
cisms of the original interpretation offered 
by myself and LePage (Berkowitz & LePage, 
(1967) and then go beyond the matter of a 
“weapons effect.” I present here some data 
collected at the University of Wisconsin over 
the past few years suggesting that many of 
our college student subjects (and probably 
those at other universities as well) are not 
as passive as the demand-characteristic notion 
maintains. 


How Plausible Is the Original Berkowitz- 
LePage Interpretation? 


Page and Scheidt made note of two types 
of experimental artifacts which could have 
affected the subjects’ reactions to the mere 
presence of weapons: evaluation apprehension 
and demand cues. Following Rosenberg 
(1969), they think of the former artifact as 
the subject’s anxiety-motivated desire to pre- 


! Requests for reprints should be sent to Leonard 
Berkowitz, Department of Psychology, University 
of Wisconsin, Madison, Wisconsin $3706 
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“healthy” or “normal” psychologically), and 
indicated that this apprehension operated to 
limit the number of shocks given to the 
target person. But in addition, demand cues 
supposedly informed the subject that the 
experimenter wanted him to punish the con- 
federate a certain number of times. With 
Orne (1962), Page and Scheidt argued that 
subjects are typically motivated to cooperate 
with the experimenter and therefore often do 
what they believe he wants. This compliance 
to demand cues presumably calling for more 
shocks, they said, led to frequent punishment 
of the confederate. 

According to Page and Scheidt, one such 
cue was the number of shocks inflicted on 
the subject in the anger-nonanger manipula- 
tion. Rather than provoking the subject, the 
seven shocks supposedly told him only how 
much he was to shock the confederate M 
return. Now, there is little doubt that ou 
subjects exposed to this treatment frequently 
seek to reciprocate—they want to “get even 
with the confederate. The seven shocks they 
receive justifies paying him back with rela- 
tively many shocks—but also motivates the 
men make aggressive responses. 


to these 


" 


Tue “Weapons EFFECT” 


There is some evidence for this in a number 
of my carly aggression experiments in which 
the aver was produced by direct verbal in- 
sults; when some of these studies were later 
repea using the electric shocks as the 
arouse: source instead of insults, identical 
outco: were obtained despite this proce- 
dural ference. More important, in these 
studie- as in the Berkowitz and LePage 
(1967 205) experiment, the subjects 
gettin: seven shocks rated themselves as sig- 
nifica angrier than the subjects receiving 
only : shock. At the conclusion of their 
paper, “age and Scheidt urged us to take the 
subjec'-' interview and questionnaire answers 
at faco value. In accord with their conten- 
tion, here any compelling reason why the 
mood »‘ings in the original study should not 
be acce ied as evidence that the seven shocks 
did © ste some anger? The Berkowitz- 
LePav subjects were instigated to attack 
the person who had hurt them. I gather that 
the Paye-Scheidt subjects were not as 
strony!) aroused for some reason. 

Other demand cues, Page and Scheidt said, 
were introduced by the experimental instruc- 
tions and explanations, especially by state- 
ments indicating that the subject was not 


to see the confederate again and would not 


be given any more shocks by him, or that 
friends were not to volunteer together. But 
this information was deliberately provided in 


order to lower the subjects’ inhibitions against 
aggression. Long experience has taught us 
that many of our college students are quite 
reluctant to shock a peer because they think 
it is wrong to hurt someone and because of 
fear of some retaliation. We wanted to lower 
this latter fear. The impulsive aggressive 
reactions elicited by environmental stimuli 
are generally fairly weak and easily masked 
by restraints or incompatible responses 
evoked by other types of stimuli. We usually 
have to minimize the subjects’ inhibitions 
against aggression in order to be able to 
detect these stimulus-elicited aggressive re- 
sponses, By the way, the fact that the great 
majority of subjects in all of these experi- 
ments return considerably fewer shocks than 
the seven given them indicates that the men 
are still unwilling to go all out in attacking 
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their partner. They think they will be hurting 
someone and do not want to give an exact 
tit for tat. To repeat, however, lowering 
restraints against aggression does not in itself 
instigate aggression. Did the statement that 
friends should not be in the experiment to- 
gether necessarily tell the subjects they 
should give their partners many shocks? If 
the instructions implied that the subjects were 
expected to shock their partner, why did so 
many of the people in the University of 
Nebraska research evidently experience evalu- 
ation and apprehension and give few shocks? 

Inhibitory processes could also account for 
the direction of some of the differences ob- 
tained by Page and Scheidt (1971). In sev- 
eral experiments, provoked subjects have 
“Jeaned over backward" to avoid exhibiting 
aggression when (a) they have been strongly 
instigated to aggression but (b) believed 
strong aggression was improper or unjustified 
in the given situation (e.g. Berkowitz, 
Lepinski, & Angulo, 1969; Berkowitz & 
Turner, 1971). I have pointed out that this 
pattern is conceptually similar to findings 
reported by Clark (1952) in his well-known 
study of the effects of sex arousal on TAT 
responses. Sexually stimulated men in that 
experiment generally gave fewer sex-related 
TAT responses than did their nonaroused 
controls until they were placed in the in- 
formal and permissive setting of a beer party. 
In all of these cases, the “reaction formation" 
occurred because the subjects were instigated 
to do something that might bring censure 
or disapproval. Can it be that a similar 
phenomenon operated in the University of 
Nebraska Experiments IT and II? In those 
investigations, the subjects presumably feel- 
ing the strongest evaluation apprehension 
also tended to administer fewer shocks when 
weapons were present, as compared to when 
the guns were not nearby, Evaluation appre- 
hension in itself does not explain the direction 
of this difference; the concept only implies 
that the subjects should restrain these actions 
that might put them in a bad light. Assuming 
that the Page-Scheidt subjects were appre- 
hensive, as the authors suggested, my hunch 
is that they detected the relatively strong 
aggressive reactions evoked in them by the 
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weapons, believed they would look bad if 
they exhibited this strong aggression openly, 
and in resisting their aggressive inclinations 
actually gave fewer shocks than the non- 
tempted controls. In other words, rather than 
being due solely to evaluation apprehension, 
the unusually low aggression in the presence 
of weapons is presumptive evidence of a wea- 
pons effect. (This may seem like circular, 
can't-lose reasoning, but is certainly suscepti- 
ble to experimental test. There should be a 
positive effect when this anxiety is low, and 
a negative effect when the apprehension is 
great.) 

The Page and Scheidt. report does docu- 
ment the relative weakness of the weapons 
effect. Situationally evoked inhibitions can 
mask the stimulus-elicited aggressive reac- 
tions, as was noted earlier, and personality 
variables might also cloud this effect. Follow- 
ing the original procedure, Fraczek and 
Macaulay (1971) gave all of their subjects 
seven shocks and then placed them either in 
the no-weapons or unassociated-weapons con- 
ditions. These men had previously been clas- 
sified as either high or low in emotionality in 
terms of their responses to a word association 
test. The investigators found that only the 
low-emotionality subjects were more aggres- 
sive, on shock and questionnaire measures, in 
the presence of the guns. By contrast, the 
high-emotionality men showed the “reaction 
formation"; they gave reliably shorter shocks 
and were more apt to deny their anger if 
weapons were nearby than if they were not 
present. Fraczek and Macaulay attributed 
this negative difference to fairly strong anxi- 
ety in the highly emotional subjects, stemming 
partly from their very strong aggressive in- 
clinations. 

A somewhat simpler, though related, possi- 
bility is also consistent with the classical- 
conditioning interpretation of the weapons 
effect. In discussing this phenomenon (see, 
eg, Berkowitz, 1971), I have pointed out 
that it is the meaning imparted to the object 
by the observer that enables the weapon to 
function as a conditioned stimulus for ag- 
gressive responses. For many persons, the ag- 
aggressive meaning is paramount, For other 
people, however, weapons might be associated 
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more strongly with anxiety and suffering so 
that the sight of a gun would automatically 
evoke in them strong, aggression-inhibiting 
anxiety. Suggesting this, when Fischer, Kelm, 
and Rose (1969) asked if knives could fune- 


tion as aggression cues, they íound that the 
presence of a switchblade knile produced 
heightened moods of hostility, anxiety, and 
depression in provoked men. The guns might 


have had a stronger anxiety than aggressive 
meaning for the highly emotional subjects in 
the Fraczek-Macaulay experiment, causing 
them to restrain their aggressive reactions. 
Further research into the effects of the mere 


presence of weapons clearly must determine 
what ideas and feelings are associated with 
these objects; their meaning influences the 
effect they will have. 


Some Problems with the Page-Scheidt 
Argument 


Many things could be said about the rea- 
soning employed in the Page-Scheidt re- 
search. For example, I could question the 4 
authors' failure to give the subjects in their 
third experiment an explanation for the guns 
on the table. Seeing these weapons lying there 
for no apparent reason, the naive subjects 
should have been far more apprehensive than 
their University of Wisconsin counterparts. 
Who knows what other effects might have also 
been produced? The University of Nebraska 
investigators wanted to avoid any “demand 
characteristics” inherent in the explanation 
but were left with a situation that was ex- 
tremely strange to their subjects and that was 
very different from the original experimental 
setting. The Berkowitz-LePage statement 
accounting for these weapons undoubtedly 
heightened the University of Wisconsin sub- 
jects’ awareness of the guns—but the men 
could not have reacted to the weapons if they 
had not seen them. This awareness is not Jn 
itself proof that the subjects (a) knew the 
experimental hypothesis and (b) were moti- 
vated to confirm it. The fact that the sound 
of the bell was very apparent to Pavlov’s dogs 
does not mean that they then salivated in 
order to comply with Pavlov’s expectation. 
In order to demonstrate the influence of de- 


nd cues, we must know more than that 
je subjects attended to the guns. 
Bu: rather than go into a point-by-point 
rebuttal, | will try to save journal space by 
psidering only a few of the major problems 
fentering around the inconsistencies in the 
and Scheidt argument. To begin, in 
ussing their results, the authors differenti- 
] between two types of awareness which 
upposedly operated in the University of 
consin experiment. They termed these the 
Subjects’ revenge and weapons hypothe- 
es. Surely, demand-characteristics advocates 
vould have initially expected both hypotheses 
Jo lead to enhanced aggression; the subjects 
"presumably would think the investigator 
ted them to seek revenge and also wanted 
to attack their tormentor in the pres- 
ince of the guns. (Indeed, the authors’ earlier 
‘interpretation of the seven shocks given to 
the subjects implied that these shocks “de- 
manded” many shocks in return.) Page and 
Scheidt told us, however, that the revenge 
hypothesis led to evaluation apprehension 
and thus suppressed retaliatory aggression, 
"while the weapons hypothesis supposedly pro- 
duced a high number of shocks. I somehow 
find it difficult to believe that college students 
would deliberately want to show both that 
they do not try to get even with someone who 
had insulted them earlier and also that the 
"mere sight of guns provokes them into an 
Unusually strong attack on their partner. Why 
ould anyone be willing to demonstrate that 
he reacts extremely aggressively just to the 
presence of weapons if it is generally bad to 
“seek revenge? If this population typically 
rowns on aggression, which seems likely, 
would a subject assume he can look good by 
‘reacting strongly to guns? Do most people 
really think, as Page and Scheidt suggested, 
hat a normal individual would “fly off the 
handle,” so to speak, and attack some tor- 
mentor unusually severely simply because 
‘Suns were nearby? (If many laymen do be- 
lieve this—which I doubt—this probably 
means that they have seen the “weapons ef- 
fect” take place. Maybe, as far as the man on 
the street is concerned, it was not necessary 
‘to carry out the Berkowitz-LePage experi- 
ment.) 
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Then, too, we must wonder about the in- 
tensive probing carried out by the University 
of Nebraska researchers. If subjects are so 
compliant, why does it take so much ques- 
tioning to get them to reveal their hypothe- 
ses about the experiment? Orne’s (1962) 
reply was that the subjects refuse to reveal 
their suspicions because they do not want 
their performance wasted. How different bis 
subjects must have been from those we have 
at the University of Wisconsin! Many of the 
students we see are quite eager to show how 
smart they are, and that they have not been 
fooled. If they have any confidence at all in 
their suspicions (and do not think they will 
look silly by mentioning something stupid), 
they are quite willing to demonstrate that 
they caught on to the deceptions. 

The most obvious inconsistency of all cen- 
ters around the postexperimental interview. Ii 
subjects are so responsive to demand cues in 
the experiment, why isn’t the repeated prob- 
ing at the end of the session a strong demand 
in its own right? Isn't it likely that the inter- 
view crystallizes the subjects’ suspicions, fo- 
cusing and steering their doubts in the direc- 
tion of the questions, and ultimately putting 
words in their mouths? Many persons seek to 
justify their aggression by inventing reasons 
for their behavior after they have attacked 
their victim (Brock & Pallak, 1969). The 
subjects who gave the confederate many 
shocks in the presence of the weapons may 
well have felt guilt and/or dissonance; they 
did not have an adequate justification for 
what they had done. When the experimenter 
then came along and implied that the experi- 
mental setting and especially the guns had 
been responsible for the strong attacks on the 
victim, the subjects could have seized on 
this idea as an excuse for their actions. They 
now believed the notion implanted in them by 
the questioning because it explained and justi- 
fied what they had done. 

Here is the most serious weakness in the 
Page and Scheidt (1971) paper: Their ar- 
gument essentially rests on correlational evi- 
dence, and we cannot be certain about the 
direction of causation. The weapons hypothe- 
sis is associated with strong attacks upon the 
confederate but might have resulted from 
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these attacks in cooperation with the inter- 
viewer’s questioning. Adequate proof of the 
role of demand characteristics requires un- 
tangling this correlational problem. 


Where Is the Compliant Subject? 


I have often wondered about the wide- 
spread acceptance of the demand-character- 
istics notion in social psychology. There are 
so many logical and empirical difficulties that 
have not been resolved. I have already al- 
luded to the matter of the correlational na- 
ture of the evidence, but there is also a very 
broad inferential leap in the reasoning. Orne 
(1962) showed that subjects will persevere 
on an apparently meaningless task for a sur- 
prisingly long time, and concluded that they 
are motivated to help the experimenter. As 
Sigall, Aronson, and Van Hoose (1970) have 
commented, however, Orne demonstrated only 
that subjects are obedient to the experiment- 
er’s clear request amd not that they are 
strongly motivated to confirm the experi- 
menter's hypothesis if this is not explicitly 
required of them. Sigall et al. found that sub- 
jects will attempt to present themselves in a 
favorable (“normal,” *healthy") manner, as 
the evaluation apprehension notion (Rosen- 
berg, 1969) contends, but do not give any 
indication of being motivated to comply with 
non-explicitly-required “demands” when this 
evaluation apprehension is controlled. T for 
one would like to have evidence that this 
kind of compliance does arise when the sub- 
jects are not afraid of incurring the investi- 
gator’s displeasure and have no reason to 
think that the compliance will enhance their 
self-image. 

Our experience at the University of Wis- 
consin also runs strongly counter to the no- 
tion of a compliant subject. Time and time 
again we have seen that our subjects’ aware- 
ness of the deceptions practiced on them 
tends to be inversely related to the strength 
of their aggressive reactions to the experi- 
mentally presented stimuli. Page and Scheidt 
argue that subject suspiciousness is not the 
same as demand awareness. Nonetheless, the 
suspicious subject is generally also aware (at 
some level) that the investigator is interested 
in his aggressive behavior; if he doubts the 
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cover story, he usually catch ) the pur- 
pose of the shock apparatus | the “de- 
mand” viewpoint, then, he be moti- 
vated to give many (or inter ocks—but 


this frequently does not happ: 
The Berkowitz-LePage exp nt yielded 


nonsignificant trends in this ive direc- 
tion, while Fraczek and Maca 1971) ob- 
tained somewhat stronger ino ons of this 
negativism in their replicatio: »e original 
study. Thus, where their retai: ghly emo- 
tional subjects administered a: ge of 5.5 
shocks in the presence of the : or a total 
of 3.8 seconds duration, the í en in this 
condition who were rejected bi e of their 
clear awareness of the study' ern with 
aggression gave only 4.67 shoc ting only 
2.8 seconds. Similarly, the less ional sub- 
jects gave 5.6 shocks for a to ation of 
3.1 seconds when the guns wer: , but the 
four aware-rejected men hac ninistered 
4.8 shocks lasting 2.6 seconds. 

Pertinent findings were als ained by 
Berkowitz, Alioto, and Turne: in experi- 
ment investigating the effects o! meaning 
of filmed violence. Hypothesizin: that wit- 
nessed fights and football games would elicit 
aggressive reactions to the extent that the 


observers imparted aggressive meaning to the 
scene (ie. believed that the players wanted 
to hurt each other), we provided half of the 
angry men seeing the movie with a brief in- 
troduction to the film that defined ihe contest 
as an aggressive encounter. The demand- 
characteristics theorist would surely say this 
aggressive-meaning introduction was à “de- 
mand” on the subjects calling for strong a£ 
gression. However, when we classified the 
subjects in terms of their awareness of the 
deceptions and purpose of the study (on the 
basis of a final interview), we found that this 
awareness was associated with a decreased 
number of attacks on the available target 
person: Sixty-five percent of the men in the 
prize-fight-aggressive-meaning condition who 
had any awareness of the experimental ma- 
nipulations gave fewer shocks than the mea? 
given by the nonsuspicious subjects 1n n 
condition, while fully 86% of the similar y 
aware men in the football-aggressive-meamns 


? Unpublished experiment, in preparation. 


p gave fewer shocks than their nonaware 
parts. The significant results obtained 
tudy apparently arose in spite of this 
ss rather than because of it. 
r University of Wisconsin experiments 
uso demonstrated this negativism at 
my original report of the Berkowitz- 
tudy, I mentioned an experiment by 
Bragg ? dealing with conformity. In- 
í measuring the subjects’ awareness of 
imental purpose after the fact, Allen 
ragg deliberately created three aware- 
Is. In psychology classes, one group 
told about conformity experiments, 
group was given only general infor- 
;ibout this line of research, and a 
up was provided with detailed in- 
about conformity experiments. 
ifterward, when each person was ex- 
the conformity situation, the investi- 
ound that the subjects displayed less 
ty the greater and more exact their 
ve of the experimental procedure and 


indings mentioned so far can readily 
stood in evaluation-apprehension 
"terms. The subjects were unwilling to exhibit 
socially disapproved behavior (either aggres- 
‘sion or conformity) to the extent that they 
knew the study was concerned with this be- 
havior. But another process might also be at 
work. In accord with Brehm's (1966) reac- 
tance theory, subjects might regard the ex- 
perimenter's expectation as an unwelcome 
pressure on them. In resisting this pressure, 
they may then show relatively little of the 
behavior they think the experimenter wants, 
even if this action is socially desirable. 

Our University of Wisconsin research on 
help giving has obtained a number of results 
along these lines. For example, in an unpub- 
lished experiment by Hildebrand and Berko- 
Witz college women listened to a brief tape 
‘recording of a discussion between two women 
in which one said she would help the other. 
After this, each subject had an opportunity 
to help the experimenter. Two judges listened 
to the postsession interview and rated the sub- 
ject’s awareness of the true purpose of the 


^ Unpublished experiment, in preparation. 
‘Unpublished experiment, in preparation. 
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study (with very high agreement between 
them). We again found that awareness was 
negatively related to the supposedly demanded 
behavior. Ten of the 13 highly aware sub- 
jects showed a level of helpfulness that was 
below the condition mean, but only 9 of the 
23 low awares were less helpful than the 
average subject in their condition. The tape 
recording should have been a demand for 
helpfulness. Why did the women who were 
aware of the experimenter's interest in their 
helpfulness act in a less helpful manner? If 
we assume the awareness had arisen before 
the action, they evidently exhibited reactance 
but not compliance. 

Yet another study in this area is especially 
noteworthy because the experimenter's im- 
plicit pressure on the subject was deliberately 
manipulated. Berkowitz, Klanderman, and 
Harris (1964) led their subjects to believe 
that a peer needed their help, and then in- 
formed each person that the experimenter 
either (a) would learn of their effort in their 
partner's behalf right away or (b) would not 
see how much help they had given their part- 
ner until the end of the semester several 
months from then. In half of the cases, the 
experimenter and the subject were of the same 
sex, while in the remaining instances they 
were of the opposite sex. From a reactance 
viewpoint, we would expect the subjects to 
feel under the greatest implied pressure to 
aid their partner when the experimenter was 
immediately aware of their effort and was of 
the same sex (since most subjects would vol- 
untarily want to impress an investigator of 
the opposite sex). 

Analysis of the subjects’ work scores yielded 
a statistically significant interaction entirely 
in accord with this expectation. When the 
experimenter was of a different sex, the sub- 
jects worked harder for their partner the 
sooner the experimenter would see their work; 
they wanted to gain the opposite-sex experi- 
menter’s approval if he (or she) would learn 
of their effort right away. By contrast, if the 
experimenter was of the same sex, they ac- 
tually expended less effort if the experimenter 
would soon see their work; this person’s 
awareness of their performance apparently 
was an unwelcome pressure. 


nn oom 


In presenting this evidence of subject re- 
actance, I certainly do not mean to suggest 
that every college student will act in this 
negativistic manner when in a laboratory ex- 
periment. Somewhere in this broad land of 
ours there may be the compliant subjects that 
Orne and Page and Scheidt envision. My 
hunch is that the way subjects are recruited 
does much to determine the degree to which 
subjects act in the fashion that Orne and his 
followers describe. College students might 
want to help the experimenter in every way 
they can if they had felt really free about 
participating in the study and had exerted a 
good deal of effort to serve in the study. Co- 
ercing students to take part in experiments 
through points counting toward their grade 
(as at the University of Wisconsin) or 
through course requirements (as at many 
other universities), on the other hand, could 
create a readiness for reactance. Since most 
subjects are pressed into service, to some ex- 
tent at least, I suspect demand characteristics 
have a far less influential role in determining 
the outcome of the experiment (holding eval- 
uation apprehension constant) than is com- 
monly believed today. 
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only with sbifts on the risk dimension. Quite a few theories bave been devel- 


gesting that (wo or more 


. Theories that attribute choice shifts to 


mechanisms 
hence that more than one of the theories may be correct, 


zh the title of this special issue is 


con t with usage over the past decade, in 
the 5 years we have come to recognize 
tha! . more appropriate designation for this 


type of research might be "choice shift.” Re- 
centis we have learned that the phenomenon 
is a... deal broader and that early empha- 
sis on the risky shift was a historical accident. 
The first researchers in this area (Stoner, 
1961; Wallach, Kogan, & Bem, 1962) em- 
ployed a set of decision problems (the Kogan 


and Wallach “choice-dilemmas” items given 
in the Appendix) that are mainly oriented to- 
ward risk, When they compared the riskiness 
of groups after discussion of these problems 
with the riskiness previously exhibited on the 
same problems by the individual group mem- 
bers, they found the groups more risky than 
the individuals, taking the problem set as a 
whole. Hence, the term “risky shift.” But 
careful examination of their data and the data 
from comparable studies (Brown, 1965; 
Pruitt & Teger, 1967) has revealed that 
groups usually shift toward caution on one 
problem in the original set (Number 12) and 

‘The author wishes to thank Dorwin Cartwright, 
John Greenwood, and Jane Holland for their advice 
9n this manuscript. 
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often shift in that direction on another prob- 
lem (Number 5). Subsequently, other re- 
searchers (Fraser, Gouge, & Billig, 1970; 
Nordhéy, 1962; Rabow, Fowler, Bradford, 
Hofeller, & Shibuya, 1966; $ , 1968; 
Vidmar & Burdeny, 1969) have been able to 
write additional items that shift toward cau- 
tion, Hence, it seems more proper to speak of 
a shift on the risk dimension. 

Other research implies a further broaden- 
ing of the conception of this phenomenon, in 
that shifts induced by group discussion have 
been found on a number of apparently non- 
risk-involving dimensions? Thus, Moscovici 
and Zavalloni (1969) discovered shifts among 
French students in the direction of more posi- 
tive attitudes toward deGaulle and more 
negative attitudes toward Americans; also 
shifts that involved judging pro-deGaulle 
items as more positive and anti-deGaulle 
items as more negative. Doise (1969) showed 
that groups of French students shifted toward 


2 The dimension referred to here and in the terms 
“risky shift" and “cautious shift” involves objective 
risk. Whether shifts on this dimension are accompa- 
nied by shiíts in subjective risk taking is a theo- 
retical matter on which there is some disagreement. 

3 By no means all such items yield shifts. Of 12 
risk-neutral choice dilemmas items used as filler ma- 


terial by Wallach, Kogan, and Burt (1968), only 2 — 


for the men and 3 for the women exbibited sta- 
tistically reliable, group-induced shifts. 


339 


EV mp, 


340 


more extreme opinions of their educational in- 
stitution. Myers and Bishop (1970) found 
that groups of prejudiced American high 
school students shifted toward expression of 
more prejudiced attitudes, while unprejudiced 
students shifted toward expression of less 
prejudiced attitudes. And in a much earlier 
study, Stone (1964) showed that groups ex- 
hibit a smaller autokinetic effect than indi- 
viduals. Assuming that the findings just re- 
ported are reflections of the same phenome- 
non as produces shifts on the risk dimension, 
it seems best to use the most neutral available 
terms to describe the issue under considera- 
tion: choice shift or group-induced shift. 
So long as the phenomenon was viewed as 
a risky shift, it was assumed to have mainly 
practical implications. The research seemed 
to imply that groups should be employed 
when risk taking is desired and individuals 
when caution is desired. Implications for in- 
ternational decision making were even some- 
times drawn (Wallach, Kogan, & Bem, 1964). 
But as evidence began to develop that cau- 
tious as well as risky shifts occur, efforts to 
draw practical conclusions went into eclipse. 
(Indeed, the dangers involved in hasty gen- 
eralization of laboratory findings have never 
been so apparent.) At the same time, and 
especially in light of the discoveries about 
shifts on non-risk-involving dimensions, the 
phenomenon has begun to look more interest- 
ing from a theoretical viewpoint. The reason 
for this is the generality implied by the no- 
tion of "choice shift." /f, as it now appears, 
there are many dimensions on which groups 
differ from the average of the individuals who 
compose them and if, in addition, social psy- 
chology can develop an efficient theory about 
these differences, it will be a real accomplish- 
ment. Both of these "ifs" involve big as- 
sumptions, but they are challenging assump- 
tions that have contributed to the attraction 
of an ever-increasing number of researchers to 
this phenomenon. Indeed, so many manu- 
scripts have been prepared on this topic that 
it seemed desirable to devote this entire spe- 
cial issue of the Journal of Personality and 
Social Psychology to this topic. 
The present article, which covers generality 
issues and theories of the choice shift, is 
viewed as a companion piece to the next ar- 
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ticle, by Dorwin Cartwright, which provides 
an excellent summary of the basic findings in 
this field and a methodological critique. A 


number of other summaries of the findings 
and theories about choice shifts have ap- 
peared in recent years, most prominently 
those by Kogan and Wallach (1907b); Dion, 


Baron, Dion, and Miller Clark 
(1971); and Vinokur (1971). However, the 
organization of this and the Cartwright article 
and some of the issues covered and tentative 
conclusions reached are sufficiently distinct to 
warrant adding to the literature at this time. 
In addition, this article serves as an introduc- 
tion to the others in this special issue and is 
essential background for understanding the 


(1970); 


Conclusions section at the end of the issue. 
GENERALITY OF SHIFTS ON THE RISK 
DIMENSION 


The question of generality can be answered 
first in terms of choice-dilemma items, such as 
those in the Kogan and Wallach set. There 1s 
an amazing uniformity in this phenomenon 
from laboratory to laboratory within the 


United States, even to the point of the same | 


decision problems producing shifts of compa- 
rable direction and size (Pruitt & Teger, 
1967). Comparable shifts have been found in 
other countries with Western European cul- 
tures, including Canada (Vidmar, 1970), 
England (Bateson, 1966; Fraser et al., 1970), 
France (Kogan & Doise, 1969), Germany 
(Lamm & Kogan, 1970), Israel (Rim, 1963), 
and New Zealand (Bell & Jamieson, 1970). 
Comparable shifts are found for both men and 
women (Wallach et al., 1962) and for work- 
ers (Jamieson, 1968) and people in profes- 
sional and managerial capacities (Marquis, 
1962; Rim, 1963; Siegel & Zajonc, 1967), as 
well as for college students. j 
Decision problems other than the ES 
dilemmas items have also been used. pe 
concern about generality led to the use 0 
two tasks that involved human performan a 
with real outcomes. Wallach et al. (196 ) 
employed choices among ability-test no 
with different probabilities of sucess. Be d 
Wallach, and Kogan (1965) employed d 
among tasks that allegedly had differe: 
probabilities of noxious physical effects (€-8» 


Mausea resulting from smelling a chemical). 
Both studies yielded shifts toward risk. 
Shifts have also been found in studies of 
"group gambling, though the direction of shift 
has varied with the study. Of the six studies 
‘that have been done, statistically significant 
shifts were found in four; two in a risky di- 
rection (Kogan & Zaleska, 1969; Pruitt & 
Teger, 1969) and two in a cautious direction 
(Atthowe, 1960; Zajonc, Wolosin, Wolosin, & 
Sherman, 1968). 


DirFUSION-OF-RESPONSIBILITY THEORY 


The diffusion-of-responsibility theory, ad- 
vanced by Wallach, Kogan, and their collab- 
orators (Bem et al., 1965; Kogan & Wallach, 
1967b; Wallach & Kogan, 1965; Wallach et 
"al, 1964; Wallach, Kogan, & Burt, 1967), 
explains only shifts toward risk, not shifts 
toward caution. It holds that group experience 
reduces anxiety about the possible negative 
consequences of making the risky decision, 
for example, reduces anxiety about the possi- 
bility that the new electronics firm will fail 
or the football game will be lost, in two of 
the choice-dilemmas problems (see Appendix). 
This happens because the responsibility for 
‘negative consequences can be psychologically 
shifted (“diffused”) from one's own shoulders 
to those of the other group members. Reduced 
anxiety makes it possible to accept the risky 
alternative at a lower probability of success. 

— At first, these authors indicated that group 
decision was the necessary precursor of dií- 
! fusion of responsibility. But later, in light of 
Certain research findings, they argued that 
group discussion is a sufficient basis for re- 
Sponsibility diffusion, providing that emo- 
tional bonds develop or exist between the 
‘group members. 

The authors of diffusion-of-responsibility 
theory cite several research results in its sup- 
- port. Two of these allegedly support the as- 
¿£ Sumption that responsibility diffusion pro- 
duces the risky shift. One is a nearly signifi- 
cant finding that more anxious individuals 
Show a greater shift toward risk (Kogan & 
Wallach, 1967a). This would be expected if 
‘We assume that more anxious people are more 
Concerned about negative outcomes and hence 
More profoundly affected by anxiety reduc- 
tion when they have a chance to diffuse re- 
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sponsibility. The other source of support is 
the finding that group discussion produces a 
shift toward attempting more difficult test 
items (and thus toward greater risk) when 
success or failure on these items affects the 
welfare of the entire group than when it af- 
fects only the welfare of a single group mem- 
ber (Wallach et al, 1964). This finding is 
assumed to support diffusion-of-responsibility 
theory because it can be argued that group 
members feel more responsible in the former 
than in the latter case and hence have more 
responsibility to diffuse. However, alternative 
explanations are possible. 

There are also three studies that can be 
construed as providing support for the hy- 
pothesis that emotional bonds among group 
members encourage the risky shift. In two of 
these, an attenuated risky shift was found in 
groups of people whose personalities caused 
them to be aloof or fearful of interpersonal 
bonds (Kogan & Wallach, 1967a; Wallach 
et al., 1967). In the third study, a positive 
correlation was found between group cohesive- 
ness and the size of the risky shift (Pruitt & 
Teger, 1969). It should be noted that accep- 
tance of the emotional-bonds hypothesis does 
not necessarily imply acceptance of the as- 
sumption that responsibility diffusion pro- 
duces the risky shift (Dion et al., 1970). 

Against these five relatively inconclusive 
pieces of evidence in favor of diffusion-of- 
responsibility theory, five arguments against 
the theory have been developed by various 
authors. None of them is absolutely con- 
clusive, but the entire set is impressive. 


1. Much of the evidence for the existence 
of a risky shift involves hypothetical dilemma 
problems. Since these problems do not entail 
real outcomes, it is hard to see how the ca- 
pacity to shift responsibility for these out- 
comes onto the shoulders of others can have 
any effect on behavior (Lamm, 1967). Advo- 
cates of  diffusion-of-responsibility theory 
might respond to this criticism by saying that 
subjects get so deeply involved in role play- 
ing that the outcomes are real to them. 

2. The risky shift persists up to 6 weeks 
after the group discussion (Wallach et al., 
1962). Bateson (1966) has pointed out that 
“it seems a little farfetched” to believe that 
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after 6 weeks the subjects “still felt that the 
other group members shared responsibility for 
their present decisions, as the responsibility- 
diffusion hypothesis would have us believe 
[p. 120]." In defense of diffusion-of-responsi- 
bility theory against this criticism, it should 
be pointed out that in the relevant study, the 
shift persisted for 6 weeks after the subjects 
had participated in group decisions. These 
decisions could conceivably have been caused 
by diffusion of responsibility and still have 
produced changes in attitudes toward the risk 
levels that persisted after 6 weeks when all 
vestiges of diffusion of responsibility had dis- 
appeared. 

3. Full risky shifts are found when subjects 
watch a live discussion group (Lamm, 1967), 
and partial shifts are found when subjects 
only listen to a discussion (Lamm, 1967) and 
even when they listen to a recording which 
they know to have been made by a previous 
group (Kogan & Wallach, 1967c). It is hard 
to see how a subject can place the blame for 
the outcomes of his decisions on persons with 
whom he has had no interaction or who, in 
the case of the last finding, are no longer in- 
volved in the study. 

4. Pruitt and Teger (1969) found no risky 
shift in a gambling situation where groups 
made a decision on level of risk without dis- 
cussion (by voting) after discussing a set of 
irrelevant issues. Yet, this condition seems to 
provide (a) an opportunity for diffusing re- 
sponsibility to others who, in addition to one- 
self, are clearly responsible for the decision 
and (5) the requisite emotional ties. 

5. Diffusion-of-responsibility theory lacks 
parsimony, in that it can only account for 
shifts toward risk and must explain reliable 
shifts toward caution in terms of some other 
mechanism. In the same vein, many of the 
persistent findings surrounding the risky shift 
(such as the high correlation that is always 
found across items between initial risk and 
risky shift) cannot be predicted from dif- 
fusion-of-responsibility theory, nor does this 
theory imply choice shifts on nonrisk dimen- 
sions. While arguments in terms of parsimony 
are never conclusive, it seems reasonable to 
attempt to explain all these seemingly related 
phenomena with a single theory, which at this 

point seems very unlikely to be diffusion-of- 
responsibility theory. 
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FAMILIARIZATION THEO! 

The familiarization theory origi: 
Bateson (1966), who reasoned tha: 
happening in group discussion was " 
miliarization with the items dix 
creased familiarity should make p« 
willing to take risk on these items ! 
a general reduction in uncertainty. | 
that any procedure that increases fa 
will cause a risky shift, including | 
that are done with isolated individua: 
study, Bateson had some subjects 
choice-dilemmas items, and others write briefs 
on them, noting “all the points for and ag«inst 
the possible decisions.” Risky shifts of equiva- 
lent sizes were exhibited under both pro- 
cedures, supporting the theory. Bateson'- ind- 
ings were successfully replicated by Fl: ders 
and Thistlethwaite (1967). 

Like diffusion-of-responsibility theo 
miliarization theory lacks generality, 
it is unable to explain shifts toward cav 
shifts on nonrisk dimensions. It also i 
account for most of the findings surro 
the risky shift. However, unlike diffusion 
responsibility theory, it seems to have 
solid evidence behind it. 

But how good is this evidence? Quite a few 
researchers have asked this question and re- 
peated the two supporting studies. Without 
exception, they have all failed to replicate the 
familiarization effect (Bell & Jamieson, 1970; 
Fraser, 1970; St. Jean, 1970; two groups of 
researchers cited in Dion et al, 1970; and 
two groups of researchers cited in Teger, Pru- 
itt, St. Jean, & Haaland, 1970). Yet, the 
group-discussion-induced risky shift has been 
easily produced in all of these laboratories. In 
conclusion, the results obtained by Bateson 
(1966) and Flanders and Thistlethwaite 
(1967) do not appear to have been due to sim- 
ple familiarization, nor can the familiarization 
theory of the risky shift be taken very seri- 
ously. 
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LEADERSHIP THEORY 


The basic tenet of the leadership theory is 
that higher risk takers are more persuasive 10 
group discussions that produce a risky shift 
(Collins & Guetzkow, 1964; Marquis, 1962). 
There are various versions of this theory, some 
of which also state that cautious individuals 
are more persuasive in discussions that pro- 
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duce s shift, The main evidence for 
this th hat in discussions of the origi- 
nal V ind Kogan choice dilemmas, 
higher cers are perceived by their fel- 
low gr: ibers as having been more in- 
fluentis ie discussion (Wallach et al., 
1962; \\.ilach, Kogan, & Burt, 1965). Simi- 
lar evidence has been found with items that 


shift toward caution, where more cautious in- 
dividuals are perceived as more influential 


(Brown, 19 Rabow et al., 1966), and 
with nor volving items (Doise, 1969). 
While th idence suggests that there may 
be some ; leadership theory, it must be 
taken w zrain of salt because of an 
alternatis ial interpretation, namely, that 
subjects he highest risk taker (or lowest 
risk take the cautious case) as particu- 
larly infi il because they have shifted 
toward h her than because he has been 
particulari; ‘rfuential. 

One vei jf leadership theory has been 
clearly di en, namely, that people who 
are inher: high risk takers are also in- 
herently hi; persuasive. In the relevant ex- 
periment, Waach, Kogan, and Burt (1968) 
showed thei subjects who accepted higher 
levels of risk on the choice-dilemmas problems 
Were not viewed by fellow group members as 


more influential in discussions of risk-neutral 
decision problems. However, it must be noted 
that this version of the theory was something 
of a straw man from the outset, since leader- 
ship ability cannot be construed as a strong, 
pervasive personality trait (Cartwright & 
Zander, 1968), and there is also some doubt 
about the trait of risk taking (Kogan & Wal- 
lach, 1964; Slovic, 1962, 1964). 

A second version of leadership theory has 
received more support: the notion that people 
Who make a decision involving higher risk on 
any given item are more confident in their 
Position, and that this confidence translates 
itself into greater assertiveness and therefore 
Steater influence during group discussion 
(Burnstein, 1960). While stated in such a 
Way as only to explain the shift toward risk, 
this theory can conceivably be extended to 
the cautious shift by asserting that on cau- 
lion-oriented items, people who give more 
Cautious answers have greater confidence. 

his theory can be called leader-confidence 

cory, Evidence concerning the relationship 


INTRODUCTION 


343 


between individual risk taking and confidence 
is mixed. Using laboratory gambling pro- 
cedures, both Burnstein (1969) and Clausen 
(1965) found a positive correlation between 
these variables. But on the Kogan and Wal- 
lach choice-dilemmas problems (which are 
crucial if this version of leadership theory is 
to explain the risky shift as it is usually de- 
fined), neither Stoner (1961) nor Teger and 
Pruitt (1967) found any such correlation. 
Evidence of a relationship between confidence 
and the risky shift has been found by Clausen 
(1965) in a laboratory gambling situation. In 
her study, a risky shift appeared in groups 
where the high risk member was also high in 
confidence, but disappeared in groups where 
the high risk member was low in confidence. 
These results are sufficiently provocative to 
deserve replication with the standard choice- 
dilemmas methodology. 

The results of a study by Teger and Pruitt 
(1967) have sometimes been construed as 
evidence against all leadership hypotheses 
(Dion et al, 1970). The relevant condition 
involved no group discussion; instead, group 
members exchanged information about their 
answers to the choice-dilemmas items, by hold- 
ing up cards stating their preferred probabil- 
ity levels, and then made individual decisions. 
This procedure yielded a risky shift, although 
it was of smaller amplitude than that ob- 
tained from regular group discussion.* It can 
be argued that leadership was not possible in 
this condition, since potential leaders did not 
have the opportunity for self-expression that 
is accorded by a regular discussion. However, 
this point is not well taken, because the in- 
formation-exchange condition entailed face- 
to-face interaction and hence permitted non- 
verbal communication, which is a perfectly 


4Partial shifts toward risk resulting from the 
exchange of information about risk preferences have 
also been reported by Bell and Jamieson (1970), 
Clark and Willems (1969b), St. Jean (1970), and 
Willems and Clark [Relative tenability of the dif- 
fusion-of-responsibility and risk-as-value hypotheses 
in the risky shift, unpublished], though not all of 
these shifts were statistically significant. On the 
other hand, Murdoch, Myers, and Smith (1970) and 
Wallach and Kogan (1965) were unable to obtain 
the finding, though the method used in the latter 
study leaves something to be desired (Teger & Pruitt, 
1967). 
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reasonable vehicle for leadership. The fact 
that only a partial risky shift was found in 
this condition is consistent with leadership 
theory thus conceived. While nonverbal com- 
munication is a possible vehicle for the exer- 
cise of leadership, the combination of verbal 
and nonverbal communication that is afforded 
by a full-scale discussion is probably better, 
and hence should permit more effective lead- 
ership. 

Evidence from the previously cited study 
by Lamm (1967) is consistent with the in- 
terpretation just presented. He found a full 
risky shift when the subject both watched and 
listened to a group discussion, but only a par- 
tial shift when he merely listened to the dis- 
cussion. Leadership theorists could explain 
these data by asserting that the possibility of 
nonverbal communication was eliminated in 
the listening-only condition. This presumably 
reduced the number of ways in which high-risk 
group members could demonstrate confidence 
and act assertively, thereby diminishing the 
size of the risky shift. 

The explanations of the Teger-Pruitt and 
Lamm results previously presented take an 
interesting form. They argue that a single 
process is at work in the risky shift (leader- 
ship by the high risk taker) and that this 
process is given greater scope in the procedure 
that yields the greater risky shift (i.e., in the 
fully experienced group discussion, as opposed 
to mere information exchange or mere listen- 
ing to the discussion). Another way to in- 
terpret data of this kind is to assert that 
there are two different processes at work, one 
of which operates in the procedure that yields 
the smaller risky shift and both of which op- 
erate in the procedure that yields the larger 
shift. In other words, this latter approach as- 
serts that two theories of the risky shift are 
correct and that their effects summate in the 
regular group discussion. 

Kelley and Thibaut (1969) have adopted 
the two-process approach in their interpreta- 
tion of the Teger-Pruitt information-exchange 
findings. They argued that these findings favor 
Brown’s value theory (see the following sec- 
tion) as a partial explanation for the risky 
shift; but when verbalizations are added to 
information exchange in the full discussion 
condition, an additional process comes into 
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play. This process is summed up in a third 
version of leadership theory, the so-called 
*"rhetoric-of-risk" version, which holds that 
the structure of the language makes argu- 


ments for risk more dramatic and hence in- 
herently more persuasive than arguments for 
caution. Therefore, a large risky shift is ob- 
tained when people are permitted to argue for 
their positions, as well as exchange informa- 
tion about them. There is little evidence to 
date for or against the rhetoric-of-risk version 
of leadership theory, which can be applied to 
the cautious shift only with some difficulty. 

In summary, the first version oí leadership 
theory, that high risk takers are inherently 
more persuasive, is clearly wrong. However, 
the evidence against it cannot, as sometimes 
happens, be used to fault all other versions. 
Two other versions attribute leadership to the 
man who takes the highest (or, in the case of 
caution-oriented items, the lowest) level of 
risk on any given item, regardless of how he 
behaves on other items. One version traces his 
influence to his high level of confidence, the 
other to the persuasiveness of his rhetoric. 
Both theories are viable as explanations for 
at least part of the shift; and there is some 
evidence in favor of the confidence version. 
A fourth version of leadership theory, release 
theory, will be presented in the next section, 
because it is also an example of value theory. 


VALUE THEORIES 


We come now to a set of five theories that 
have in common the assumption that groups 
shift in a direction toward which most mem- 
bers of the group are already attracted as in- 
dividuals. The names of these theories are 
social-comparison theory, pluralistic-ignorance 
theory, release theory, relevant-arguments 
theory, and commitment theory. These names 
describe the assumed process, whereby indi- 
vidual attraction toward one end of the scale 
is translated into group shift. These theories 
are called *value theories" because they seek 
the cause of this attraction, and hence the 
energy behind the shift, in widely held human 
values. However, the assumption that the 
shift is powered by values is not apparently 
fundamental to most of these theories, in 
that other sources of attraction toward one 
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of the scale seem capable of activating 
shift-inducing processes postulated. 

The main piece of evidence supporting all 

f these theories is a high positive correlation 
oss items that is always found between the 
average initial risk on an item and the amount 
lof shift toward risk (Clark & Willems, 
1969a; Pruitt & Teger, 1967; Teger & Pruitt, 
1967). A related finding is that people take 
an initially riskier approach to items that 
shift toward risk than to items that shift 
toward caution (Fraser et al., 1970; Stoner, 
1968). These findings support the funda- 
mental assumption underlying value theory, 
because the initial risk taken on an item can 
“be viewed as an index of the end of the scale 
‘toward which most individuals are attracted 
and the extent of this attraction. The basic 
assumption of value theory might also be 
‘construed as predicting that for any item, a 
positive correlation will be found across groups 
between initial risk and risky shift. But a 
small negative correlation (averaging out to 
—07) was found by Teger and Pruitt 
(1967), who argued that the eífect predicted 
by value theory was obscured by a regression 
"toward the mean. This argument is plausible, 
in that regression can be expected in averages 
based on only four or five data points (the 
groups consisted of four or five members) and 
in that a larger negative correlation (—.32) 
was found in the control condition where the 
effect of regression toward the mean can be 
Seen in its pure form. 

An even more direct test of the assumption 
that groups shift in a direction toward which 
most members are already attracted can be 
seen in a study by Fraser et al. (1970), which 
employed Likert-type options (ranging from 

strongly recommend Alternative X” through 
neutrality to “strongly recommend Alterna- 
tive Y”) instead of the usual set of probabili- 
ties that Alternative X will be successful. The 
advantage of this procedure is that the direc- 
tion of initial preference can be more precisely 
Measured than with the usual procedure. 
‘These authors found a “polarization” effect 
Such that (a) most items shifted further in 
the direction of the average initial preference 
shown on them, and (5) most groups shifted 
further in the direction of the average initial 
Preference of their members. Unfortunately, 
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in the analysis of these data, Finding b was 
not completely independent of Finding d, so 
that we cannot be sure about the behavior of 
groups on any individual item. But the results 
do appear to support the basic assumption of 
value theory. 

Findings quite similar to those of Fraser 
et al. have been obtained in three studies of 
shifts in attitudes (Doise, 1969; Moscovici & 
Zavalloni, 1969; Myers & Bishop, 1970). 
Hence, the value theories would appear to be 
leading contenders for explaining the general 
choice shift, which might be better called the 
"extremity shift" in view of these findings. 

The study by Fraser et al. is interesting in 
another way. They used both the Likert-type 
options and the ordinary probability options 
with the same choice-dilemma stems. The two 
forms of response yielded comparable find- 
ings on both initial scores and shift scores, 
suggesting that they are measuring roughly 
the same thing. This is potentially important, 
because we might conclude from it that the 
Likert-type response best expresses what is 
being measured on choice-dilemma items and 
hence what is shifting. This would mean that 
we are dealing with shifts in attitude toward 
(i.e. the perceived acceptability of) the risky 
alternative, as opposed to the cautious alter- 
native, and not with shifts on a dimension in- 
volving different levels of risk, as has always 
been assumed heretofore. This conclusion may 
have implications for generalizing laboratory 
findings on risky and cautious shifts to the 
real world. But it does not appear to cause 
trouble for existing theories about the proc- 
esses underlying these shifts, all of which can 
apparently be translated into theories about 
shifts in attitude. 

In summary, the basic assumption of value 
theory, that groups shift in a direction toward 
which the individual members are already 
attracted, has received a good deal of support. 
Some of the evidence favoring this assumption 
also implies that the basic shift phenomenon 
may be an intensification of attitude for or 
against the attractive alternative, rather than 
a shift on some substantive dimension such as 
subjective risk. The conception of the shift as 
a change in attitude seems compatible with 
all existing theories of this phenomenon. 
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Social-Com parison Theory 


There are several versions of social-com- 
parison theory. These differ from one another 
in their assumptions about the values that 
underlie attraction toward the risky or cau- 
tious alternative rather than their assumptions 
about the process that produces shifts. 

The most prominent version of value theory 
can be called the risk-as-value social-com- 
parison theory. Developed by Brown (1965), 
it postulates that riskiness is a culturally pre- 
scribed value or ego ideal which causes the 
typical American to want to be at least as 
risky in his behavior as other people similar 
to him. This implies a social-comparison 
process in which the individual tries to figure 
out where other people stand on the decision 
problem he is facing, and then chooses, as his 
initial decision, a level of risk that is at or 
above what he assumes to be the group aver- 
age. Later, actual social comparison takes 
place in the group discussion. In Brown's 
words: 


when individuals talk together and disclose their 
decisions the actual distribution is made known. 
"Those who find themselves below the mean of the 

- . members of the group discover that they are 
failing to realize the ideal of riskiness that they 
may have thought they were realizing. Consequently 
they feel impelled to move in a risky direction both 
in accepting the decision of the group and in 
Changing their private opinions. Subjects at or above 
the group mean feel no such impulsion, they are 
relatively risky just as they meant to be. The result 
would be, of course, a shift in the group decision 
toward greater risk than the mean of the individual 
decisions [p. 701]. 


The theory also explains cautious shifts in 
terms of a "value on caution," which impels 
people in some problems to be equally or more 
cautious than the group average. 

For the purpose of presenting evidence 
favoring Brown's theory, it is convenient to 
divide this theory conceptually into two 
parts: (a) the assumption that risk is valued 
on problems that shift toward risk and that 
caution is valued on problems that shift 
toward caution and (5) the assumption of 
a social-comparison process in which people 
choose a level of risk that is at the assumed 
group average or in the valued direction 
beyond this average. 

Results from a study by Madaras and Bem 
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(1968) are often cited as support the first 
assumption. They found that ficti ous indi- 
viduals who purportedly had end sed risky 


decisions on risk-oriented items were rated 
more favorably on a semantic ferential 
scale than those who had endorse cautious 
decisions. However, these results appear to 
have been overinterpreted as port for 
Brown's first assumption. Rather, they seem» 
only to offer support for the basic assumption 


underlying all value theories, that people are 
generally attracted to the alternative toward 
which groups shift, and hence to people who 
find that alternative attractive. 

As will be shown presently, some of the 
evidence that supports the second assumption - 
can also be construed as supporting the first. 
Further evidence for the value assumption 
is presented in later sections of this article 
on pluralistic-ignorance theory and release 
theory. 

Support for Brown's second assumption, of 
a social-comparison process, comes from three 
other results. Several studies have shown that 
in their initial decisions, people view others 
as making more cautious decisions than them- 
selves on choice-dilemma problems that shift 
toward risk and more risky decisions on 
problems that shift toward caution (Levinger 
& Schneider, 1969; Pruitt & Teger, 1967; 
Wallach & Wing, 1968). This finding can also 
be seen as support for the contention that 
risk is a value, if we assume that people 
believe, on the average, that they are more 
successful than others in adhering to their 
values. 

Indirect support for the social-comparison 
assumption can also be derived from a study 
by Vidmar (1970). This author reasoned that 
a social-comparison process would produce a 
larger risky shift the more heterogeneous 
(farther apart) group members were in initial 
risk, because low risk takers would have to 
move further to place themselves at or above 
the group average in heterogeneous groups. - 
His data support this hypothesis, as did data 
from a similar study by Willems and Clark 
(Clark, 1971). Further indirect support for 
Brown's second assumption can be seen in 
the results of a study discussed earlier (Teger 
& Pruitt, 1967). These authors interpreted 
Brown's theory as implying that exchange 


OO — S TE E o 


RISKY-SHIFT INTRODUCTION 


of info ıbout initial risk levels with- 
out d ould provide all the informa- 
tion : social comparison of risk 
levels ld therefore produce a risky 
shift. A ned earlier, they found a 
risky sh information exchange. How- 
ever, it be noted that this shift was 
not as la those found after discussion. 
This latte ling led them to hypothesize a 
two-process ‘heory, that the social comparison 
mechanism accounts for only part of the shift 
and that discussion brings another shift- 
inducing 1 nism into operation. This evi- 
dence wo lso be compatible with a single- 
process s »mparison theory if it could 
be assum: the social-comparison mecha- 
nism wh ctions under information ex- 
change i: nented by group discussion. 
Howeve: : point a single-process theory 
is less co: ng than two-process theories, 
because i xard to imagine a source of 
such augn ion. 

Two s have failed to confirm predic- 
tions from Frown’s risk-as-value social-com- 
parison the The first of these (Pruitt, 
1969) the assumption that “people 
mean to ky” in their decisions (Brown, 
1965, p. 7 Subjects were asked to make 
decisions o the Kogan and Wallach items 
and also to rate all six options as either 


"risky" or cautious." When the ratings were 
made first and the decisions second (in a 
separate booklet), people tended to decide in 
favor of options which they had previously 
rated as "cautious," This is hard to reconcile 
With the notion that people intend to be 
risky." The second nonconfirmatory study was 
performed by Madaras and Bem (1968). 
These researchers arranged a situation in 
Which pretest and posttest measures were 
taken on 10 risk-oriented problems, but in 
Which groups discussed only 5 of these 
Problems, They reasoned from Brown's theory 
*In another part of Pruitt’s study, decisions were 
Made first and ratings second. Here the options 
pos Were typically rated as “risky.” This suggests 
Bt people tend to imagine or portray their past 
havior as risky rather than, as Brown supposed, 
Sonsciously taking risk in current decisions. This 


“search is examined in more detail in the section 
On release theory, 


347 


that 


if the crucial information transmitted in the groups 
is the risk level of others, a hypothetical subject 
would be cognizant of his relative conservatism on 
the undiscussed problems after group discussion of 
other problems, and would alter his risk preferences 
on the undiscussed problems as well [p. 349]. 


This hypothesis was not supported. 

Though not responding directly to the 
negative findings cited in the last paragraph, 
Stoner (1968) has articulated a somewhat 
revised version of Brown’s theory that seems 
to accommodate these findings. The revision 
is in the value assumption, rather than the 
social-comparison assumption. He did not 
speak in terms of a single value of risk or 
caution, but argued that each of the choice- 
dilemma problems is answered in terms of 
a different value or set of values (e.g., your 
own physical comfort, winning games of a 
competitive nature, working on a job which 
has potential for advancement and challenge). 
The effects of these item-specific values are 
accentuated in group discussion through a 
social-comparison process similar to that 
postulated by Brown. Hence, Stoner’s theory 
can be called a multiple-values social-com- 
parison theory. In his experiment, Stoner 
measured, on an ordinal scale, the values 
associated with a set of choice-dilemma items. 
His results demonstrated that on each prob- 
lem, the relative importance of the value 
embodied in the risky alternative, as opposed 
to the cautious alternative, was predictive of 
both the initial risk and the direction of 
group-induced shift. 

Stoner’s version of social-comparison theory 
accommodates the Pruitt findings because it 
does not assume that people are trying to be 
risky. It accommodates Madaras and Bem’s 
findings because, in the absence of a common 
value cutting across all risk-oriented items, 
one would not expect what is learned from the 
group discussion of one item to have an 
impact on decisions regarding another item. 
It is supported by all of the evidence that 
supports Brown's theory and seems less 
strained in its handling of cautious shifts 
because it drops Brown's rather unlikely 
notion of a value on caution. Indeed, it seems 
easily adaptable to the choice shift as a 
whole, since it implies that choice shifts 
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should be found on all decisions involving 
values that impel the individual to do as well 
as, or better than, his fellow man. 

Jellison and Riskind (1970) have developed 
a variant of Brown’s theory which links it to 
Festinger’s (1954) theory of social compari- 
son processes. Festinger postulated that peo- 
ple wish to see themselves as having slightly 
more ability than others with whom they 
compare themselves. Jellison and Riskind 
added the assumption that the risk level taken 
by a person is interpreted as an index of his 
ability. From these two assumptions, they con- 
cluded that “When a person learns that the 
level of risk he has initially chosen is at 
or below the group average, he has learned 
that he is defining himself as only as able as, 
or less able than, the other group members 
[p. 377]." This leads him to raise his level of 
risk, thus producing the risky shift. This can 
be called the social-comparison-of-abilities 
theory. 

In support of their assumptions, these 
authors presented evidence that (a) risk 
taking in others is seen as a sign of ability 
and ability as a sign of risk taking, (5) peo- 
ple who are motivated to demonstrate their 
ability take higher risk than people who are 
not so motivated, and (c) people view them- 
selves as more capable and as having a greater 
likelihood of success than others. Related evi- 
dence can be seen in a study by Madaras and 
Bem (1968), in which (male) subjects were 
found to view high risk acceptors as stronger, 
more active, more successful, and more mascu- 
line than low risk acceptors. This provides 
general support for Jellison and Riskind's 
theory, but suggests that risk taking may con- 
note a somewhat broader characteristic than 
simple ability, including strength and activity 
level as well. 

It should be noted that none of the evi- 
dence cited in the preceding paragraph sup- 
ports the contention that social comparison 
of abilities actually produces the risky shift. 
This contention is as yet untested. 

Social-comparison-of-abilities theory is not 
presented as an alternative to Brown's risk- 
as-value theory, but as a supplement that 
explains why risk is valued on risk-oriented 
problems, Nevertheless, the theory can 
be reconciled with Stoner's multiple-values 
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theory (which is clearly a rival to Brown's 
position) and with the Madaras and Bem 
evidence that indirectly supports Stoner's 
theory by postulating the existence of a set of 
values, each tied to a different dimension of 
ability. The items in the Kogan-Wallach list 
can then be assumed to entail different types 
of ability and hence different values that, 
nevertheless, all pull toward risky decisions. 
Hence, what is learned in the discussion of 
one item need not have an impact on deci- 
sions about other items. Like multiple-values 
theory, social-comparison-of-abilities theory 
draws support from all the evidence favoring 
risk-as-value theory and seems to be able to 
handle the two findings that cast doubt on 
the latter theory. 

Social-comparison-of-abilities theory offers 
no distinctive explanation for the cautious 
shift. To account for this phenomenon, Jelli- 
son and Riskind simply hypothesized a so- 
cial comparison on other non-ability-related, 
item-specific values, such as the value of ap- 
pearing to be responsible. 

'The notion of item-specific values provides 
a framework within which it may be possible 
to account for one of the most puzzling facts 
in the field of group risk taking, that cautious 
shifts are harder to obtain and on the average 
considerably smaller than risky shifts. Fraser 
(1970) suggested that this reflects the opera- 
tion of an “overall value of risk," which has 
some impact on all choice-dilemma items, no 
matter what item-specific values are involved. 
He argued that 
On risky items both item-specific factors and the 
general value for risk operate in the same direction, 
hence large risky shiíts. On cautious items item- 
specific factors are partially counteracted by value 
for risk, hence smaller cautious shifts [p. 5]. 
Similarly, elaborating somewhat on Jellison 
and Riskind, one may argue that the risky op- 
tion holds a fascination on all choice-dilemma 
problems, no matter what item-specific values 
may be involved, because of the implication 
that a decision maker who takes this option 
has ability. Further evidence supporting Fras- 
er’s assumption that risk is especially valued 
is presented in the next section. 

To summarize the discussion of social-com- 
parison theory, there is some evidence that 
people decide on a level of risk that is at oF 
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somewhat beyond the assumed group average 
in the direction toward which they are al- 
ready attracted. Such a decision rule can ac- 
count for the group-induced shiít, because 
during the discussion, some people in every 
group are sure to learn that they are below 
the group average, which should cause them 
to move in the attractive direction. There is 
also evidence that values underlie the attrac- 
tion toward risk or caution on choice-dilemma 
items. However, the nature of these values is 
controversial, with at least three versions: 
Brown asserted the existence of unitary values 
of risk and caution. Fraser argued for a uni- 
tary value of risk which operates in addition 
to multiple item-specific values that may pull 
in the same or in the opposite direction to its 
influence. Stoner posulated only item-specific 
values. Regardless of which of these three po- 
sitions is adopted, a strong argument can be 
made for the assumption that the value or 
values underlying risk taking are more com- 
pelling than those underlying caution. There 
is reason to believe that this unusually strong 
attraction toward risk is due to the associa- 
tion people make between risk taking and 
ability or general competence. This assump- 
tion is compatible with both the unitary- and 
the multiple-value positions. Social-compari- 
son theory has some difficulty accounting 
fully for the fact that discussion produces a 
larger shift than information exchange. Hence, 
à two-process theory involving the social- 
comparison mechanism plus some other mech- 
anism may be indicated. 


Pluralistic-Ignorance Theory 


Levinger and Schneider (1969) proposed a 
Reneral theory of choice shifts which assumes 
that decision making in many areas is guided 
by a “conflict-compromise” process. The con- 
flict in question is between an “ideal pref- 
erence,” what the individual would really like 
to do, and an “assumed group standard,” what 
he thinks other people favor. Such a conflict 
leads to choice of a compromise position that 
lies part way between the ideal and the as- 
Sumed group standard. In the standard ex- 
perimental paradigm, group discussion re- 
Veals other people's choices, which may lead 
to a readjustment of the assumed group stan- 
ard and, as a result, a shift in the individual's 
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decision. If, as typically happens, other group 
members are found to be closer to the indi- 
vidual's ideal than he had assumed, he will 
shift toward his ideal. This can be called a 
“pluralistic-ignorance” theory of choice shift, 
because this term has been used to describe a 
situation in which the members of a group 
embrace one attitude but believe that others 
embrace another (Schanck, 1932). 

This theory can be applied to the shift 
toward risk if the familiar assumption is made 
that people value risk or the outcomes associ- 
ated with risk but do not realize how widely 
their values are shared. Under such circum- 
stances, group discussion should reveal more 
social support for risk taking than anticipated 
and thus allow the individual to move toward 
greater risk. A comparable explanation can be 
developed for shifts toward caution. 

Pluralistic-ignorance theory is supported by 
most of the evidence that supports social- 
comparison theory, though unlike the latter 
theory, it does not specifically imply Vidmar's 
finding of a larger risky shift with more 
heterogeneous groups. It rests on a reinterpre- 
tation of the finding that people view others 
as more cautious than themselves on choice- 
dilemma problems that shift toward risk. 
Brown attributed this finding to the operation 
of a value of risk in the context of social com- 
parison. Perhaps more reasonably, Levinger 
and Schneider treated it as evidence that peo- 
ple see the group standard as relatively cau- 
tious. 

Levinger and Schneider. performed a study 
of initial decisions on the Kogan-Wallach 
items, in which they asked the subjects to 
indicate not only their own decisions but also 
*the answers which you believe a majority of 
your fellow students here would make for 
each item" (assumed group standard) and 
“the choice you would admire most" (ideal 
preference). On the 10 risk-oriented items, 
they obtained the usual finding that fellow 
students (.59) were viewed as more cautious 
than oneself (.52). This supports both social- 
comparison theory and pluralistic-ignorance 
theory, but for different reasons, as indicated 
in the last paragraph. They also found, on 
the risk-oriented items, that the admired level 
of risk (.41) was well in excess of the sub- 
jects’ own level of risk (.52). This is the best 
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piece of evidence known to the present author 
that ego ideals underlie attraction toward the 
risky alternative on risk-oriented items, an 
assumption made both by Brown and by Lev- 
inger and Schneider. The two results just re- 
ported add up to a finding that the average 
level of risk chosen by the subjects lay part 
way between the average level of risk attrib- 
uted to fellow students and the average level 
of risk most admired. This supports Levinger 
and Schneider’s contention that decisions re- 
sult from a compromise between the assumed 
group standard and the ideal preference. 

According to pluralistic-ignorance theory, 
on caution-oriented items, the decision at- 
tributed to others should be riskier than one’s 
own decision, And the latter should, in turn, 
be riskier than the most admired decision. 
The first of these predictions was confirmed 
on the two items of this type (Numbers 5 
and 12) in the Kogan-Wallach set (perceived 
fellow students averaged .69 and own deci- 
sions, .72). But the second was not (most 
admired decisions averaged .66), calling into 
question the assumption that the values under- 
lying the cautious shift are ego ideals. How- 
ever, in a later study by Fraser (1970), em- 
ploying an unknown set of five caution-ori- 
ented items, both predictions were supported. 

Some of Fraser’s other findings can be 
viewed as pertaining to the puzzling fact men- 
tioned earlier that cautious shifts are harder 
to obtain and, on the average, considerably 
smaller than risky shifts. Fraser reported that 
on his caution-oriented items, the average 
discrepancy between perceived other and own 
decision was 48, while the average discrep- 
ancy between own and admired decision was 
-62. In contrast, on six comparable risk-ori- 
ented items, these values were —.74 and 
—1.39. Comparison between the absolute 
magnitudes of these two pairs of numbers 
suggests that the values supporting risk taking 
on risk-oriented items are stronger than the 
values supporting caution on caution-oriented 
items. This provides further evidence for 
Fraser's earlier mentioned hypothesis that risk 
is particularly valued on all choice-dilemma 
items, no matter what the nature of other 
item-specific values. 

Pilkonis and Zanna (1969) have employed 
an ingenious design to test the Levinger- 
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Schneider theory with risk-orientec items 
one condition (C-O), they provid f 
tion to the subjects indicating tha 

ple are very conservative on a s 
dilemma items. In the second 

( R-O), the information revealed 

are very risky. Subsequent to thi: 

tion, a conventional procedure wa 
involving pretest, group discussion 

test. It was hypothesized that grou 

sion would counteract this false ini 

and cause a change in the assum 
standard. In the C-O condition, the a 
group standard should move toward ri 
ducing a risky shift. In the R-O conditi 
movement should be in the opposite di: 
producing a cautious shift. Trends in 
supported both predictions, but the 
R-O condition did not yield cautious shi 
sufficient vigor or universality to per a 
strong endorsement of the theory. This s a 
study that deserves replication with a 
number of subjects and an improved m 
ology. 

According to pluralistic-ignorance theo 
all that is needed to evoke a risky or cau! 
shift is for the group members to learn v 
one another stand on this issue. Therefore 
is true of social-comparison theory, the cap: 
ity of this theory to account for the e;ire 
risky shift in group discussion is called into 
question by Teger and Pruitt’s (1967) often 
replicated finding that exchange of informa- 
tion about preferred risk levels produces only 
a partial shift. Hence it seems more reason- 
able to conclude that the pluralistic-ignorance 
mechanism can account for only part of the 
group-induced shift and that discussion brings 
another mechanism into operation. 

To summarize, in the empirical work based 
on pluralistic-ignorance theory, Levinger and 
Schneider have provided good reason to be- 
lieve that the values which make risk taking 
attractive on risk-oriented items are ego ideals. 
The status of the values associated with cau- 
tion-oriented items is less clear. The evidence 
that underlies social-comparison theory also 
supports pluralistic-ignorance theory with the 
exception of Vidmar's findings, which do not 
contradict this theory but cannot apparently 
be deduced from it. A study by Pilkonis and 
Zanna provides specific support for plural- 


Risky-Suirt INTRODUCTION 


istic-ig theory, but the findings are 
weak »etween social-comparison and 
plural ince theories, existing evi- 
dence g former a slight advantage as 
an expla or that part of the shift pro- 
duced ! hange of information on pre- 
ferred Further evidence could eas- 
P ily reverse idvantage, 

Release Th 

This theory has been briefly described by 
Pruitt (1: , and will be more fully elab- 
orated he :pplies to all sorts of shifts, 
but can be described at first as a theory 
of the ri it. Like pluralistic-ignorance 
theory, ns with the assumption that 
most peop! in conflict when they are 
called uj | commit themselves as indi- 
viduals t vel of risk on a risk-oriented 
decision pr ;. On the one hand, risk taking 
is attractiv: ause of a value of risk or be- 
cause it imois confidence in one's ability to 
Cope with the environment. The forces that 
produce t! ttraction are internalized and 


ego syntonic. On the other hand (and here 
the theory departs somewhat from pluralistic- 
ignorance theory), a cautious approach is 
Compelling because of widely held values at- 
laching to moderation and being reasonable. 
These valucs are adhered to not so much out 
9f conviction as out of a conformity motive, a 
Sense that other people predominantly support 
the so-called “golden mean,” and that it is 
essential to avoid being seen by others as “too 
far out on a limb,” 

[ Evidence that conflict surrounds individual 
tisk taking can be seen in a phenomenon 
talled the “Walter Mitty effect” (Pruitt, 
1969). Subjects were asked to make decisions 
on the Kogan and Wallach items and to indi- 
‘fate which of the six possible options (“1/10” 
P "not") seemed “risky” and which ones 
Cautious.” One group of subjects rated the 
*ptions first and made decisions second (Con- 
"lion A). Two other groups made decisions 
first and rated the options second (Condi- 
tions B and C). Roughly the same decisions 
Were made in all three conditions. But, for 
e risk-oriented items, the options chosen in 


TS. 
Ens Point was not made in the original report 
ause of space limitations, 
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Conditions B and C were much more likely to 
be labeled as “risky” than those chosen in 
Condition A. 

These findings can be seen as evidence of 
conflict or ambivalence, because of the sharp 
difference exhibited between actions and selí- 
presentation. When called on to make an ac- 
tual decision, people chose cautiously, as was 
shown in Condition A. But they described— 
and quite possibly imagined—their behavior 
as risky, as was shown in Conditions B and 
C. The caution-seeking dimension of this ap- 
parent ambivalence can be interpreted as re- 
sulting either from fear that the risky alterna- 
tive will fail or, as postulated by release 
theory, from conformity to an assumed social 
norm of moderation. The risk-seeking dimen- 
sion may be seen as conformity to an assumed 
norm of riskiness (in which case the ratings 
in Conditions B and C could be interpreted as 
self-presentation) or, as postulated by release 
theory, as an effort to achieve self-esteem 
where riskiness is an ego ideal (in which case 
these ratings can be interpreted as manifesta- 
tions of the subject's actual image of himself). 
The latter interpretation portrays the sub- 
ject in this experiment as acting like Walter 
Mitty, in the famous Thurber short story: 
engaging in ego-gratifying fantasies of him- 
self as a high risk taker but acting cautiously 
when faced with an actual decision. 

According to release theory, the risky shift 
occurs in group discussion because the dis- 
covery of a single group member (the model) 
who endorses high risk taking releases the 
more cautious group members from the as- 
sumed social constraints that are holding 
them back from risk taking. In other words, 
finding a high risk taker in their group gives 
them the “courage of their convictions." 

The notion of release from social constraints 
can be related to the phenomenon of “be- 
havioral contagion" (Redl, 1949; Wheeler, 
1970). In contagion, a person who is strongly 
motivated (in the vernacular, tempted") to 
behave in a certain fashion but constrained 
from doing so by ethical considerations or 
social pressure (real or assumed) is somehow 
released from his constraints by observing a 
single model who is behaving in the fashion 
to which he is attracted. As a result, the 
model is imitated. Wheeler and Caggiula 
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(1966) have demonstrated this phenomenon 
in an experiment where it was shown that a 
subject would engage in verbal aggression 
against a confederate who had angered him 
only if another confederate (the model) had 
first aggressed, Impulsive mob behavior is 
sometimes explained in terms of behavioral 
contagion. 

The notion of release (and the phenomenon 
of contagion) may also be related to Asch's 
findings with respect to the impact of a “part- 
ner" on conformity (Asch, 1952). In Asch's 
study, subjects were faced with massive social 
pressure to verbalize judgments that were 
obviously wrong. A considerable amount of 
conformity was found when all other group 
members verbalized the faulty judgment. But 
the presence of only one group member who 
agreed with the subject dramatically reduced 
the amount of conformity, even though the 
five other group members all reported the 
faulty judgment. In terms oí the preceding 
analysis, we can view the subject in Asch's 
studies as in conflict between what he really 
wants to do (report the obviously right 
answer) and social pressure to do something 
else. The presence of a single model who does 
what the subject really wants to do releases 
him from these social pressures and permits 
him to behave as he really wishes. 

Release theory is sufficiently similar to so- 
cial-comparison theory and pluralistic-igno- 
rance theory that it draws support from the 
evidence buttressing both of these theories. In 
addition, Vidmar's (1970) findings, which 
only support social-comparison theory, are 
implied by release theory because the average 
member is further from the high risk model 
in heterogeneous than in homogeneous groups 
and hence has further to move. As is true of 
the other two value theories, release theory 
can probably at best only account for that 
portion of the full discussion-induced shift 
that results from exchange of information 
about where others stand on an item. 

There is a difference between release theory 
and the other two value theories that provides 
the germ of a crucial experiment, Social-com- 
parison theory and pluralistic-ignorance the- 
ory both apparently take the view that indi- 
vidual group members respond to what they 
perceive to be the average level of risk taking 
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among the other group members. Social-com- 
parison theory holds that the individual will 
try to be at or above this average (see Brown, 
1965, p. 349, quoted earlier). Pluralistic-ig- 
norance theory assumes that the group experi- 
ence causes the individual to rethink his as- 
sumptions about group standards, group 
standards presumably referring to the behay- , 
ior of the average group member. On the 
other hand, release theory assumes that the 
group member taking the Aighest level of risk 
will become a model for other group members, 
because his behavior best expresses the values 
associated with the risky option, which most 
group members share. Hence individuals will 
be influenced not by the average level of risk — 
taking in their group but by the highest level 
taken. This hypothesis looks like a relatively 
easy one to test. 
A partial test of this hypothesis has been 
performed by Wallach and Mabli (1970), 
though these authors viewed themselves as 
testing a derivation from social-comparison 
theory. Three-man groups were composed on 
the basis of average initial responses to the 
choice-dilemma questionnaire. These groups 
consisted either of two conservative and one 
risky member (Condition a) or one conserva- 
tive and two risky members (Condition b). 
This procedure presumably led to a rough 
equivalency between the twa conditions in the 
average choice made by the highest risk taker 
in a group. Hence, release theory would pre- 
dict roughly equal shifts under the two condi- 
tions. This procedure also meant that the aver- 
age risk level of the other two subjects in an 
individual's group was greater in Condition b 
than in Condition a. Hence social-comparison 
theory and pluralistic-ignorance theory would 
predict a larger shift toward risk in Condition 
b than in Condition a. The results clearly 
supported our prediction from release theory. 
These data also lend themselves to testing 
another prediction from release theory, that 


the degree of shift for low-risk group mem- ~i 


bers is a positive function of the level of risk 
taken by the highest risk taker in the group. 
Unfortunately, Wallach and Mabli did not 
test this hypothesis. 

Release theory has been described so far 
as a theory of the risky shift. However, it can 
be easily extended to cover both the cautious 
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shift and the more general choice shift. In its 
f more general form, it is essentially a theory of 
response polarization, which postulates that 
people view others as supporting a norm of 
moderation and reasonableness and that most 
people are therefore reluctant on their own to 
get too far from neutrality on any matter of 
attitude or decision. Nevertheless, on some 
matters, values lead people to be attracted 
toward extremities of attitude or action—for 
example, toward risk, caution, support for 
deGaulle, etc. A conflict ensues, and the indi- 
vidual strikes a compromise between what he 
really wants to do or say and what he feels is 
sufficiently moderate to be socially acceptable. 
But if he comes into contact with another 
person who takes a more extreme position in 
the direction toward which he is attracted, he 
is likely to take this person as a model—re- 
leasing him, at least in part, from the social 
constraints that bind him and allowing him 
lo move toward a more polarized position in 
what he does or says. 

Stated in this more general form, release 
theory draws support from studies which show 
that on bipolar attitude scales (the kind that 
range from strongly positive to strongly nega- 
tive), others are viewed as more neutral than 
oneself (Fraser et al., 1970; Steiner, 1954). 
Also, it explicitly predicts the finding that 
group-induced shifts involve movement by 
the group away from neutrality (i.e., toward 
One or the other extreme) on bipolar attitude 
scales (Doise, 1969; Moscovici & Zavalloni, 
1969; Myers & Bishop, 1970) and on risk 
items that are set up like bipolar attitude 
scales (Fraser et al., 1970). All of the other 
value theories can accommodate these latter 
Tesults but do not explicitly predict them as 
does release theory. Interpretation of these 
Tesults must take into account McCauley’s 
(1970) finding that group shifts away from 
| Neutrality on attitude and risk items are not 
accompanied by met individual shifts away 
from neutrality. On the other hand, McCau- 
ley’s negative findings may conceivably re- 
ect a combination of two processes, with 
Some initially inhibited individuals released 
y the group experience and becoming less 
Neutral in the valued dimension while other 
ess inhibited individuals, who started out 
More extreme, become more neutral in order 
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to join the emerging group consensus. A good 
mathematical model, which embodies both 
shifts and movements toward consensus, is 
needed to unscramble this literature. 

The extension of release theory to the cau- 
tious shift might be construed as implying 
that a reverse Walter Mitty effect should ob- 
tain on caution-oriented items, such that the 
option chosen should be more often rated 
"risky" when ratings are made aíter as op- 
posed to before decision. However, Pruitt 
(1969) found no effect whatsoever on the 
two caution-oriented members of the Kogan- 
Wallach set (Numbers 5 and 12); the options 
chosen were predominantly rated “cautious” 
in all conditions. This finding might be con- 
strued as evidence against release theory. But 
an alternative approach would be to assume 
that only certain kinds of values are expressed 
in fantasy and therefore capable of showing a 
direct or reverse Walter Mitty effect, the 
values associated with risk taking being ex- 
amples of the kind that are expressed in fan- 
tasy and the values associated with caution 
being examples of the kind that are not so 
expressed. This speculation seems plausible, in 
that the outcomes associated with caution on 
Kogan-Wallach Numbers 5 and 12, avoiding 
the loss of money and avoiding an unhappy 
marriage, do not seem apt subjects for ego- 
gratifying fantasy. 

Release theory can be construed as a fourth 
version of leadership theory, in that it at- 
tributes shifts to the disproportionate influ- 
ence of high risk takers. Unlike the three ver- 
sions of leadership theory discussed earlier, 
which viewed the process of influence as em- 
anating from the behavior of the high risk 
taker, this new version of leadership theory 
attributes the influence of the high risk taker 
to the mere fact of his presence in the group. 
Nevertheless, the finding cited earlier in sup- 
port of leadership theory, that high risk takers 
are seen as having more influence over the 
group discussion than low risk takers, can be 
viewed as support for release theory. This 
finding has been extended to shifts on non- 
risk-involving dimensions by Doise (1969). 

In summary, the assumption that people 
are initially in conflict between risky and 
cautious approaches is indirectly supported by 
the finding that people are more likely to 
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portray past decisions as risky than actually 
to take risk (the Walter Mitty effect). This 
assumption is part of both pluralistic-igno- 
rance and release theories. All of the evidence 
supporting social-comparison theory, as well 
as the basic finding underpinning leadership 
theory also supports release theory. In addi- 
tion, specific support for the latter theory can 
be derived from Wallach and Mabli's find- 
ings. Hence, in terms of existing evidence, re- 
lease theory would seem to have an edge over 
the two value theories previously presented as 
an explanation for that part of the shift pro- 
duced by exchange of information about ini- 
tial risk levels. Of course, this edge may be 
only temporary. 


Relevant-Arguments Theory * 


A fourth form of value theory holds that 
the dominant value or values in a decision 
problem elicit persuasive arguments in group 
discussion that convince group members to 
move further in the direction of these values. 
Though originated by Nordhgy (1962), this 
theory was first presented in the general lit- 
erature by Brown (1965). It has since been 
endorsed by St. Jean (1970), Stoner (1968), 
Teger and Pruitt (1967), and in somewhat 
modified form by Vinokur (1971). 

Relevant-arguments theory is supported by 
all of the findings that suggest the importance 
of values for the choice shift. However, it 
draws no apparent support from the studies 
that buttress the three previously mentioned 
value theories (Pilkonis & Zanna, 1970; 
Pruitt, 1969; Vidmar, 1970; Wallach & 
Mabli, 1970; and the finding that high risk 
takers are seen as most influential in group 
discussion, which was mentioned in connec- 
tion with release theory). However, there are 
two other results that seem to weigh favor- 
ably for this theory. One is Nordhgy’s (1962) 
finding that more statements were presented 
in support of risk than caution when groups 
discussed the Kogan and Wallach choice-di- 
lemma problems. Further support comes from 
a finding by St. Jean (1970), that subjects in 
an arguments-only condition, who were al- 
lowed to discuss only *the pros and cons at- 
tached to the risky action without revealing 


* This term is due to St. Jean (1970). 
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their own risk preferences,” d almost 
as large a risky shift on risk-oriented choice- 
dilemma items as was found in an ordinary 
discussion of these items. 

St. Jean’s finding about the arguments-only 
condition is puzzling in light of the results ob- 


exnibit 


tained by Teger and Pruitt (1967) in their 
information-exchange condition. In terms of 
overt content, the individual group member 
receives totally different information under 
these two conditions; yet a partial risky shift 


is found in one (information exchange) and a 
nearly full shift in the other 
only). One way out of this puzzle would be 
to assume that the relevant-arguments theory 
is totally adequate to explain the risky shift, 
and that in the information-exchange condi- 
tion people are stimulated to imagine the ar- 
guments with which their fellow group mem- 
bers are justifying their choices. The shiít is 
only partial in this condition, because imag- 
ined arguments are neither as rich nor as con- 
vincing as those actually heard from fellow 
group members. Another way out of the puz- 
zle would be to assume that people make 
relatively accurate inferences in the argu- 
ments-only condition about the risk levels 
favored by their fellow group members. 
Hence, the arguments-only condition carries 
much the same information as a full group 
discussion; and a two-process theory, like 
those proposed earlier, is possible. Such a 
theory might employ a social-comparison, 
pluralistic-ignorance, or release mechanism 
to explain the impact of knowledge or infer- 
ence concerning others' risk levels. It might 
employ relevant-arguments or leadership 
mechanisms to account for the additional im- 
pact of group discussion. A third way out 
would be to assume that partial shifts result 
from learning about other people's attitudes 
toward the basic risky alternative as opposed 
to the basic cautious alternative. It is reason- 
able to assume that such learning can take 
place in both the information-exchange and 
the arguments-only conditions. As was true 
with the second *way out," this approach 
would imply a two-process theory, with the 
full shift resulting from the experience of 
learning about other people's attitudes plus 
the impact of relevant arguments or leader- 
ship style. This third approach is quite com- 
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patible with the finding mentioned earlier 
that typical risky and cautious shifts occur 
when responses to choice-dilemma items are 
made on Likert-type attitude scales (Fraser et 
al., 1970). 

One question that might be raised within 
the context of relevant-arguments theory con- 
cerns how the dominant value or values elicit 
supportive arguments. There are two possible 
mechanisms: (a) People voice arguments that 
express their own values, and (5) people re- 
ward others for expressing such arguments. 
The latter mechanism is particularly interest- 
ing, because it implies that the frequency of 
arguments favorable to the dominant value or 
values will increase over the course of a dis- 
cussion, reflecting the fact that more and 
more reinforcements have been given for the 
expression of such arguments. This hypothe- 
sis deserves to be tested. 

To summarize, many of the findings re- 
ported earlier as support for other value the- 
ories cannot be deduced from relevant-argu- 
ments theory, though they do not specifically 
contradict it. But there are two results that 
are clearly implied by it. The best way out 
of this puzzle seems to be to postulate two 
processes, one consisting of relevant argu- 
ments and the other of some other value proc- 
ess (e.g, release). Such an approach is im- 
plied by the finding of a partial shift when 
information is exchanged about initial pref- 
erences. 


Commitment Theory 


Moscovici and Zavalloni (1969) have pro- 
Posed a theory which views group discussion 
as a setting for the individual to commit him- 
self further to the approach he has taken in 
his initial decision, rather than a setting in 
which he can learn about other people's 
Choices or opinions. This theory was designed 
to account for both the ordinary discussion- 
induced risky shift and the now largely dis- 
credited finding that a risky shift occurs as a 
Tesult of further familiarization with the 
Items. In the words of these authors: “In the 
Course of handling the information, as he in- 
leracts with real or imaginary interlocutors, 

e Chooses alternatives, binds himself to the 
Choice, and thus commits himself to the work 

€ is doing [p. 127]." Such a commitment is 
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assumed to move the individual further in the 
direction of his initial decision, that is, to- 
ward risk in the case of risk-oriented items 
and toward caution in the case of caution-ori- 
ented items. This theory is classified as a 
value theory, because Moscovici and Zaval- 
loni seem to imply that the value pull of each 
item will determine the direction of the ini- 
tial decision and thus of the further commit- 
ment that ensues during discussion. 

Commitment theory is flexible enough to 
allow for most of the research findings on the 
choice shift. But it would seem to have diffi- 
culty explaining the fact that a risky shift is 
obtained when the subject observes a live 
group discussion (Lamm, 1967) or listens to 
a recorded discussion (Kogan & Wallach, 
1967c). The role of the subject is so passive 
in these studies that it is hard to see how the 
notion of *commitment" applies. These stud- 
ies seem much more easily explainable in 
terms of theories that attribute the risky shift 
to information learned from watching or lis- 
tening to other group members. Furthermore, 
the validity of this theory may be questioned 
on the basis of McCauley's (1970) finding 
that group discussion does not cause indi- 
viduals, on the average, to shift toward the 
extremes of bipolar scales. Last, the con- 
tinued failure to replicate the familiarization 
effect, which was mentioned earlier, is em- 
barrassing to this theory. 


Overview of Value Theories 


A good deal of evidence supports the notion 
that people are initially attracted toward the 
direction in which an item shifts and that this 
attraction results, in large part, from the in- 
fluence of widely held values. The values that 
underlie attraction toward risk in the case of 
items that shift toward risk may well be ego 
ideals and would appear to be quite strong for 
most people. This strength may derive from 
the association which people make between 
risk taking and ability. The values underlying 
attraction toward caution in caution-oriented 
items are less well understood. 

Although there is a certain amount of in- 
direct evidence for the assumption that value- 
based attraction toward one end of the scale 
actually underlies the choice shift, convincing 
direct evidence has not yet appeared. Assum- 
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ing that such evidence will eventually ma- 
terialize, we can distinguish five theories 
about the mechanisms that produce the shift: 
social-comparison theory (in its various ver- 
sions), pluralistic-ignorance theory, release 
theory, relevant-arguments theory, and com- 
mitment theory. Commitment theory seems 
questionable in the light of existing evidence. 
Specific support exists for the other four 
theories, and it is hard to choose among them 
at this time. Of the first three, release theory 
seems to have a slight advantage at this time 
—but new evidence may well change the pic- 
ture. It is distinctly possible that one of the 
first three theories accounts for part of the 
shift and relevant-arguments theory (or possi- 
bly a version of leadership theory) for the 
other part. 


DECISION THEORIES 


A few authors have employed utility-based 
decision theory to conceptualize choices and 
shifts on the risk dimension. This approach 
is reviewed in the article by Cartwright im- 
mediately following this one. It takes two 
forms: (a) The first hypothesis is that group 
discussion causes a change in the utilities 
which individuals assign to the outcomes that 
are associated with the options available. 
Vinokur (1971) has advanced a relevant-ar- 
guments version of this hypothesis, which 
holds that arguments heard during group dis- 
cussion cause utility changes. He pointed out 
that these arguments may reflect widely held 
values. Hence, his position may be viewed as 
a version of value theory, which traces the 
path of causation from values, through argu- 
ments, to changes in utility and thence to 
shifts in risk taking. Diffusion-of-responsibil- 
ity theory (Wallach & Kogan, 1965) can also 
be regarded as an example of this kind of 
expected-utility theory—though it has not 
been so described in the past—because it 
postulates reduced apprehension about failure 
of the risky alternative, which can be con- 
Strued as increased utility for this outcome. 
(b) The second hypothesis is that group dis- 
cussion leads to convergence on the utilities 
associated with the various possible outcomes 
and that this convergence can lead to a risky 
or cautious shift, depending on other para- 
meters of the situation (Miller, 1970). There 
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is little evidence for or against these two hy- 
potheses. 

Another approach occasionally taken in- 


ift from 
Burnstein 


volves an effort to predict the ris 
models of group decision making. 


(1969) described a kind of majority-rule 
model, but presented no data supporting it. 
Fraser (1970) also endorsed a majority-rule « 
model, as part of a two-process theory in 


which relevant arguments are the basis for 
the second process. He cited as evidence for 
this model unpublished findings by Lambert, 
Cartwright, in the article immediately fol- 
lowing this one, describes results which he 
obtained from a reanalysis of data collected 
by Cecil. These seem to show that most group 
decisions on the choice-dilemma problems can 
be accounted for by three principles: majority 
rule, coalition rule, and mean rule. 

Two other studies call into question the 
adequacy of the principles of majority and 
coalition rule for explaining the risky shift. 
In these studies, the direction of skewness of 
the initial distribution of choices, and there- 
fore the direction of the majority or potential * 
coalition, seemed to make no difference for 
the size of the risky shift in risk-oriented 
items (Vinokur, 1969; Wallach & Mabli, 
1970). Cartwright has criticized the latter 
study for composing the groups on the basis 
of risk-taking scores that were averaged over 
a number of items. However, other findings in- 
ternal to the study seem to suggest that the 
intended features of group composition were 
at least partly exhibited during discussions of 
individual items. It should be reiterated that 
the Vinokur and Wallach-Mabli studies per- 
tain only to the adequacy of group-decision 
models for explaining the risky shift. It is 
still possible that such models will prove ade- 
quate for explaining other sorts of shifts. In- 
deed some of Vinokur’s results point in this 
direction: he found that groups shifted in the 
direction of their majorities on two neutral | 
items which were not oriented toward risk or 
caution. 

Tn summary, though interesting, the various 
decision theories are as yet inadequately sup- 
ported. 
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SAMPLE ITEM FROM KOGAN AND WALLACH (1967b) SET 


Mr. A, an electrical engineer, who is married and has one child, has been working for a large 
electronics corporation since graduating from college 5 years ago. He is assured of a lifetime 
job with a modest, though adequate, salary, and liberal pension benefits upon retirement. 


On the r hand, it is very unlikely that his salary will increase much before he retires. 
While attending a convention, Mr. A is offered a job with a small, newly founded company 
which has a highly uncertain future. The new job would pay more to start and would offer 
the possibility of a share in the ownership if the company survived the competition of the 
larger firms. 

Imagine that you are advising Mr. A. Listed below are several probabilities or odds of the 
new company’s proving financially sound. 

PLEASE CHECK THE LOWEST PROBABILITY THAT YOU WOULD CONSIDER 
ACCEPTABLE TO MAKE IT WORTHWHILE FOR MR. A. TO TAKE THE NEW 
JOB. 


The chances are 1 in 10 that the company will prove financially sound. 

The chances are 3 in 10 that the company will prove financially sound. 

- The chances are 5 in 10 that the company will prove financially sound. 

— The chances are 7 in 10 that the company will prove financially sound. 

— The chances are 9 in 10 that the company will prove financially sound. 
— Place a check here if you think Mr. A should not take the new job no matter what the 


probabilities. 


REMAINING Irems FROM KoGAN AND WALLACH SET 


- Mr. B, a 48-year-old accountant, has recently been informed by his physician that he has 


developed a severe heart ailment. The disease would be sufficiently serious to force Mr. B to 
change many of his strongest life habits—reducing his work load, drastically changing his 
diet, giving up favorite leisure-time pursuits. The physician suggests that a delicate medical 
operation could be attempted which, if successful, would completely relieve the heart con- 
dition, But its success could not be assured, and in fact, the operation might prove fatal. 


. Mr. C, a married man with two children, has a steady job that pays him about $6000 per 


year. He can easily afford the necessities of life, but few of the luxuries. Mr. Cs father, 
who died recently, carried a $4000 life insurance policy. Mr. C would like to invest this money 
in stocks. He is well aware of the secure “blue-chip” stocks and bonds that would pay 
approximately 6% on his investment. On the other hand, Mr. C has heard that the stocks 
of a relatively unknown Company X might double their present value if a new product 
currently in production is favorably received by the buying public. However, if the product 
is unfavorably received, the stocks would decline in value. 


- Mr. D is the captain of College X's football team. College X is playing its traditional rival, 


College Y, in the final game of the season. The game is in its final seconds, and Mr. D's 
team, College X, is behind in the score. College X has time to run one more play. Mr. D, 
the captain, must decide whether it would be best to settle for a tie score with a play which 
would be almost certain to work; or, on the other hand, should he try a more complicated 
and risky play which could bring victory if it succeeded, but defeat if not. ; 

Mr. E is president of a light metals corporation in the United States. The corporation is 
quite prosperous, and has strongly considered the possibilities of business expansion by 
building an additional plant in a new location. The choice is between building another plant 
in the United States, where there would be a moderate return on the initial investment, or 
building a plant in a foreign country. Lower labor costs and easy access to raw materials 
in that country would mean a much higher return on the initial investment. On the other 
hand, there is a history of political instability and revolution in the foreign country under 
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consideration. In fact, the leader of a small minority party is committed to nationalizing, 
that is, taking over, all foreign investments. 

Mr. F is currently a college senior who is very eager to pursue graduate study in chemistry 
leading to the Doctor of Philosophy degree. He has been accepted by both University X and 
University Y. University X has a world-wide reputation for excellence in chemistry. While 
a degree from University X would signify outstanding training in this field, the standards 
are so very rigorous that only a fraction of the degree candidates actually receive the degree. 
University Y, on the other hand, has much less of a reputation in chemistry, but almost 


everyone admitted is awarded the Doctor of Philosophy degree, though the degree has much 
less prestige than the corresponding degree from University X. 


. Mr. G, a competent chess player, is participating in a national chess tournament. In an 
early match he draws the top-favored player in the tournament as his opponent. Mr. G 
has been given a relatively low ranking in view of his performance in previous tournaments. 
During the course of his play with the top-favored man, Mr. G notes the possibility of a 
deceptive though risky maneuver which might bring him a quick victory. At the same 
time, if the attempted maneuver should fail, Mr. G would be left in an exposed position, and 


defeat would almost certainly follow. 


. Mr. H, a college senior, has studied the piano since childhood. He has won amateur prizes 


and given small recitals, suggesting that Mr. H has considerable musical talent. As gradua- 
tion approaches, Mr. H has the choice of going to medical school to become a physician, a 
profession which would bring certain prestige and financial rewards, or entering a con- 
servatory of music for advanced training with a well-known pianist. Mr. H realizes that 
even upon completion of his piano studies, which would take many more years and a lot of 
money, successs as a concert pianist would not be assured. 


- Mr. J is an American captured by the enemy in World War II and placed in a prisoner-of- 


war camp. Conditions in the camp are quite bad, with long hours of hard physical labor 
and a barely sufficient diet. After spending several months in this camp, Mr. J notes the 
possibility of escape by concealing himself in a supply truck that shuttles in and out of camp. 
Of course, there is no guarantee that the escape would prove successful. Recapture by the 
enemy could well mean execution. 

Mr. K is a successful businessman who has participated in a number of civic activities of 
considerable value to the community. Mr. K has been approached by the leaders of his 
political party as a possible congressional candidate in the next election. Mr. K's party 
1s à minority party in the district, though the party has won occasional elections in the past. 
Mr. K would like to hold political office, but to do so would involve a serious financial 
sacrifice, since the party has insufficient campaign funds. He would also have to endure the 
attacks of his political opponents in a hot campaign. 

Mr. L, a married 30-year-old research physicist, has been given a 5-year appointment by 
a major university laboratory. As he contemplates the next 5 years, he realizes that he 
might work on a difficult long-term problem which, if a solution could be found, would 
resolve basic scientific issues in the field and bring high scientific honors. If no solution were 
found, however, Mr. L would have little to show for his 5 years in the laboratory, and 
this would make it hard for him to get a good job afterwards. On the other hand, he could, 
as most of his professional associates are doing, work on a series of short-term problems 
where solutions would be easier to find, but where the problems are of lesser scientific im- 
portance. 

Mr. M is contemplating marriage to Miss T, a girl whom he has known for a little more 
than a year. Recently, however, a number of arguments have occurred between them, 
suggesting some sharp differences of opinion in the way each views certain matters. Indeed, 
they decide to seek professional advice from a marriage counselor as to whether it would 
be wise for them to marry. On the basis of these meetings with a marriage counselor, they 
realize that a happy marriage, while possible, would not be assured. 


(Received June 7, 1971) 


Journal of Personality and Social Psychology 
1971, Vol. 20, No. $, 361-378 


Rarely in the history of social psy- 
chology has a single study stimulated as 
much research as the master's thesis by 
Stoner (1961) which reported the discovery 
of "the risky shift? Its conclusion that 
groups are riskier than individuals was widely 
interpreted as being contrary to the findings 
of previous research on the effects of groups 
on individuals. It challenged conventional 
wisdom, and it appeared to have implications 
for those responsible for making important 
decisions involving risk. As findings began 
to accumulate, certain other attractive fea- 
tures of the research became apparent. The 
Choice Dilemmas Questionnaire (CDQ), 
Which was used by Stoner to measure the 
risk-taking disposition of individuals and 
groups, is easy to administer, permits a com- 
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The research of the past decade has established a number of firm findings 
concerning the effects of group discussion on decisions involving risk. An 
examination of these findings, however, raises certain questions about the basic 
assumptions that have guided research. Evidence that the items of the Choice 
Dilemmas Questionnaire (CDQ) produce systematically different responses both 
before and after group discussion casts serious doubt on the propriety of using 
CDQ scores as measures of a unitary disposition to take risks. And the conclu- 
sion that groups are invariably riskier than individuals is no longer justified. The 
risky-shift paradigm has generated an impressive body of facts, but these facts 
reveal the inadequacy of the paradigm itself. The field therefore faces a theo- 
retical crisis, The central task for theory is not to explain “the risky shift,” 
but to account for the much more complicated effects of group discussion. The 
attainment of a proper understanding of these effects will require the use of 
theoretical orientations and the collection of data that focus much more closely 
than in the past on the concrete properties of the decision-making process and 
the exact way in which group discussion affects them. 


parison of findings from different studies, and, 
most important, produces replicable results. 
A risky shift in CDQ scores has been found 
by numerous investigators in several different 
countries and with a variety of subjects. 

The popularity of research employing the 
CDQ is especially interesting, since studies 
by Atthowe (1961), Hunt and Rowe (1960), 
and Lonergan and McClintock (1961), which 
appeared at about the same time as Stoner's 
thesis but used different measures of risk, did 
not find significant differences between indi- 
viduals and groups. Several investigators have 
continued to use these alternative measures of 
risk, but their results have not led to simple 
general conclusions. Since the CDQ has pro- 
duced the more dramatic findings, it has 
had greater appeal, and the choice-dilemma 
paradigm has continued to dominate the field. 

A major reason for the great interest in 
the risky shift is that it has stimulated theo- 
retical controversy. Several competing hy- 
potheses have been advanced to explain why 
groups are riskier than individuals. And at- 
tempts to test these hypotheses have pro- 
duced a large body of data about the effects 
of various experimental treatments on re- 
sponses to the CDQ. It is quite apparent, 
however, from reviews of the literature pre- 
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pared by Clark (1971), Dion, Baron, and 
Miller (1970), Mackenzie (1971), and 
Vinokur (1971b), that the theoretical issues 
are far from settled. Since the failure to find 
a convincing explanation of the risky shift 
can hardly be attributed to a lack of effort 
or ingenuity, the possibility should be con- 
sidered that the effect itself is improperly 
conceived. The purpose of the present paper 
is to investigate this possibility. Toward this 
end, we shall review the literature to ascertain 
what is now known about the nature of the 
risky shift. Our objective is not to evaluate 
explanations, but to identify the facts that 
require explanation. 


Firm FINDINGS 


What findings from the past decade of re- 
search have been sufficiently established to 
require theoretical explanation? What are the 
facts? The awarding of factual status to a 
finding is, of course, to some degree a matter 
of subjective judgment. Has it been suffi- 
ciently replicated? Is it properly stated? 
Can it be obtained with conceptually equiv- 
alent methods? Does it generalize to popula- 
tions not previously studied? Such questions 
can rarely be answered with great confidence. 
In constructing the following list of firm find- 
ings, primary weight has been given to actual 
replication. Some findings, however, have 
been taken to be facts, even though their 
replicability has not been fully demonstrated, 
simply because they seem to fit well with 
other established findings. And some repli- 
cated findings have not been treated as facts 
because of questions concerning their inter- 
pretation. An effort has been made to be 
cautious but not too cautious. Perhaps a com- 
mittee would have arrived at a different 


listing. 
Effects of Treatments 


A repeated-measures design has been em- 
ployed in nearly all of the research. The 
CDQ, or some modified version of it, is first 
used to obtain pretreatment measures for the 
subjects of the experiment. Then, after the 
administration of some experimental treat- 
ment, the same instrument is employed to 
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yield one or more posttreatment measures for 
the same subjects. The effects of experimental 
manipulations are usually assessed by ascer- 
taining whether the means of the pretreat- 
ment and posttreatment measures are s 
cantly different. If they are, a shift in risk 
taking is said to exist. Efforts to account for 
the effect of a given treatment have com- 
monly relied upon a comparison of results 
obtained from it with those produced by one 
or more other treatments which differ in 
some particular way. 

The standard procedure for getting pre- 
treatment measures has been to ask subjects 
to fill out the CDQ as individuals and without 
knowledge of the nature of the subsequent 
treatment. Their decisions thus reflect per- 
sonal preferences which are presumably un- 
affected by any systematic anticipation of 
subsequent events in the experiment. The 
conditions under which these measures are 
taken have varied somewhat from one experi- 
ment to the next. Sometimes subjects com- 
plete the CDQ in the presence of others and 
sometimes in isolation. The time interval be- 
tween the pretreatment measures and the 
treatment has also varied considerably. It 
does not appear, however, that these differ- 
ences have any substantial effect on the basic 
findings. 

Two kinds of posttreatment measures have 
been used, depending on the nature of the 
treatment. One procedure is simply to have 
subjects again complete the CDQ as indi- 
viduals. In asking them to take the CDQ a 
second time, an attempt is usually made to 
make subjects feel free to alter their initial 
decisions without suggesting that they should 
do so. A second procedure is to have each 
group complete the CDQ by means of unani- 
mous group decisions. Often, subjects are 
told that the initial administration of the 
CDQ was intended merely to familiarize 
them with the nature of the problems. It 
should be noted that these two types of post- 
treatment measures, when compared with 
pretreatment ones, reveal conceptually differ- 
ent effects of a treatment. The first reflects 
the treatment's effects on the personal pref- 
erences of individuals, whereas the second 
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compares the preferences of individuals and 
groups. In some studies, both types of mea- 
sures have been taken, but group measures 
have always come first. 

Let us now examine the dependable effects 
of various treatments. We consider first the 
results when responses to the items of the 
CDQ are summed to give a single score for 
each individual or group. 

1. Group discussion to consensus. The 
most frequently replicated treatment is one 
in which small groups, usually containing 
from two to seven members, are formed and 
simply asked to discuss each item in order 
to reach a unanimous group decision con- 
cerning the lowest acceptable odds of success 
for the uncertain alternative. No constraints 
are imposed upon the content of the discus- 
sion or upon the way the group organizes 
itself to perform the task. The results of this 
treatment are remarkably consistent. The 
mean of CDQ scores for groups is dependably 
less than the mean for individuals. Group 
decisions, following free group discussion, are 
riskier on the average than personal decisions 
of the same individuals made prior to the 
discussion. It is this phenomenon to which 
the term risky shift was originally applied. 

In several studies subjects have been asked 
to indicate their final personal choices, 
whether or not they agree with the group 
decisions, by completing the CDQ once again 
as individuals. The mean of these scores is 
regularly found to be close to the mean of 
the groups’ scores. Thus, both group decisions 
and individual choices are riskier after group 
discussion to consensus than before. Although 
the effect of the time interval following this 
experimental treatment upon the shift in 
individual scores has not been investigated 
Systematically, Wallach, Kogan, and Bem 
(1962) reported that it endures for a period 
of 2-6 weeks. 

2. Free discussion without group decision. 
Another frequently used treatment is one in 
Which groups are asked to discuss the items 
of the CDQ but not asked to reach an explicit 
decision. Here, the mean of CDQ scores for 
individuals is significantly smaller after dis- 
Cussion without group decision than before. 
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It should be noted that these results taken 
alone do not demonstrate that the shift for 
group decisions can be attributed to changes 
in individual preferences resulting from dis- 
cussion, They do show, however, that an 
explicit group decision is not required for 
the occurrence of shifts in the choices of 
individuals. 

3. Observing free discussion without group 
decision. A few experiments have employed 
a treatment in which subjects are merely 
exposed to a group discussion of the CDQ 
items without themselves participating in the 
discussion. They may watch or listen to an 
actual discussion or read a transcript of it. 
Studies reported by Bell and Jamieson 
(1970), Kogan and Wallach (1967), Lamm 
(1967), and St. Jean (1970) have consist- 
ently found a risky shift in the personal pref- 
erences of observers, but its magnitude tends 
to be smaller than for participants. Although 
this research demonstrates that mere expo- 
sure to the content of a group discussion can 
produce a shift, it does not reveal what 
features of the discussion are responsible for 
its occurrence. Perhaps some of the arguments 
are found to be persuasive. Perhaps the 
observer learns that others are riskier than 
he is. Or it may be that observers are able 
to discern the drift of the discussion and 
simply make changes which they believe will 
match the modal choices of the participants. 
Actually, there is some evidence from other 
studies suggesting that when subjects are 
given information about the modal choices of 
others they do tend to conform to these 
“norms.” It would be useful to know how 
well observers are able to predict how the 
discussants actually complete the CDQ. 

4. Discussion without revealing initial 
choices. According to the value hypothesis 
proposed by Brown (1965), there is a perva- 
sive cultural value favoring risk which makes 
each person want to be at least as risky as 
others. When subjects participate in a group 
discussion, some of them will discover that 
their initial choices were less risky than those 
of some others, and they will be motivated to 
shift toward greater risk. Thus, shifts result- 
ing from group discussion are attributed to the 
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information it provides about the initial pref- 
erences of the participants. If this informa- 
tion were withheld from the discussion, then 
it might be expected that no shifts would 
occur. Several investigators have employed 
such a treatment, in which subjects are re- 
quested to discuss the content of the dilemma 
without revealing their personal choice of 
odds. The results for this treatment, as re- 
ported by Clark, Crockett, and Archer (1971) 
and St. Jean (1970), show significant and 
substantial shifts. Similar findings have been 
obtained by Burnstein, Miller, Vinokur, Katz, 
and Crowley (1971) and Vinokur (1970, 
1971a). These investigators asked subjects 
to discuss the values, or utilities, involved 
in each dilemma, rather than preferred odds, 
and to reach a unanimous decision as to how 
to rate these utilities on a scale. These dis- 
cussions were found to yield significant shifts 
both in utilities and odds. The results from 
this treatment cannot, however, be taken as 
a definitive test of the value hypothesis, since 
subjects may infer the risk preferences of 
others from the discussion and base their 
posttreatment choices on these inferences. It 
does appear, though, that an explicit state- 
ment of initial choices is not required for 
a shift to occur. 

5. Providing initial 


information about 


_ choices of others. A more direct test of the 


value hypothesis has been attempted by con- 
structing a treatment in which subjects are 
simply given information about the initial 
choices of others. In this research there has 
been considerable variation in the wording 
of instructions, the items used, and the ways 
in which the information about the choices 
of others is presented. It is not surprising, 
therefore, that the results have not been en- 
tirely consistent. Nevertheless, the findings 
teported by Bell and Jamieson (1970), 
Clark et al. (1971), Murdoch, Myers, and 
Smith (1971), St. Jean (1970), Stokes (1971), 
Teger and Pruitt (1967), and Wallach 
and Kogan (1965) justify the conclusion that 
the mere exchange of information about 
initial choices does not dependably produce 
shifts as large as those generated by free 
discussion with or without group discussion. 


Dorwin CARTWRIGHT 


—_—- = 


6. Familiarization. One consequence of a 
group discussion is undoubtedly that it per- 
mits subjects to become more familiar with 


the choice dilemmas. It is possible, therefore, 
that familiarization by itself will produce 
shifts comparable to those resulting from 
discussion. Bateson (1966) first tested this 
possibility by constructing a treatment in 
which individual subjects engaged in several 


activities designed to familiarize themselves 
with the nature of each dilemma. Flanders 
and Thistlethwaite (1967) subsequently de- 
signed a somewhat different familiarization 
treatment. Both studies found a significant 
risky shift. Attempts to replicate these find- 
ings, however, have met with universal failure. 
No significant shifts following familiarization 
have been obtained in studies reported by 
Bell and Jamieson (1970), Miller and Dion 
(1970), Rule and Evans (1971), Stokes 
(1971), and Teger, Pruitt, and St. Jean 
(1970). The weight of evidence now strongly 
supports the conclusion that familiarization 
does not dependably produce significant shifts. 
The concept of “familiarization,” however, 
has not been clearly defined, and no con- 
vincing explanation has yet been advanced 
to account for the differences between the 
two sets of findings. 

7. Irrelevant group discussion. It is con- 
ceivable that the effects of group discussion 
are independent of its content. The mere 
occurrence of a discussion might change inter- 
personal relationships or personal motivation 
in some way that makes subjects more willing 
to take risks. The available evidence, how- 
ever, gives little support for this possibility. 
Clark and Willems (1970) found no signifi- 
cant shift following discussion of an “irrele- 
vant” discussion. And Alker and Kogan 
(1968) obtained none following a discus- 
sion of women’s fashions. Nor did Lamm, 
Trommsdorff, and Kogan (1970) find one 
when they had subjects discuss the actual 
probability of success for each dilemma rather 
than the minimum acceptable odds. Incon- 
sistent results have been obtained, however, 
by Madaras and Bem (1968) and Graham 
and Harris (1969). In both of these studies, 
subjects discussed only half of the items of 


s 


the CDQ but gave responses to all of them. 
The items not discussed showed no signifi- 
cant shifts in the first study but significant 
risky shifts in the second. The situation is 
further complicated by findings reported by 
Alker and Kogan (1968). These investigators 
used two forms of the CDQ, one for initial 
choices and the other for choices after discus- 
sion, and obtained a significant shift toward 
tisk in total scores. In another condition, they 
even found a cautious shift following a dis- 
cussion of certain problems of ethics. It is 
possible, then, that a discussion of topics 
other than the dilemma problems themselves 
may produce a change in choices, but it 
remains to be determined what features of a 
discussion make it “relevant” to preferences 
on the CDQ. 

8. Irrelevant activities. Numerous investi- 
gators have employed a basic control treat- 
ment in which subjects engage in a presum- 
ably “neutral” activity between two adminis- 
trations of the CDQ. In some experiments 
the questionnaire is completed at two different 
sessions which are separated by a period of 
naturalistic “free activity.” In others, it is 
given twice in the same session with some 
standard intervening activity. Despite these 
procedural variations, the results uniformly 
Show no significant differences in means be- 
tween the first and second set of scores. Shifts 
cannot be attributed simply to repeated 
experience with the CDQ. 

4 In considering the practical or theoretical 
- implications of these results, it is important 
to be quite clear about the statistical proper- 
lies of the risky shift. Research has demon- 
strated the existence of a dependable shift 
in the sense that mean scores on the CDQ 
are significantly smaller after group discus- 
. Sion than before, but few details about the 
- Microstructure of the data can be found in 
the published literature. Little attention has 
been paid to the magnitude of the shift, the 
‘umber of groups which show it, or to the 
distribution of shifts made by each group on 
‘tach item. Most investigators have been con- 
tent with testing the null hypothesis. From 
the data that are readily available, however, 
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it is evident that the average trends are made 
up of a great variety of specific effects. 

How big is the risky shift? A review of the 
literature reveals that the mean initial scores 
on the CDQ tend to be about 70, and that 
mean shifts resulting from the strongest 
treatment approximate 10. Since the CDQ 
contains 12 items, the average shift per item, 
expressed as odds, is roughly from 6 in 10 to 
5 in 10. Groups may be riskier than indi- 
viduals on the average, but they are not 
strikingly so. 

As to frequency of occurrence, the evidence 
indicates that the great majority of groups 
do display a shift toward risk in CDQ scores. 
But since each score combines choices for 12 
items, it does not follow that a majority of 
the particular decisions made by groups show 
such a shift. In fact, there is good reason to 
believe that they do not. When Burns (1967) 
examined the entire set of group decisions in 
the original Stoner study, he found that 51% 
of them deviate by one unit or less from the 
group’s mean of initial choices. Since Stoner’s 
scale contained only the discrete points 1, 3, 
5, 7, 9, and 10, these group decisions actually 
fall at the point closest to the initial mean 
and cannot be considered genuine shifts. Only 
35% of the decisions give risky shifts larger 
than one unit, and 1446 show cautious shifts 
of this magnitude. Belovicz and Finch (1971) 
have made a similar analysis for three sepa- 
rate experiments involving a total of 564 
group decisions. Of these, 40% are at the 
point closest to the group mean, 39% deviate 
in a risky direction, and 21% in a cautious 
one. Thus, it appears that most group deci- 
sions are not riskier than individual ones, and 
that a substantial minority is actually more 
cautious. The most frequent outcome may 
actually be no shift at all. These results cast 
serious doubt about the adequacy of any 
explanation of the risky shift, which assumes 
that the typical group decision shows a shift 
toward risk. 


Effects of Items 


Even in the earliest research it was evident 
that the 12 items of the CDQ contribute to 
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the overall shift in quite different ways. 
Although the mean shiít was toward risk for 
most items, its magnitude varied consider- 
ably. There were, moreover, 2 items which 
repeatedly produced cautious shifts. The re- 
sponse of investigators to this discovery may 
be of interest to the historian of science. 
Some essentially ignored it and continued to 
deal with total scores. Others removed the 
2 deviant items from the CDQ, apparently 
assuming that they would thereby improve 
the reliability of the instrument as a measure 
of risk taking. A few were motivated to 
discover whether they could design new items 
in the same format which would also display 
cautious shifts. On the whole, however, the 
magic of the risky shift was so powerful that 
few seriously questioned whether its existence 
depends upon the particular set of items that 
happened to be contained in the original 
version of the CDQ. And most theoretical 
efforts continued to be directed to accounting 
for the shift toward risk. 

9. Mean shifts for items. There can now 
be no doubt that different items generate 
distinctively different mean shifts. Although 
data are still not always given separately for 
items, the “typical” shift for each item can 
be estimated from a tabulation of the mean 
shifts reported in the literature. For this 
purpose, we consider the median of the mean 
shifts resulting from discussion to consensus. 
For Items 1, 4, and 7, these medians, ex- 
pressed as odds, range from 1.0 to 1.6. For 
Items 6, 8, and 9, they all approximate .8. 
And for Items 3, 10, and 11, they lie between 
-6 and .7. These nine items, then, consistently 
yield risky shifts, but their magnitudes vary 
substantially. On the other hand, Item 2 
with a median of .3, yields small shifts which 
are predominantly, but not uniformly, risky. 
And Items 5 and 12 typically give cautious 
shifts, with medians of —.1 and 22:6; 
respectively. 

The attempt by Nordhgy (1962) , Rabow, 
Fowler, Bradíord, Hofeller, and Shibuya 
(1966), Stoner (1968), and others to construct 
choice dilemmas that will dependably yield 
cautious shifts has been generally successful. 
Stoner found, however, that it is more dif- 
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ficult to write cautious-shift items than risky- 
shift ones. This fact, together with the rela- 
tively small magnitude of shifts for the 
cautious-shift items of the original CDQ, has 


two 


sometimes been interpreted as indicating that 
the “natural” shift is toward risk. But such 
a conclusion seems unjustified, since several 
of the newly constructed items give larger 
cautious shifts than the risky shifts produced 
by several of the CDQ items. The difficulty 


in constructing cautious-shift items may 
merely indicate a lack of understanding of 
the determinants of shifts in general rather 
than some peculiarity of the cautious shifts. 

10. Means of initial choices. It is now 
clearly established that each item also gen- 
erates a characteristic distribution of initial 
individual choices. Three items, 4, 7, and 11, 
tend to produce relatively risky choices. 
The medians of their means lie between 4.1 
and 4.5. Items 2, 5, and 12, on the other 
hand, yield relatively cautious choices, with 
medians between 6.8 and 7.5. The remaining 
six items may be considered “neutral,” since 
their medians lie near the midpoint of the 
scale, ranging between 5.0 and 5.9. In view of 
the diverse conditions of testing and scoring 
employed in different studies, the variation 
in means for a given item is remarkably small. 
For all but two items, half of the reported 
means deviate from the median by less than 
one-third of a unit. 

11. Relation between initial means and 
mean shifts. As first explicitly noted by 
Teger and Pruitt (1967), items with larger 
risky shifts tend to have riskier initial means, 
and those producing cautious shifts tend to 
have relatively cautious initial means. When 
Teger and Pruitt combined data from several 
studies to generate "national norms," they 
obtained a correlation of .81 between initial 
means and mean shifts for the items of the 
CDQ. Correlations of similar magnitude have 
Since been reported by other investigators 
both for the original items and for newly 
constructed ones. 

The proper interpretation of this well- 
established relationship is not at all obvious. 
It is possible that the content of each item 
suggests a characteristic set of persuasive 
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arguments which influence both initial choices 


and the content of group discussion, Or an 
item may activate a cultural value favoring 
risk or caution which affects initial choices 


then reinforced by group discussion. 
e, however, that an entirely different 
process is involved. The correlation 
tion refers to properties of the total 


and i 
It may 


population of individuals or groups studied 
and only indirectly the concrete situations in 
which shifts occur. Since the distribution of 
initial choices in the population at large 
determines their distributions within ran- 
domly composed groups, the correlation may 
simply reflect the effects of these within- 
group distributions on shifts. Indeed, we 
would find such a correlation if the mean of 


initial choices in the general population were 
systematically related to other properties of 
the distribution, such as skewness or variance, 
and ií shifts were dependent upon these 
properties of within-group distributions. No 
confident choice among these interpretations 
can be made in the absence of evidence 
directly relevant to them. 

12. Other responses to items. In one of the 
earliest studies, Hinds (1962) asked subjects 
to indicate how they thought similar others 
would respond to some of the choice 
dilemmas, and found that they tended to 
believe that others would make riskier choices 
than they, More extensive data collected by 
Wallach and Wing (1968) and by Levinger 
and Schneider (1969) have confirmed the 
finding for those items of the CDQ that yield 
consistent risky shifts. The results of this and 
other research justify the conclusion that for 
tisky-shift items, subjects tend to believe that 
their initial choices are riskier than those 
of others. 

There is also evidence that for risky-shift 
items, subjects favorably evaluate riskier 
choices and people who take greater risks. 
Levinger and Schneider had subjects specify 
Which choice they would most admire and 
found that these choices were riskier on the 
average than their own. When Madaras and 
Bem (1968) and Jellison and Riskind (1970) 
asked subjects to attribute personal traits 
to people who had allegedly made risky or 


cautious choices, they found that these traits 
were generally more favorable when others 
were viewed as favoring risk rather than 
caution. 

Since interest has been directed primarily 
to explaining the risky shift, less attention has 
been paid to cautious-shift items, but data 
provided by Levinger and Schneider (1969), 
Stoner (1968), and Vidmar and Burdeny 
(1969) indicate that for such items subjects 
tend to believe that their initial choices are 
more cautious than those of others. Evidence - 
concerning the evaluations of the choices 
made to cautious-shift items is sparse and 
inconclusive. 

On the whole, these findings are consistent 
with some version of the value hypothesis 
advanced by Brown (1965), but they do not, 
in themselves, demonstrate that the beliefs - 
and values associated with choices are caus- 
ally related to the shifts made by particular - 
individuals on particular items. Evidence that 
they may be is provided by Clark et al. 
(1971), who found a larger shift toward risk 
among subjects who believed that their own 
choices were at least as risky as others than 
among those who believed that they were less 
risky than others. The findings reported by 
several investigators, that estimates of others' 
choices change as a consequence of group 
discussion, are also consistent with this inter- 
pretation. Further research, which relates 
shifts to the beliefs and values of specific 
individuals for specific items, is required be- 
fore the issue can be settled. Such research 
should, of course, investigate both risky-shift 
and cautious-shift items, The analyses under- 
taken by Ferguson and Vidmar (1971) make 
an excellent start toward providing the kind 
of detailed information required. 


SIGNIFICANCE OF BAsIc FINDINGS 


The research of the past decade has clearly 
established that a relevant group discussion 
yields a reduction in mean CDQ scores. And 
if it is assumed that these scores measure a 
readiness to take risks, the conclusion is 
justified that there is, indeed, a risky shift. 
But the significance of this shift for theory 
and practice is far from evident. The findings 
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show that the shiít is not large, that most 
choices made after group discussion are not 
riskier than before, and that the shifts ob- 
tained for different choice dilemmas differ in 
direction and magnitude. There are, more- 
over, serious questions yet unanswered con- 
cerning the meaning of CDQ scores, the 
nature of the specific choices which are com- 
bined to give total scores, and the character- 
istics of the concrete processes that bring 
about changes in the choices made by par- 
ticular individuals or groups on particular 
items. We turn now to an examination of the 
more important of these issues. 


Differences Among Items 


Use of the CDQ in research on group 
decision making was originally based on the 
assumption that it is an instrument for mea- 
suring the risk-taking dispositions of indi- 
viduals or groups. Since responses to its 12 
items were conceived as being determined by 
this unitary disposition, they were summed 
to give a single score. And the reduction in 
the mean of these scores following group dis- 
cussion was taken as evidence that groups 
are “riskier” than individual members when 
acting alone. This psychometric orientation 
has had a pervasive influence on most of the 
subsequent research. 

Confidence in the propriety of this ap- 
proach has been seriously undermined by the 
findings concerning differences among items. 
It is now clear that the items contained in the 
original CDQ are in no sense a representa- 
tive sample of the universe of all possible 
items. Instruments similar to the CDQ could 
readily be constructed whose scores would 
display risky shifts, cautious ones, or none 
at all. Since there is no basis now known for 
selecting any one of these as the true measure 
of a risk-taking disposition, the argument that 
group discussion heightens this disposition is 
hardly compelling. 

If the CDQ score does not measure a uni- 
tary disposition, then it has no clear referent, 
for it merely summarizes the choices made 
to a particular set of 12 choice dilemmas. 
There is evidence, moreover, that its use may 
be quite misleading. Thus, for example, when 
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Hoyt and Stoner (1968) attempted | 
“homogeneous” groups by selecting subjects 
on the basis of their initial CDQ scores, they 
found that the groups were actually hetero- 
geneous on many of the items. Similar dif- 
ficulties arise in studies of the effects of the 
relative riskiness of individuals within a 
group, since a person’s rank on CDQ scores 
does not necessarily correspond to his rank 
on even a majority of items. Clearly, great 
care must be exercised in interpreting results 
based on these scores and in evaluating thei: 
implications for theory. 


Meaning of Choices on Choice Dilemmas 


The concrete events which generate the 
basic data to be explained are the choices 
made by individuals or groups on each choice 
dilemma. The significance of findings from 
this research therefore depends upon the 
meaning that can be attributed to these 
choices. It has commonly been assumed that 
they reflect a “readiness” to take risk, but 
the way in which they do so has not been 
clearly specified. If the meaning of these 
choices is to be clarified, it is necessary first 
of all to understand the nature of the dilem- 
mas themselves. What is the structure of the 
problem that they present to subjects? An 
examination of the dilemmas reveals that 
each contains two alternative courses of ac- 
tion. One of these, R, is risky in the sense 
that its outcome is uncertain; it may lead 
either to success or failure. The other, C, is 
cautious in that its outcome is described as 


certain. Let us denote the values of success | 


and failure on R, as assessed by a given sub- 
ject, by V and —K, respectively, and the 
value of the certain outcome by v. The 
dilemma consists in the fact that v is assumed 
io be less than V and greater than K. The 
assigned task is to specify the minimum prob- 
ability of success on R that would justify 
undertaking it, and this probability consti- 
tutes the choice for each dilemma. 

When a subject faces such a dilemma, what 
determines his choice? Although little empir- 
ical work has been aimed directly at answer- 
ing this fundamental question, it is possible 
to identify some of the possible determinants. 


createur 


. 


irst, it is clear that the relative magnitudes 
f the values of outcomes must have some 
ffect on choices. If, for example, a person 
laces a relatively high value on the success- 
ful outcome of R, he would be expected to 
recommend undertaking it even though the 
probability of success were small. If he be- 
lieves that the costs of failure are great, he 
yould be expected to require a higher prob- 
bility of success. And if he finds the certain 
utcome attractive; he should be willing to 
ecommend the risky alternative only when 
ts probability of success is relatively high. 
There would seem to be dittle doubt, then, 
that the relative magnitudes of the values v, 
V, and —K at least partially determine the 
choices made to dilemmas. 

It should be noted that these determinants 
have nothing directly to do with a “value for 
risk,” since they are concerned only with the 
values of outcomes. If they are the only de- 
terminants, then it is clearly not appropriate 
to consider choices as measures of the value 
for risk. It is possible, of course, that such a 
value also exerts an influence on choices, that 
a person who values risk itself will favor 
riskier choices. But even if this is true, it 
is clear that choices cannot be taken as a 
measure of the value for risk unless the values 
of outcomes are held constant. Since the re- 
quirement is obviously not met in the usual 
formulation of choice dilemmas, the concep- 
tual meaning of choices remains ambiguous. 
And theories which account for shifts in 
choices by invoking some mechanism that 
‘aodifies the value for risk rest on a shaky 
empirical base. 

Miller (1970) has proposed that subjects 
actually understand the task presented in the 
dilemmas to be one of selecting the probabil- 
ity of success on the risky alternative such 
that the subjective expected utilities of the two 
alternatives are equal. If they do and if we 
consider the values v, V, and —K as utilities, 
then it follows that they should choose a 
probability, p, for which p = v + K/V + Be, 
According to this view, items differ in mean 
initial choices because they set up character- 
istic distributions of subjective utilities, and 
subjects differ because their assessments of 
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the utilities differ. Data collected by Miller 
(1970), Burnstein et al. (1971), and Vinokur 
(1970, 1971a) are generally consistent with 
this interpretation. In these studies, subjects 
were asked to make the usual choices of 
minimum probability, but also to give ratings 
of the utilities involved. When the ratings 
were converted into "predicted" probabilities 
in accordance with the equation given above, 
a substantial correlation between predicted 
and actual probabilities was obtained. 

If subjects do not respond directly to risk 
but instead solve the problem in terms of 
utilities (or values of outcomes), then the 
main function of group discussion is to recon- 
cile interpersonal differences in the assess- 
ment of these utilities. Miller has shown 
mathematically that if a group discussion 
causes these assessments to converge to the 
mean, there will be predictable shifts in prob- 
ability choices, whose direction and magni- 
tude will depend upon certain properties of 
the initial within-group distributions of utili- 
ties (and hence probabilities). It has yet to 
be ascertained how well actual data on shifts 
conform to the predictions derived from this 
model, but it has been shown that group 
discussion results in regular changes in the 
ratings of utilities. 

Although it is clear that choices are influ- 
enced by the values (or utilities) v, V, and 
—K, the available findings do not justify the 
conclusion that they are completely deter- 
mined by them. It is still possible that they 
are also influenced by a value (or utility) 
for risk itself. A formal model, known as 
portfolio theory, would appear to be espe- 
cially well suited for dealing with this com- 
plex of determinants. As described by Coombs 
and Huang (1970), this theory has three 
critical elements: perceived risk, ideal risk, 
and expected value. From the perspective of 
portfolio theory, a choice dilemma presents a 
set of available choices (undertaking the 
risky alternative at a particular probability 
of success), each of which has an expected 
value and a degree of perceived risk. When a 
subject is asked to select one of these, he 
evaluates each in terms of its expected value 
and how closely it approximates his ideal level 
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of risk. He then compares this evaluation 
with a similar one for the cautious alterna- 
tive. A group discussion will bring about 
shifts in choices if it causes a change in the 
perceived riskiness of the choices, a heighten- 
ing or lowering of the ideal level of risk, or 
a reassessment of the values of outcomes. 

The immediate value of portíolio theory for 
research on choice dilemmas is its identifica- 
tion of separate potential influences on both 
initial choices and shifts. Research on the 
risky shift, by concentrating on a poorly de- 
fined cultural value of risk, has provided little 
information about the way subjects perceive 
the riskiness of choices, their initial levels 
of risk, or their assessment of the values of 
outcomes. A beginning has been made toward 
an understanding of these determinants by 
Pruitt (1969) and Higbee (1971) in their 
research on labeling of choices as risky or 
cautious, by Levinger and Schneider (1969) 
in their study of most admired choices, and 
by Burnstein et al. (1971) and Vinokur 
(1970, 1971a) in their investigations of the 
assessed values of outcomes. Studies are 
needed, however, which examine more closely 
the ways in which these determinants jointly 
afiect choices. Until such research is com- 
pleted, the meaning of choices on choice 
dilemmas will remain ambiguous. 


Nature of Group Processes 


Since it has been firmly established that 
group discussion is essential for the occur- 
rence of shifts, it is remarkable how little 
attention has been paid to its content and 
to the function it serves. When a group dis- 
cusses a particular item, are the members 
mainly concerned with reconciling the differ- 
ences among them in their initial choices? 
Do they merely seek an acceptable compro- 
mise, or do they attempt to change the 
choices of others? Does the discussion focus 
mainly on the values of outcomes, the ideal 
level of risk, or on the amount of risk actu- 
ally involyed? Are particular arguments com- 
monly advanced in support of certain choices? 
Are some of these especially persuasive? How 
does the distribution of initial choices within 
the group affect these processes? Unfortu- 
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answers to these questions. Fascination will 
the analysis of variance paradigm has 1 
investigators to look almost exclusively at, 
differences between treatments rather than at 
the process directly involved. The ated re 


nately, research has not yet provided i 


approach has unnecessarily dominated re 
search, for these processes can be directl 

observed. Surely, it is time to pry the lid off 
the box to see what goes on inside. 

In order to gain a proper understanding 
of how group processes bring about shifts ir 
choices, it is necessary to examine the within 
group distributions of choices on specific, 
items before and after group discussion. Al- 


though these distributions have rarely been ° 


reported, it is clear that the choice dilemmas 
generate a wide distribution of initial choices. 
Detailed data presented by Cecil (1967) 
from a sample of 238 subjects show that 
each item of the CDQ yields a remarkably 
flat distribution of initial choices. For each 
item, there are some choices at both ends of 


the probability scale. This finding implies « 


that randomly composed groups will normally 
contain members with a diversity of choices 
on most items, If these members are asked 
to reach a unanimous group decision, they 
must somehow reconcile their differences. And 
even if they are requested merely to discuss 
the item, they face a similar task since they 
are likely to seek a single “correct” choice. 

A detailed comparison of the choices made 
by individuals before and after discussion 
would greatly clarify the meaning of shifts 
even in the absence of information about the 


nature of the discussion itself. Although such | 


an analysis was reported by Marquis (1962) 
in one of the earliest studies of the risky 
shift, the significance of his findings has not 
been sufficiently recognized. It is evident in 
the data presented by Marquis that group 
discussion has the effect of reducing the 
within-group variance of choices on each item 
and that when a risky shift occurs, initially 
cautious subjects change more than initially 
risky ones. Since these findings have recently 
been confirmed by Ferguson and Vidmar 
(1971) and Vidmar and Burdeny (1969), 
we may safely conclude that convergence is a 


" 
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major consequence of group discussion and 
that shifts in mean scores result from system- 
atic differences in changes made by indi- 
viduals who are relatively more risky or 
cautious than others in a particular group on 
a particular item. If those holding different 
initial opinions change in systematically dif- 
ferent ways, it would appear that the within- 
group distributions of initial choices is an 
important determinant of shifts. It is not 
obvious, however, exactly which properties of 
this distribution are critical. 

Burns (1967) has proposed an extremity- 
variance theory in which it is assumed that 
the deviation of the mean of initial choices 
from the midpoint of the scale and their 
variance jointly affect the magnitude and 
direction of shifts. He has tested two slightly 
difierent models based on this assumption 
against several sets of unanimous group de- 
cisions and has found that they successfully 
predict from 4995 to 6296 of the decisions. 
Although these results are rather impressive, 
it must be noted that both models contain 
two parameters not derivable from the initial 
distributions. And Haley and Rule (1971) 
have obtained data which they interpret as 
being inconsistent with Burns’ theory. What- 
ever the ultimate evaluation of these models, 
the attempt to make specific predictions is 
commendable, and Burns’ percentage of suc- 
cess provides a base line against which to 
Compare alternative models. 

The assumption that the extremity of a 
Stoup’s mean of initial choices affects the 
direction of shifts has also been made by 
theorists who interpret shifts as instances of 
a more general “polarization effect." Data 
Collected by Doise (1969), Fraser, Gouge, 
and Billig (1971), McCauley (1970), Mosco- 
Vid and Zavalloni (1969), and Myers and 

ishop (1970) are generally consistent with 
this assumption, but there are some statistical 
) Problems in interpreting the results of several 
| Of these studies. 

The possibility that the skewness of the 
Within-group distribution of initial choices is 
4 determinant of shifts has been examined 
Y Vinokur (1969). From an analysis of his 
own data and a reanalysis of data obtained 
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by others, he concluded that skewness does 
have an effect on shifts but that it cannot, 
by itself, account for the general tendency 
to find risky shifts in total scores on the 
CDQ. The effects appear to be different, 
moreover, for risky and cautious shifts. A 
more intensive investigation of the effects of 
skewness should be informative. 

Since the basic question motivating this 
entire line of investigation is whether the 
decisions of groups are riskier than those of 
individuals, it is surprising that theories of 
group decision making have rarely been em- 
ployed in the analysis and interpretation of 
data on choice dilemmas. From the perspec- 
tive of these theories, the initial choice of 
each individual represents his preference for 
the group decision, and the group’s task is 
to employ some decision rule for combining 
these preferences into a single choice for the 
group as a whole. In order to predict a par- 
ticular group decision, it is necessary to know 
(a) the set of possible choices, (5) the dis- 
tribution of individual preferences over this 
set, and (c) the decision rule employed by 
the group. 

As an illustration of this type of approach, 
we present a reanalysis of data presented by 
Cecil (1967) for 43 three-person groups who 
were required to make unanimous group de- 
cisions for each of four items of the CDQ. 
The set of possible choices provided in this 
study, as in many others, can be conceived 
as forming an odds scale with discrete points: 
1, 3, 5, 7, 9, and 10. Let us examine the 
within-group distributions of initial choices 
on this scale to see whether there are decision 
rules that might account for the resulting 
group decisions. 

Majority rule. In 90 (52%) of the dis- 
tributions, at least two of the initial choices 
are the same. The most natural procedure for 
these groups would be to select the points on 
the odds scale preferred by the majority. 
Actually, 7995 of them do. And it is inter- 
esting to note that essentially the same results 
are obtained whether the majority is rela- 
tively risky or cautious. For the entire 
set of 172 decisions, we thus find that 41% 
can be successfully predicted simply by 
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assuming that they conform to a principle of 
majority rule. 

Coalition rule. In the remaining distribu- 
tions, each choice is unique, so that the prin- 
ciple of majority rule cannot apply. For 60 
(35%) of these, however, it is meaningful to 
conceive of a principle of coalition rule. To 
do so, two types of distributions must be 
distinguished. First, in 36 of the distributions, 
two of the choices are adjacent on the scale, 
whereas a third is separated from them by 
at least 1 point (e.g., 1-3-7). Here it is rea- 
sonable to assume that the pair of adjacent 
members form a coalition on the basis of 
their relative similarity of preferences. Sec- 
ond, for 24 distributions, all three choices 
are adjacent, but the distribution is not sym- 
metric around Point 5 (eg. 1-3-5, 5-7-9, 
7-9-10). Since there is good reason to believe 
that more extreme choices are made with 
greater confidence or commitment, we may 
assume here that the two more extreme mem- 
bers form a coalition on the basis of their 
greater involvement in relatively similar pref- 
erences, If a group adopts the principle of 
coalition rule, we might expect its choice to 
coincide with the mean of the choices of the 
members of the coalition. But since the two 
members are adjacent on the odds scale, we 
can predict only that the group will select 
one of their choices, presumably with equal 
frequency over a large number of cases. 
Cecil’s data show that in fact 80% of these 
group decisions are consistent with this prin- 
ciple. Separate analyses for relatively risky 
and cautious coalitions reveal that both con- 
form to the decision rule quite well, but 
there is a tendency for risky coalitions to do 
so more than cautious ones.* 

Mean rule. For the remaining 22 distribu- 
tions (13%), there is no obvious basis for 
forming coalitions. Either the choices are all 
adjacent but symmetric around Point 5 
(3-5-7), or no two are adjacent (e.g., 1-5-10). 


8 Jt should be noted that several of the cautious 
coalitions involve choices of 10. There is some ques- 
tion as to whether these choices should be considered 
as lying on the scale since, from the point of view 
of decision theory, they violate the assumption that 
v is less than V. In future research, special attention 
should be directed to the behavior of subjects who 
make these choices. 
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Here the simplest procedure might be for the 
group to select a choice closest to the mean 
of the distribution. Actually, 50% of these 
groups do so. Clearly, this principle does not 
predict decisions as well as the other two. It 
is possible that coalitions do form under 
these relatively rare conditions, but there are 
no apparent regularities in the data. 

In summary, the principles of majority or 
coalition rule permit predictions for 87° of 
the decisions, and 79% of these are correct. 
The principle of mean rule applies to 153% 
and successfully predicts 50% of them. Taken 
together, they accurately predict 75% of all 
group decisions. 

Although the results of a single study can 
hardly be considered conclusive, they are suf- 
ficiently promising to encourage further explo- 
ration of decision-making models of this sort. 
Since the model employed here applies only 
to three-person groups operating with a small 
number of available choices, the first step 
would be to extend the theory of coalition 
formation to larger groups with a larger set 
of choices. But even if this problem is solved, 
several questions remain as to whether such 
a model could deal with the full array of 
basic findings concerning choice dilemmas. 
Let us look briefly at some of these questions. 

First, it is clear that according to this 
model a group will display a risky shift on 
a particular dilemma if and only if it has a 
relatively risky majority or coalition. This 
prediction, it might be argued, is inconsistent 
with findings reported by Wallach and Mabli 
(1970), who obtained a risky shift when they 
created three-person groups so that each 
would have a relatively cautious majority. 
Actually, however, this study does not pro- 
vide an adequate test of the model, since 
the groups were formed on the basis of total 
scores for 10 items of the CDQ, and shifts 
refer to changes in means of these scores. A 
reanalysis of these data separately for items 
might lead to a rather different interpreta- 
tion of their theoretical significance. The re- 
sults of the analysis of the effects of skewness 
reported by Vinokur (1969) may appear to 
present a more serious difficulty for the model, 
since this analysis was conducted on an item- 
by-item basis. But, again, the test cannot 
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be taken as definitive, since the model makes 
no simple predictions about the effects of 
skewness. Although the principle of majority 
tule does have direct implications for these 
effects, the principles of coalition rule and 
mean rule do not, It is quite possible for 
coalitions to form even when the distribution 
of initial choices is symmetrical. 

It might also be argued that no model of 
group decision making can appropriately be 
employed to account for those changes in 
individual choices that result from discussions 
where there is no requirement for an explicit 
group decision. But this argument is not com- 
peling. The finding that individual choices 
tend to converge to the same choice as that 
selected by an explicit group decision sug- 
gests that group discussion eventuates in an 
implicit eroup decision as to which choice is 
L'correct." If this conclusion is justified, then 
"both kinds of shifts could be treated as 
‘effects of decision making, and a single model 
Could be applied to both. 
| Finally, there is the problem of how to 
P deal with the finding that substantial shifts 
in individual choices occur when subjects 
“simply observe a group discussion. Since these 
Choices are not part of a decision-making 
“Process, it is clear that shifts cannot be 
It must be 
; however, that the inability of the 
Model to account for these shifts in no way 
-Implies that it cannot explain shifts resulting 

‘Tom actual participation in group discussion. 
I The determinants of the two effects need not 
be the same, 

Since these problems do not appear to be 
insurmountable, efforts to account for the ef- 
ects of group discussion by means of some 
decision-making model should be encouraged. 
ven if these efforts eventually proved to be 
"Unsuccessful, they will serve to focus atten- 
“on on the concrete processes that actually 
determine the basic findings which we seek 
explain, 


PROBLEMS IN GENERALIZING FINDINGS 


— One basic question remains to be consid- 
"fred: What do the findings for choice dilem- 
$55 tell us about the relative riskiness of 
E hdividuals and groups in the real world? 
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Clearly, they do not justify the proposition 
that groups are invariably riskier than indi- — - 
viduals, for the effects of group discussion 
depend upon the content of the dilemmas and 
probably upon the distribution of initial 
choices within groups. A proper statement 

of the findings will have to recognize these 

and possibly other dependencies. But even 
if these influences are taken into account, E 
the implications of the research for natural 
groups are not immediately obvious. The p 
question of generalization requires careful 
consideration of the conditions under which 
the results have been obtained. 

It is important, first of all, to recall that 
nearly all research on choice dilemmas has 
used a repeated-measures design. Strictly 
speaking, the results refer to Shifts in choices 
and not to differences between the choices o 
individuals and groups who are considering 
the dilemmas for the first time. It is possible, 
therefore, that individuals and groups who f 
make choices de novo on the same dilemma: 
would not differ in their choices, Data pre- 
sented by McCauley, Teger, and Kogan 
(1971) suggest that this may actually be the 
case. If this finding were to be confirmed, 
the results of research on choice dilemmas 
could be generalized only to natural settings 
in which group members formulate personal 
preferences prior to group discussion. 

The problem is further complicated by the 
fact that the groups studied have certain 
“unnatural” properties. They are created for 
experimental purposes and consequently have 
no history, future, established Structure, or 
significant enduring relationships with a sur- 
rounding social system. Results derived from 
such groups can be extended to groups in 
general only if it is assumed that they are not 
dependent upon these unusual properties. 
Unfortunately, almost nothing is known about 
the effects of group properties on the basic 
findings. One might expect that in a group 
with a status hierarchy, higher status mem- 
bers would have more influence on group 
decisions, but Mackenzie (1970) and Siegel 
and Zajonc (1967) found little support for 
this assumption. On the other hand, Dion, 
Miller, and Magnan (1971) have obtained 
data indicating that shifts in CDQ scores 
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may be quite different in groups with different 
levels of cohesiveness. Further investigation 
of the effects of these and other group 
properties is clearly needed. 

The question as to whether results can be 
generalized from ad hoe experimental groups 
to natural groups with an extended history 
is especially interesting. If, as is maintained 
by one version of value theory, the critical 
feature of group discussion is to inform sub- 
jects of the risk-taking dispositions of others, 
then groups with a history of decisions in- 
volving risk would not be expected to show 
shifts. The confirmation of this hypothesis 
would seriously limit the applicability of 
findings to the real world. 

The properties of choice dilemmas also pose 
problems of generalization. Each dilemma 
refers to a hypothetical situation and calls 
for a recommendation to be given to an 
imaginary person who is not a member of the 
group. The assigned task is to select the 
minimum of odds of success that would jus- 
tify undertaking the uncertain alternative 
rather than to recommend simply whether 
it should be undertaken. And the conse- 
quences of abiding by the recommendation 
do not directly affect the members of the 
group. Few groups, if any, have ever faced 
a decision with this combination of properties. 

Since it can no longer be assumed that 
choice dilemmas measure a general disposi- 
tion to take risks, an understanding of the 
effects of these unnatural properties of 
choices becomes critical. A few investigators 
have rewritten the choice dilemmas so as to 
alter the relationship between the subject and 
the hypothetical person described in the 
dilemma, Results reported by Dion et al. 
(1970), Fraser et al. (1971), Graham and 
Harris (1969), and Rabow et al. (1966) sug- 
gest that this relationship may affect shifts 
in scores, but the findings are not entirely 
consistent. Several investigators have also at- 
tempted to assess the importance of the 
hypothetical nature of outcomes by com- 
paring individual and group decisions in 
situations where the outcome of the decision 
has direct and tangible consequences for the 
participants. But the experiments reported by 
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Bem, Wallach, and Kogan (1965), Kogan and 
Zaleska (1969), Marquis and Rietz (1969), 
Pruitt and Teger (1969), Zajonc, Wolosin, 
Wolosin, and Sherman (1968, 1969) produce 
quite heterogeneous results. Sometimes groups 
are riskier than individuals, sometimes more 
cautious, and sometimes there is no differ- 
ence. A great deal more research is required 
before we can confidently generalize from 
choice dilemmas to the kinds of decisions 
faced by natural groups. 

The experimental conditions employed in 
research on choice dilemmas certainly cannot 
be conceived as simulating any typical 
natural setting. Nor can they be considered 
a representative sample of any known uni- 
verse of such situations. How then can the 
findings from this research be generalized to 
the real world? The only feasible way would 
seem to be through the use of theory. If the 
findings can be given theoretical meaning, 
then they can be applied to any empirical 
situation that satisfies the requirements speci- 
fied by the theory. Thus, for example, if 
there were a clearly defined theoretical con- 
cept of risk, it could then be applied to both 
artificial and natural settings despite any 
phenotypic differences between them. A 
major reason for hesitation in generalizing 
findings from CDQ scores is the ambiguity 
concerning the theoretical meaning of these 
scores. 

The problem of generalization, then, can- 
not be solved until the theoretical significance 
of the experimental findings is clearly under- 
stood. There would appear to be no short cut 
to relevance. 


CONCLUSIONS 


The research of the past decade has pro- 
duced a number of firm findings and several 
ingenious explanations." One is left, how- 
ever, with the uncomfortable feeling that 
theoretical thinking has not kept pace with 
the discovery of facts. Most publications still 
refer to the risky shift as if it were a well- 
established phenomenon. And until quite re- 
cently most theoretical efforts have been di- 
rected to the task of accounting for the “fact” 
that groups make riskier decisions than indi- 
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viduals. But the facts are not so simple. The 
assumption that CDQ scores measure a uni- 
tary disposition to take risks is no longer 
tenable, and it is now clear that shifts in 
means of these scores poorly reflect the 
concrete changes in specific choices brought 


about by group discussion. Explanations of 
the risky shift which postulate some mecha- 
nism that uniformly moves choices toward 
greater risk are unable to account for the 
total body of findings. 

As so often happens in science, a paradigm 
has served to stimulate the discovery of facts 
which reveal the inadequacy of the paradigm 
itself. Research employing the risky-shift 
paradigm has cast serious doubt on the exis- 
tence of the risky shift and on the use- 
fulness of CDQ scores in the analysis of 
tisk taking. The findings even suggest the 
possibility that the basic phenomena may 
have been mislabeled and that they have 
nothing specifically to do with risk at all. It 
would be unfortunate, however, if dissatisfac- 
tion with the original paradigm were to 
diminish interest in the findings that have 
been established. The critical problem is to 
find a conceptual framework capable of deal- 
ing with phenomena that have proved to be 
More complex than first anticipated. 

Much of the uncertainty about the rele- 
vance of the research on choice dilemmas to 
the concept of risk might be reduced with 
the help of mathematical decision theory, As 
noted above, portfolio theory (Coombs & 
Huang, 1970) is well suited to the analysis 
of choice dilemmas and provides a means for 
Clarifying the role of risk in both initial 
choices and shifts. The model proposed by 
Miller (1970) approaches the problem in a 
Somewhat different way. Both of these con- 
Ceptualizations should be carefully examined 
to discover how well they can deal with the 
total set of findings. 

A curious feature of the history of research 
on choice dilemmas is the early rejection of 
theories of conformity and social influence 
as explanatory orientations. The risky shift 
Was interpreted from the beginning as being 
Mconsistent with previous findings from re- 
Search on conformity. The argument against 
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conformity theory has relied mainly on the 
assertion that it implies a convergence of post- 
discussion choices toward the mean of initial 
choices. Actually, however, conformity theory 
provides no clear basis for making any pre- 
dictions about the location, or content, of an 
emergent norm from the beliefs or preferences 
of group members, since it has concentrated 
primarily on processes arising after group 
norms have been established. To the extent 
that it makes use of theories of social influ- 
ence, it would predict than an emergent norm 
will coincide with the mean of initial choices 
only under rather special circumstances, as 
for example when all members have equal 
weight in the influence process. The model 
developed by Miller (1970) assumes con- 
vergence but predicts shifts in means as a 
consequence of differential weights among 
group members. This model, together with 
those of French (1956) and Harary (1959), 
might well provide a way to relate the find- 
ings on choice dilemmas to theories of 
conformity and social influence. , 

Since it is not yet clear how risk enters 
into the choices made by subjects, it is pos- 
sible that shifts would best be conceived as 
instances of the effects of group discussion on 
attitudes. Such a view has been advocated 
by Moscovici and Zavalloni (1969), and its 
plausibility has been enhanced by results 
reported by Fraser et al. (1971). These in- 
vestigators asked subjects to respond to each 
dilemma by means of a Likert scale on which 
they were to indicate how strongly they 
favored one or the other of the two alterna- 
tive courses of action. When these responses 
were compared with the usual probability 
choices, there was a remarkable similarity 
between the two kinds of responses both 
before and after group discussion. If re- 
sponses to choice dilemmas are in fact indi- 
cators of attitudes, then theories of attitude 
change should help to improve our under- 
standing of the processes that bring about 
changes in these responses, 

There is a real possibility, however, that 
the attitudes related to choice dilemmas have 
certain special characteristics which influence 
the way in which group discussion affects 
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them. The findings concerning subjects’ ex- 
pectations about the choices of others, the 
choices they most admire, and their evalua- 
tions of people who make various choices 
suggest that the dilemmas generate a conflict 
between ideals and reality. Several interest- 
ing suggestions have been advanced by 
Higbee (1971), Jellison and Riskind (1970), 
Levinger and Schneider (1969), and Pruitt 
(1969) concerning the nature of this conflict 
and the role of group discussion in resolving 
it. It may turn out that such a conflict is 
critical for the occurrence of shifts and that 
group discussion does not affect all attitudes 
alike. Since so little is known about the 
effects of group discussion on attitudes of any 
sort, the findings for choice dilemmas may 
contribute valuable information about this 
form of attitude change. 

Perhaps the greatest uncertainty of all sur- 
rounds the question of how best to conceptu- 
alize the processes of group discussion and 
group decision, Should group discussion be 
conceived as providing persuasive arguments 
concerning the correct answer to a problem 
or as a mechanism for combining individual 
preferences into a single group decision? 
Despite the demonstrated importance of 
group discussion for the occurrence of shifts, 
almost nothing is known about its nature. 
It is clear that discussion serves in some 
way to reconcile differences in initial choices 
and brings about a reduction in their vari- 
ance, but it is not clear how this is accom- 
plished. A direct examination of the content 
of group discussion is needed to identify the 
immediate determinants of shifts in choices. 

The search for a suitable theoretical home 
for the findings established by the research 
on choice dilemmas will require careful atten- 
tion to the concrete processes that determine 
both initial choices and changes in these 
choices. Unless these processes are clearly 
identified and placed in an appropriate theo- 
retical context, the work on the risky shift 
may well become nothing more than an 
interesting episode in the history oí social 
psychology. 
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decisions were not significantly 


larly in generalizing to those 


Groups have been shown to take more 
_ risks than individuals in several varieties of 
- decision tasks and under many conditions 
(Stoner, 1968). This risky-shift effect has 
been studied by means of a standard para- 
- digm. Individuals first make decisions on a 
- task involving risk. After this pretest they 
are required to discuss the task as a group 
and arrive at a group decision. Finally, indi- 
- viduals are again asked for their private deci- 
Sions. The difference between the mean level 
of risk chosen during the individual pretest 
and the level of risk chosen during the group 
Posttest has been termed a “shift.” The out- 
come has typically shown an increase in risk 
- from the individual to the group decision. 
It should be noted that the standard para- 
- digm does not permit us to conclude that dis- 
cussion per se produces the risky shift, but 
.Tather that discussion produces this shift in 
the context of a pretest. Yet, many of the 
theories of the causes of this phenomenon seem 
- to assume that it is produced by group dis- 
cussion per se, and some writers attempt to 
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The risky-shift paradigm (pretest, discussion to consensus, 
open the possibility that the risky shift depends upon both an initial pretest 
and a group discussion. Without a pretest, 
not significantly different from individual 
familiarization, group decisions showed a significant risky shift. However, group 
less risky in the no-pretest condition than in 
the pretest or familiarization conditions. These results suggest that caution 
should be observed in generalizing the findings of risky-shift research, particu- 
real-life situations where group members are 
unfamiliar with the decision task before entering discussion. 
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posttest) leaves 


group decisions in this study were 
decisions; with either pretest or 


generalize the finding to real-life groups in 
which the members are not exposed to the 
task until the group interaction. 

If we want to know the effect of group 
discussion per se, then the appropriate con- 
trol group (Solomon, 1949) for the standard 
risky-shift paradigm is a condition in which 
group discussion is not preceded by individual 
decision. That is, the members of a group 
would have their first exposure to the deci- 
sion task in the context of the group, rather 
than during an individual pretest. Group deci- 
sions in this no-pretest condition would then 
be compared to the initial individual decisions 
of other subjects. The typical within-group 
risky-shift paradigm would thus be replaced 
by a between-group design. The present study 
employed such a design. 

If the use of an initial pretest proves to be 
an important determinant of the level of 
risk subsequently chosen by the group, it 
would be useful to understand what aspect of 
the pretest produced that effect. One obvious 
possibility is that familiarity with the con- 
tents of the decision task plays a major role, 
A third condition was therefore included in 
the study to assess the role of familiarity. 


METHOD 


A total of 135 male undergraduates at the Uni- 
versity of Pennsylvania were paid for their partici- 
pation in the experiment, Six subjects were recruited 
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TABLE 1 


Meas Risk Scores ox Tux. 12-Item Test, 
vor Each CONDITION 


| 
Consensus | Posttest 


Condition | ht a ue Cu NC 

I. Standard risky shift | 664 | 392 | 596 
IL. Nopretest 62.0 61.9 
III. Familiarization only | 60.4 


| 58.7 


Note.—Ten groups were run in each condition. 


for each group. Owing to the failure of some sub- 
jects to show up for the study, the actual number of 
subjects in each group was either 4 or 5. Ten groups 
were assigned randomly to cach of three conditions. 
A total of 45 subjects were run in Condition I, 48 
in Condition II, and 42 in Condition III. The mea- 
sure of risk taking was the 12-item Choice-Dilem- 
mas Questionnaire devised by Kogan and Wallach 
(1964). 

Condition I. Subjects followed the usual risky- 
shift paradigm. They first answered the question- 
naire individually (pretest). They were then told 
that their first answers had been for the purpose of 
familiarizing themselves with the questions and were 
asked to discuss the questions as a group and arrive 
at a unanimous group decision on each. Aíter group 
discussion to consensus, they were asked to indicate 
once more their personal decisions on the questions. 

Condition II. There was no individual pretest; 
subjects began immediately with group discussion to 
consensus. After each discussion, the subjects indi- 
cated their personal decisions. 

Condition III. Subjects were given the question- 
naire and told to read it “in order to familiarize 
yourself with all of the situations." They were specifi- 
— cally told not to answer any of the dilemmas, After 
this familiarization procedure, the group discussed 
the questionnaire and arrived at a unanimous deci- 
sion on each item. After each discussion, the subjects 
indicated their personal decisions. 


TABLE 2 


Comparison OF INITIAL INDIVIDUAL Risk wrt 
Group DECISIONS AND FINAL INDIVIDUAL 
Decistons my ALL CONDITIONS 


Comparison Mean risky shift df t 
Condition T 
E: vs. Ib 7.2 9 3.18*** 
a vs. Ic 6.8 9 „4244k 
Condition II hs 
Ia vs. IIb 4.4 18 .99 
Ta vs. Ile 4.5 18 1.0 
Condition TIT l 
Ta vs. IIIb T 18 | 2,15** 
Ta vs. IIIc 6.0 18 Eras 
* p <10. 
** p « 05, 
> <.01 
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RESULTS 


The measure of risk for each of the 12 
items on the Kogan and Wallach questionnaire 
is the minimum odds of success in the risky 
alternative required by a subject in order to 
recommend the risky over the cautious alter- 
native. The scale goes from 1/10 to 10/10, 
where lower minimum odds of success imply 
greater riskiness. Each response was multi- 
plied by 10 to get a risk score on that item. 
When individual decisions were required, each 
group received a score which was the average, 
over group members, of the individual risk 
scores. 

Table 1 presents the mean group risk scores 
in all conditions. Table 2 contains compari- 
sons of the individual pretest in Condition I 
with group decisions and final individual de- 
cisions in all three conditions. As may be seen 
in the first two lines of Table 2, the standard 
risky-shift paradigm produced a significant 
risky shift. That is, there was a significant 
increase in the level of risk chosen by a group, 
and later by the individuals in the group as 
compared with the initial pretest decisions. 

The effect of discussion alone, without pre- 
test sensitization, is found by comparing the 
mean individual pretest of Condition I (Ia) 
with the mean gr consensus (IIb) and 
final individual posttest (IIc) in Condition II. 
As may be seen in Table 2, these differences 
are not significant. Thus, group discussion 
without pretest or familiarization did not 
produce a significant increase in level of risk 
over that chosen initially by individuals, al- 
though there was a nonsignificant trend in 
that direction. 

When a familiarization procedure preceded 
group discussion (Condition III) the risky 
shift appeared once more. Group decisions 
were significantly more risky than the indi- 
vidual pretest in Condition I. Final individual 
decisions also tend to be more risky than the 
individual pretest. 

The level of risk chosen by groups in the 
no-pretest condition did not differ signifi- 
cantly from that chosen by groups in the 
other two conditions (ITb versus Ib: £ = .58, 
df = 18, ns; IIb versus IIIb: / = —.13, df = 
18, ns). 


Discussion 


We noted earlier that if group decision in 
- the absence of a pretest did not show signifi- 
cantly greater risk than individual decisions, 
we might be forced to reconsider the nature 
of the risky shift. The results on this point 
are inconclusive, Although it is true that the 
group decisions in the absence of a pretest 
were not significantly riskier than individual 
pretest decisions, it is likewise true that the 
> level of risk chosen by the groups in the no- 
pretest condition did not differ significantly 
Írom the level chosen by groups which had 
received a pretest. In fact, a comparison of 
the means in Table 1 shows that the levels of 
_tisk chosen by the groups in the three condi- 
tions were closer to each other than they were 
lo the level chosen during the individual pre- 
test in Condition I. Thus, the trends in the 
data suggest that absence of the pretest, if it 
makes any difference at all in the nature of 
the risky shift, is not making a major differ- 
ence. 
On the other hand, we did not find a sig- 
nificant difference between individual decision 
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and group decision in the absence of a pre- 
test. This failure suggests that writers about 
the risky-shift phenomenon should exercise 
two kinds of caution: (a) Unless and until 
significance is found in this comparison, it 
would be wise to avoid the unqualified gen- 
eralization that “groups take more risk than 
individuals.” (6) The phenomenon should 
not be described as a vigorous and sizable 
one; rather, it appears to be so weak that it 
can be readily obtained only when intersub- 
ject variation is rigidly controlled by a before- 
and-after design. 
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Using consumer products which had been scaled for degree of perceived product 
risk, this study found that group discussion produced a risky shift with three 
low-perceived-risk products, no shift with three medium-perceived-risk products, 
and a conservative shift with three high-risk products. This result confirmed 
a previous study, which reported a negative relationship between amount of 
risky shift and the seriousness of negative consequences associated with the 


risky choice. 


Despite the large number of replications 
with a wide variety of populations (cf. Kogan 
& Wallach, 1967), evidence for the risky-shift 
phenomenon is severely limited by the fact 
that the overwhelming majority of studies 
have used the same procedures and the same 
set of 12 hypothetical risk situations—the 
Stoner problems (Stoner, 1961). 

When different hypothetical risk situations 
have been used, it has been found that group 
discussion can lead not only to a risky shift 
but also to a conservative shift or to no shift 
at all (Flanders, 1970; D. G. Myers, 1967; 
Nordhøy, 1962; Rabow, Fowler, Bradford, 
Hofeler, & Shibuya, 1966; Stoner, 1968; 
Zajonc, Wolosin, Wolosin, & Sherman, 1968, 
1969). This conclusion was tested in the pres- 
ent study with a novel set of stimuli, namely, 
consumer products which varied along a di- 
mension called *perceived product risk." As 
defined by Cunningham (1967a), perceived 
product risk is a function of the probability 
that the product might fail and of the severity 
of negative consequences if the product does 
fail. Applying this definition in a telephone 
survey of 1,200 housewives, Cunningham found 
that the purchase of headache remedies was 
considered more risky that the purchase of 
fabric softeners, which in turn was considered 
more risky than the purchase of dry spaghetti, 
In addition, Cunningham (1967b) demon- 
strated that consumers who perceived high 
risk in the purchase of a particular product 
were more likely to report that they had dis- 
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cussed that product with friends during the 
last 6 months. 


Of the nine consumer products chosen for 4 


the present study, three were high, three were 
low, and three were medium on perceived 
product risk. If we ignore the contrary evi- 
dence from the studies listed above and fol- 
low Wallach and Kogan’s diffusion of re- 
sponsibility theory, we would predict a risky 
shift following group discussion for all nine 
products, In fact, we might even predict that 
the higher the level of perceived product risk, 
the greater the opportunity for risk reduction 
and the greater the risky shift following group 
discussion of the product. 

On the other hand, if the negative evidence 
cited above is considered in combination with 
Clark and Willems’ (1969) finding that there 
was a negative relationship between perceived 
seriousness of consequences (in the 12 Stoner 
items) and degree of shift toward risk, we 
are led to quite different predictions. Instead 
of a positive relationship between perceived 
product risk and the amount of shift pro- 
duced by group discussion, we would expect 
a negative relationship. The most extreme 
form of a negative relationship would be a 
risky shift for products high in perceived risk 
and a conservative shift for products high in 
perceived risk. Since previous evidence was 
contradictory and consumer products had not 
been used as stimuli in previous experiments, 
the authors stated their hypothesis in a less 
extreme form, namely, that amount of risky 
shift following group discussion would be 
negatively related to the level of perceived 
product risk. 


D~ 


* 


METHOD 


Forty-seven consumer products were rated on two 
10-point scales by 50 females from an introductory 
psychology course. Half of the subjects rated the 
products first on a Probability of a Product Failing 
(PPF) scale and then on a Severity of Negative 
Consequences (SNC) scale. The order was reversed 
for the remaining subjects. Also, within subject 
groups there were four different randomly ordered 
lists of products. 

The six products with the highest mean ratings 
and the lowest standard deviations on both the PPF 
and SNC scales were chosen as the high-risk products: 
deodorant, hair conditioner, shampoo, cold remedies, 
perfume, and hose. In like manner six medium-risk 
products—body oil, hand cream, nasal spray, suntan 
lotion, eye shadow, and toothbrush—and six low- 
risk products—bobby pins, facial tissues, nail polish, 
cuticle remover cream, chewing gum, and Q-tips— 
Were selected. There was no overlap among high, 
medium, and low item sets on either the PPF or 
the SNC scales, 

Within cach of the three risk-level categories, 
three of the six products were chosen randomly as 
the products to be discussed by the experimental 
group. The remaining products in each risk category 

Were not discussed by the experimental group, but 

Were included in the list of products to be rated in 

both pre- and postdiscussion trials, 

In the experimental (Le,  product-discussion) 

group, 40 introductory psychology females were 

tested in 10 groups of 4 each, Friends and roommates 

Were not allowed in the same discussion group. In- 


F structions for prediscussion product ratings were as 
follows: 


You will be given a list of products which are 
frequently purchased by college girls. Imagine that 
you are shopping for your favorite brand of a 
particular product and you find that the store 
does not have your favorite brand. There is an 
introductory offer which would save you money if 
the new brand was as satisfactory to you as your 
favorite brand. You need the product now, and it 
Would be an inconvenience for you to go else- 
Where to look for your favorite brand. Listed be- 
low are several chances or odds that the new 
Unknown brand would prove as satisfactory as 
Your favorite brand. Please determine the lowest 
number of chances of success that you would de- 
Mand before you would give the new brand a try. 


The chances listed were 1, 3, 5, 7, and 9 chances 
n 10 that the new brand would prove as satisfactory 
^5 the subject’s favorite brand. The choice was also 
available (scored as 10) not to purchase the new 
een brand no matter what the chances of 
‘atisfaction, 

example was given to make certain that all 
Subjects understood the instructions. Subjects were 
7 Presented with the random list of all 18 products 


asked to rate each product according to the 
ctions, 
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Instructions for the group discussion of products 
were as follows: 


These instructions are the same as before. However, 
now you are to work together as a group. I will 
give the group a list of products. This time I 
want you to discuss each product on the list and 
reach a group decision. Most Eroups are able to 
come to some decision if those who disagree will 
restate their reasons and if the product is thought 
about carefully, Discuss each product until you 


agree and record the group decision on the answer 
sheet. 


Each subject was then given a random list con- 
taining the nine products selected for discussion. One 
member of the group was asked to record the group 
decisions. Group closure on all nine products was 
reached within 5-15 minutes for different groups. 


The average group discussion time was about 10 
minutes, 


Instructions for the 


Postdiscussion rating of prod- 
ucts were as follows: 


This questionnaire is the same as you had initially, 
The instructions are the same as before. Please 
reread the instructions. Go back Over each product 
and record your answers as you feel personally, 
Do not try to recall the group discussion. Just 
think about each product again, I am not trying 
to measure your memory but how you think right 
now. Your answer may or may not be the same 
as the group decision, 


Procedures for the primary control group (inter- 
polated task discussion) were the same as those used 
for the experimental group with the following ex- 
ception: instead of a group discussion and rating of 
consumer products, subjects were asked to discuss a 
speech given by the late Dr. Martin Luther King. 
In this condition there were 40 introductory psy- 
chology females run in groups of 4 each. 

In addition to the primary control group, 20 sub- 
jects completed the ratings of the 18 consumer 
products with a 2-week interval between the two 
rating sessions, No systematic Changes in products 
ratings were found in this control group. 


RESULTS 


The effects of perceived product risk level 
are readily apparent in Table 1, which com- 
pares the average of prediscussion individual 
decisions against group consensus decisions, 
There was a significant conservative shift for 
high-risk products, no shift for medium-risk 
products, and a significant risky shift for low- 
risk products. For all three high-risk prod- 
ucts, the shift was to a more conservative 
Choice of risk levels, while the reverse was 
true for all three low-risk products. Con- 
firming Madaras and Bem (1968), there were 
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TABLE 1 


AVERAGE INDIVIDUAL PREDISCUSSION SCORES AND 
Group Consensus Scores rog tae 10 Groups 
IN THE Propuct Discussion CONDITION 


Perceived Mean Group | Mean 

product pretest | consensus t 

flak level exe score | difference | 

High 20.58 24.50 3.92 |. 3.90* 

Medium 18.70 17.90 —.80 | «1.00 

Low 12.28 8.80 —3.48 3.00* 

Note.—Scores based on three products within each product 
tisk level. 

*p <.05, 


no significant shifts in the items which were 
rated but not discussed. 

The effects of perceived product risk level 
are also apparent in Table 2, which shows the 
amount of shift between prediscussion and 
postdiscussion individual decisions. Once again 
in the product discussion group there was a 
conservative shift for high-risk products, no 
shift for medium-risk products, and risky 
Shift for low-risk products. The significant 
shifts for the high- and low-risk products were 
produced by many people making small shifts, 
not because a few people made very large 
shifts after group discussion. In the inter- 
polated task control group, there were no Sys- 
tematic changes between pretest and posttest 
individual decisions. 

A 2 X 3 analysis of variance with repeated 
measures on the perceived-risk dimension was 
computed for the difference scores between 
pre- and postdiscussion decisions. The inter- 
action between the two independent variables, 
product discussion versus interpolated activity 
and level of perceived product risk, was sig- 
nificant at the .05 level with a conservative 
F test (J. L. Myers, 1966), thereby confirm- 
ing the notion that the effect of group dis- 
cussion varied with the level of perceived 
product risk. 


Discussion 


Using substantially different stimuli, this 
experiment confirmed previous studies which 
found that group discussion leads not only to 
tisky shifts but to conservative shifts or no 
shift at all. More important, the results of 
this study confirmed Clark and Willems’ 
(1969) finding that seriousness of negative 
consequences is one of the item characteristics 
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which influences the kind of shift which oe- 
curs, In a related experiment, Stoner (1968) 
found that direction of shift was a function 
of the balance between the seriousness of the 
negative consequences and the perceived value 
of the positive outcome. 

If group discussion produces conservative 
shifts and no shift as well as risky shifts—and 
if future experiments continue to demonstrate 
that the outcome can be predicted as a func- 
tion of prescaled item characeristics, what 
theory would best account for these results? 
The authors would like to suggest that at 
least three factors are involved. First, group 
discussion leads to a clarification of stimulus 
characteristics, to a better understanding of 
such things as the costs of negative con- 
sequences, the value and social acceptability 
of the positive outcomes, the probability of 
success or failure, and the weighting of these 
single cues in forming a combined cue judg- 
ment. 

Second, group discussion leads to role clari- 
fication, to a better understanding of the 
behavior which is expected in the experi- 
mental situation. Presumably, the definition 
of appropriate behavior would include such 
things as paying careful attention to item 
characteristics, varying risk level choices as 
a function of item characteristics, and where 
possible, making extreme choices that fall 
near one end of the scale. 

Last, there might be a bandwagon effect. 
When the group members discover that most 
of them are leaning in the same direction on 
an item—toward a high-, medium- or low- 
risk choice, this confirmation creates a climb- 


TABLE 2 


MEAN DIFFERENCE SCORES BETWEEN PRETESTS AND 
POSTTESTS FOR THE PRODUCT DISCUSSION AND 
INTERPOLATED TASK Discussion CONDITIONS 


Condition 

Perceived SEND A Un 
Tale Interpolated 
risk level Produet OSERE 

d iscussion n Difference Z 
High 2.05 .00 2.05 5.305 
Medium 63 —.15 —.50 | < 1.00 
Low —2.32 —40 | —2.22 5.84 


*p «.05 (using Tukey's HSD test as described in. Kirk, 
1968), 


K 


CONSUMER PRODUCTS AS STIMULI 


on-the-bandwagon pressure for perceiving that 
paticular level of risk as appropriate. On items 
where a conservative or a risky choice is 
indicated, this bandwagon tendency can carry 
group decision and postdiscussion individual 
decisions to a more extreme choice than would 
be justified by an objective appraisal of the 
item characteristics, 

The authors believe that this combination 
of cognitive change, social role clarification, 
and affective pressure will produce more ex- 
treme judgments by laboratory groups on a 
variety of stimuli, not just those involving 
tisk (e.g., Doise, 1969; Moscovici & Zaval- 
loni, 1969). If so, this suggests the possibility 
that risky and conservative shifts can be 
treated as a special case of a more general 
phenomenon, the tendency of laboratory 
groups to reach more extreme judgments than 
individuals, 
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;NHANCEMENT OF DOMINANT ATTITUDES 
IN GROUP DISCUSSION ' 


DAVID G. MYERS ? anp GEORGE D. BISHOP 3 
Hope College 


Two studies were performed in an attempt to generalize the risky-cautious- 
shift phenomenon to attitudes and to test "information exchange" and “mutual 
reinforcement” explanations of discussion-produced shifts. Study I confirmed 
an assumption of the information-exchange hypothesis by extending to the 
attitude realm the finding that subjects perceive their peers as less in the valued 
direction than themselves. In Study Il, eight of these items were responded to 
before and after treatment by subjects in discussion, information-exchange, and 
control conditions. Only discussion-condition groups evidenced significant atti- 
tude shifts and polarization effects. Observations of tape recordings of the 
discussions indicated that a group's initial mean on an item was a significant 
predictor of the direction of the discussion rhetoric, which in turn was a 
predictor of group shifts. However, group initial mean did not directly predict 


group shiíts. 


A large amount of research in the past 
decade (cf. Kogan & Wallach, 1967) has in- 
vestigated the effects of group discussion on 
responses to questionnaire items on which 
subjects recommend the amounts of risk to be 
taken by hypothetical persons facing various 
life dilemmas (Kogan & Wallach, 1964, Ap- 
pendix E). It is now well documented that 
discussion tends to increase risk taking on 
items where the predominant initial response 
tendency is in the risky direction, and tends 
lo enhance caution on items which tend to 
elicit cautious initial responses (Myers, Mur- 
doch, & Smith, 1970; Stoner, 1968; Teger & 
Pruitt, 1967). 

One popular explanation of this phenome- 
non (Brown, 1965) is that discussion provides 
arguments mostly in support of the dominant 
value and, also, that mere information about 
others' decisions indicates to the average sub- 
ject that relative to others, he is not as 
strongly in the valued direction as he had 


1 This research was Supported by a grant (MH- 
15999-01) to the senior author from the National 
Institute of Mental Health and by a summer Col- 
lege Science Improvement Program grant (GY5335) 
to the junior author from the National Science 
Foundation. Appreciation is expressed to Neil Blahut 
for his assistance in collecting the data. 

? Requests for reprints should be sent to David 
Myers, Department of Psychology, Hope College, 
Holland, Michigan 49423. 

* Now at Yale University. 


supposed. This information is said io be a 
stimulus for his shift in the valued direction. 
The first suggestion, which could in behavioral 
terms be labeled the *mutual reinforcement? 
explanation, has not been directly investi- 
gated, except for Nordhgy’s (1962) observa- 
tion that the content of arguments in Stoner's 
(1962) original groups tended to favor the 
direction of group shift. The second explana- 
tion, "information exchange," has received 
more investigation. Stoner (1968), Wallach 
and Wing (1968), and Levinger and Schneider 
(1969) all confirmed that individuals do tend 
to see themselves as riskier than "fellow stu- 
dents" on items where a shift to risk has been 
found. However, other research (Madaras & 
Bem, 1968; Murdoch, Myers, & Smith, 1970; 
Teger & Pruitt, 1967; Wallach & Kogan, 
1965) has yielded conflicting data when mere 
information about other's decisions is ex- 
changed without opportunity for discussion. 
If Brown's “value enhancement” theory of 
group risk-taking effects is correct, it sug- 
Bests that such discussion effects might gen- 
eralize to other nonrisk response tendencies. 
The present research applies to the attitude 
realm the general hypothesis that discussion 
enhances the dominant value, if such exists. 
Evidence that this hypothesis might prove 
fruitful was provided by post hoc examina- 
tion of group-shift data obtained on discus- 
sion of 12 "risk-neutral" items used in a 
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y by Wallach, Kogan, and Burt (1968).* 
he mean shift scores on these 12 items were 
"correlated with the means of individual deci- 
sions of a small sample of Hope College stu- 
"dents. The resulting correlation of .71 sug- 
‘gested that although few significant shifts oc- 
curred, the direction and amount of mean 
shift on an item was predictable from the 
‘Mean of initial decisions, just as has been 
M observed with the life-dilemma items. Recent 
research by Moscovici and Zavalloni (1969), 
Doise (1969), and Myers and Bishop (1970) 
also supports the hypothesis. 
T Study I examined an assumption of the 
information-exchange explanation by asking 
“Subjects to respond to 12 attitude items and 
§ estimate the responses of fellow students. It 
was hypothesized that subjects would tend to 
‘See others as more neutral than themselves 
| (less in the valued direction) on these non- 
risk materials. On the basis of data provided 
by Study I, items were selected for a subse- 
quent study which investigated discussion 
effects and the extent to which information 
 €xchange and mutual reinforcement processes 
could predict any observed discussion effects. 


| Srupy I 
Method 


Subjecis. Participants were 118 introductory psy- 
“chology students at Hope College who completed 
T the questionnaire at the end of a regular class period. 

"Materials. From among a number of specially 
Teacted items, 6 were selected which pretesting sug- 
-Eésted would elicit initial responses deviating in a 
Predictable direction from neutral. Using the same 
criterion, 6 items from the Wallach et al. (1968) 
© honrisk “Opinion Questionnaire" were also included. 
- These 12 items had a common format as illustrated 
below, 


Mary N., a college junior, desires to spend her 

- Spring vacation on a biology field trip to another 
_, tegion of the country, Her parents, who have not 
^" seen her in several months, would like very much 
to have her home over vacation. Mary N. is un- 
Sure what to do. Since she is interested in becom- 
ing a biologist, she considers the field trip a rare 
9pportunity for firsthand research experience. Tt 
Would also help her test out her interest in becom- 
Ing a biologist. On the other hand, she knows that 
ler parents have been looking forward for some 
€ to having her home for a few days. Mary N. 


^ The authors are grateful to Michael Wallach for 
‘Providing these items and shift data. 
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loves her parents and also appreciates the fact 
that they are financing her education. 


Please put a check next to the ONE statement 

that best expresses your opinion: 

——I strongly favor that Mary N. go home for 
vacation. 

—I moderately favor that Mary N. go home for 
vacation, 

— slightly favor that Mary N, go home for 
vacation, 

— Undecided 

— slightly favor that Mary N. go on the bi- 
ology field trip. 

——1 moderately favor that Mary N. go on the 
biology field trip. 

— strongly favor that Mary N. go on the bi- 
ology field trip. 


Procedure. Using verbatim the instructions of 
Wallach et al. (1968), subjects completed the 12 
items and on the last page of the booklet found the 
following instructions.5: 


Now that you have finished making your own 
recommendations, go back over the Opinion Ques- 
tionnaire again for a different purpose. This time, 
please indicate what you think the majority of 
your fellow students here at Hope College would 
mark for each item. For each item, write an “M” 
in the space that corresponds to the choice that 
you believe the majority of your fellow students 
would make. You can put the *M" in the same 
space you check for yourself, or in a different 
space. 


Results and Discussion 


Responses on the 7-point scale were scored, 
ranging from —3 (strongly favor the first 
alternative) to +3 (strongly favor the second: 
alternative). Table 1 summarizes the mean 
data for all 12 items. The table indicates that 
on 9 of the 12 items, subjects did on the aver- 
age view their fellow students as significantly 
more neutral than themselves. These data ex- 
tend the results from similar studies cited 
above to nonrisk opinion materials, suggest- 
ing a general tendency to see oneself as more 
in the valued or dominant direction than one's 
fellows. 


Stupy II 


The present experiment investigated the 
effects of group discussion on responses to 
selected items from Study I. In addition to 
the main discussion condition, two other 


These instructions are those used by Ivan D. 
Steiner (personal communication, 1967). 


m" 
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TABLE 1 


Mean or Own Responses, MEAN Or RESPONSES 
ESTIMATED FOR FELLOW STUDENTS, AND MEAN 
DIFFERENCE BETWEEN OWN AND ESTIMATED 
FELLOW STUDENTS’ RESPONSES 


item no, Own Barisi | Difference 
3 —.70 —.08 | 63" 
4 -200 | -52 155" 
5 —1.60 —91 0t 
6 — 60 | 19 41% 
9 —1.10 —.38 729949 
10 -.79 | -ri —.35* 
1 122. eet 
2 A 2B ES —.20 
T 129 | 58 —.719*** 
8 1.56 | 1.19 —.37* 
i 1.74 1.34 | — Aet. 
12 1 00 ~1,134*** 


43 | | 


Note.—ps were determined by two-tailed £ tests of the 
deviations of the difference scores from zero. N = 11 


8, except 
for Items 5 aa 12, on which some subjects failed to respond. 
P» <.10, 
** p <.05. 
p <.01 
“et c .001 


treatments were used—information exchange 
and control. Study I extended the results of 
previous research in demonstrating the infor- 
mation-exchange assumption that individuals 
mistakenly perceive themselves as more in 
the valued direction than others, The present 
information-exchange condition provides the 
predicted necessary and sufficient condition 
for the shift phenomenon by merely allowing 
exchange of information about others’ deci- 
sions without opportunity for discussion. Con- 
trol subjects completed the instrument twice 
with irrelevant intervening activity to ascer- 
tain whether shifts occur in the absence of 
discussion and information-exchange proc- 
esses. 


Method 


Subjects. Volunteer participants were 160 intro- 
ductory psychology students at Hope College who 
signed up for the experiment, 5 for a given time, 
Fifteen discussion groups, 10 information-exchange 
groups, and 10 control groups resulted. In each con- 
dition, three-fifths of the groups were all female, and 
two-fifths were all male. Due to the occasional fail- 
ure of a volunteer to appear, 4 of the discussion and 
3 of the information exchange groups had 4 persons. 
Of the 10 control groups, 2 had 4 members and 3 
had 3 members. 

Materials. Since the purpose of the study was to 
explore the hypothesis that discussion enhances domi- 
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nant values as defined by the mean of initial re- 
sponses, items from Study I which elicited mean 
responses more extreme than —1 or +1 were em- 
ployed. None of the eight items meeting that cri- 
terion appeared to elicit initial responses so extreme 
as to preclude further shifts toward the extreme. As 
in Study I, the format was structured such that on 
half of the items the dominant tendency would be 
toward the first alternative and on the remaining 
halí, toward the second listed alternative. 

Procedure. Subjects were seated around a round 
table and requested to follow along as the experi- 
menter read the instructions for the Opinion Ques- 
tionnaire. After all were finished the booklets were 
collected and the procedures defining the particular 
experimental condition were administered. 

Discussion groups received new identical booklets 
with instructions paralleling those in Wallach and 
Kogan's (1965) discussion without consensus condi- 
tion. Subjects were asked to discuss each situation 
for 3 minutes. After 3 minutes had elapsed (or after 
a 20-second silent interval if discussion finished 
early), the experimenter interrupted and requested a 
new decision. 

Information-exchange subjects were instructed to 
Share information about initial decisions but with- 
out discussion. 


Now we would like to give you an opportunity to 
compare notes by telling each other your initial 
decisions as best as you can remember them. For 
each item I will ask one of you to start by simply 
reading the statement which seems to reflect how 
you feel. Then you can just continue clockwise 
around the table, each person reading his opinion. 
Please do not discuss or comment on the items; 
just read, your choice as it is written. After this 
exchange of information on each item I will ask 
you to again mark a decision. In making your final 
decision you in no way need to feel bound by 
your previous decisions. Okay, would you look at 
the first item [pause] and think about which 
Statement best reflects how you feel [20-second 
pause]. Why don't we just go around the table, 
starting this time\with you [name] and simply 
read your choice. . . . That's good. When making 
your decision now, remember that we're not 
interested in your prior opinion, but rather in just 
how you feel about the situation now. All right, 
go ahead and make your decision for the first 
situation, 


Control groups were given a brief anagram-solving 
task as neutral intervening activity and then asked 
io reconsider the situations without feeling bound 
by previous decisions. 


Results 


Responses were again converted to a +3 
to —3 scale. This time for simplicity of data 
presentation, +3 was scored as strongly favor- 
ing the valued alternative and —3 as strongly 


| 
| 
| 
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favoring the less popular alternative. Initial 
means on each item averaged across all 160 
subjects were calculated. Two items (9 and 
12 from Study I) did not elicit the predicted 
dominant tendency," and hence these items 
. were not included in the computation of com- 
posite shift scores across items. 
Enhancement of dominant value. For each 
group, the pre- and posttreatment means of 
decisions were computed. Table 2 presents 
these data and the mean shiít scores for each 
condition. It indicates that significant en- 
hancement of initial mean tendency occurred 
only in the discussion condition. However, an 
analysis of variance indicated that shift scores 
for the three conditions did not differ signifi- 
cantly from one another (F = 2.02, df= 


2/32). 


Tendency toward extremity. Moscovici and 
Zavalloni (1969) observed that groups were 
more extreme than individuals. In replication 
of their observation, Table 2 indicates that 
postdiscussion mean distances from zero were 
significantly greater than the mean of pre- 

"discussion distances from zero. It is possible 
that this “polarization effect” of discussion 
applies to groups but not to individuals. For 
example, a group with —1, —1, +1, and +3 
initial decisions and a +1 final consensus 
would be more extreme than its initial mean 
distance from zero of +.5. But the group 
product of +1 is mot more extreme than the 
mean initial absolute extremity score of indi- 
viduals (1.5). (Consistent with this analysis, 
Kogan & Wallach, 1966, did not observe 
group probability judgments to be more ex- 
treme than the mean of initial absolute ex- 
tremity scores.) Hence, the difference between 
the average of pre- and postdiscussion indi- 
vidual extremity scores was calculated for 
each group. Table 2 indicates this difference 
to be small, but also significant, in the discus- 
sion condition. 


the arbitrary criterion of 1.00. No significant shift 
on these items was observed in any condition. One 
other item (which also failed to show significant 
shift) had an initial mean of less than 1.00 for dis- 
cussion subject, but was retained in Table 2, since 
‘cross all conditions the initial mean was greater 
than 100, 


The initial means of .31 and .60 were less than 


ae ae 
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TABLE 2 


Group MEAN SCORES AND GROUP AND INDIVIDUAL 
EXTREMITY SCORES BEFORE AND 
AFTER TREATMENT 


“onditi Ja | Differ- 
Condition | N* | Before | After eno t 
Discussion 
Group M {15| 143 | 1.66 34 |2.06* 
Group extremity [15] 147 1.76 29 |273 
Individual extremity | 15| 2.10 | 2.24 13 | 2.25¢¢ 
Information-exchange | 
Group M. 10| 1.52 | 1.51] —01 | 21 
Group extremity 10| 1,56 1.55 —01 23 
3 Individual extremity | 10 | 2.06 1.98 —.09 1.58 
Control 
Group M 10| 163 1.68 404 Bl 
Group extremity 10 | 1.63 1.60 06 70 
Individual extremity | 10 | 2.08 | 2.13 06 |154 


aN isthe number of groups. 
*$ «.06 fero ed 
** p < .05 (two-tailed), 


Mutual reinforcement effects. The discus- 
sion data were further examined to see if 
mutual reinforcement processes might help 
predict shift scores. To explore mutual rein- 
forcement effects, it was assumed that the 
items elicited two conflicting response tenden- 
cies and that the expression of an argument 
favoring one alternative constituted a rein- 
forcement of that response tendency. It was 
predicted that arguments would tend to favor 
the dominant (valued) tendency. (See Byrne, 
1969, for evidence that hearing an attitude 
statement similar to one's own attitude ten- 
dency does constitute a reinforcement.) 

Audiotapes were made of 14 of the 15 
discussion groups (through a ceiling micro- 
phone pointed out to subjects before the dis- 
cussion). For each discussion, two observers 
independently counted the number of argu- 
ments in support of each alternative, and the 
proportion of arguments favoring the domi- 
nant alternative was computed from this. 
Across all items and groups, the two observ- 
ers correlated .77 in their observation of these 
proportions, so their observations were aver- 
aged. The mean proportion of arguments (re- 
inforcements) favoring the dominant alterna- 
tive was .76, a significant deviation from .50 
(t= 10.87, p< .001). The proportion for 
each item was then correlated across 84 group 
discussions (6 items X 14 groups) with the 
group initial means and group shifts. Results 
(r= 49 and .37, respectively, V = 84, p< 
.001) indicated that the group initial mean 
was a significant predictor of the number of 
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reinforcements in support of the dominant 
alternative which was, in turn, a predictor of 
the magnitude of shift in the dominant direc- 
tion. However, a group's initial mean on an 
item did not predict the group's shift on that 
item (r — —.12). This latter finding has also 
been observed on the risk-taking items (My- 
ers & Arenson, 1970), and may be due in 
part to regression and ceiling effects—a group 
already near the extreme on an item cannot 
shift much further toward the extreme. In 
summary, although group initial means did 
not directly predict group shift scores, the ini- 
tial means did significantly predict the direc- 
tion of the discussion rhetoric, and this was a 
predictor of shift scores. 


Discussion 


The present research explored the hypothe- 
sis that discussion enhances the dominant 
value on attitude materials as well as on life- 
dilemma risk problems. A second purpose was 
to examine the mutual reinforcement and 
information-exchange explanations of any 
such discussion effect. 

Study II revealed marginally significant 
discussion enhancement of mean initial ten- 
dencies. This is comparable to results ob- 
tained by Moscovici and Zavalloni (1969), 
who also used a 7-point Likert scale (for pre- 
and postdiscussion attitude ratings of French 
students concerning policies of General 
deGaulle). Moscovici and Zavalloni observed 
an average preconsensus-consensus shift of 
-28 in comparison with the pre-post discussion 
shift of .24 in the present research. 

Study IT also replicated Moscovici and 
Zavalloni's (1969) observation that discussion 
"results in a polarization of responses [p. 
134]" when one compares the extremity of 
group pre- and postdiscussion mean distances 
from zero. However, this polarization effect 
was of small magnitude when the mean abso- 
lute extremity of individuals’ pre- and post- 
discussion responses were compared. 

Why does this small, but apparently reli- 
able, discussion effect occur? Study I con- 
firmed the information-exchange assumption 
that people tend to see themselves as more 
strongly in the valued direction than others. 
Study II therefore provided the necessary 
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and sufficient conditions for information ex- 
change by having subjects report initial re 
sponses without further discussion. Contrary 
to the information-exchange explanation, no 
shift was observed. 

The discussion-based mutual reinforce 
ment hypothesis was examined by content 
analysis of tape recordings. As predicted, 
arguments did tend to favor the predominant 
response tendency. Also, the extent to which 
arguments favored the dominant alternative 
did significantly predict the group shiít 
scores, These findings support Brown's (1965) 
and Vinokur’s (1971) suggestion that shift 
effects are due partly to an enhancement of 
the prevailing value by the discussion argu- 
ments. 
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To investigate the hypothesis that the risky shift is a function of a cultural 
value favoring risk, group and individual decisions of Ugandan and United 
States students were compared on two tasks: the choice dilemmas devised by 
Kogan and Wallach and the actual consequence situation of Wallach, Kogan, 
and Bem. On both tasks the individual decisions of the Ugandans were more 
conservative, supporting the premise that the Ugandans are less favorably dis- 
posed to risk taking than are the Americans. For the Ugandan sample, group 
discussions produced no evidence of a risky shift. For the American sample, 
there was evidence of a risky shiít, but not to the extent reported by other 
investigators. The results are interpreted as consistent with the cultural value 
hypothesis; some reservations are discussed. 


During the past decade many investigators 
have demonstrated that following group dis- 
cussion, both individual and group decisions 
differ significantly from the average of prior 
individual decisions. Among the hypotheses 
advanced to explain these shifts (cf. Brown, 
1965; Kogan & Wallach, 1967), one, the 
cultural value hypothesis, has recently re- 
ceived considerable attention. Most fully 
elaborated by Brown, the approach appears 
to rest on three assumptions: (a) Individual 
decisions are influenced by and are generally 
consistent with widely held values; (5) indi- 
viduals consider their own behavior to be 
more consistent with widely held values than 
that of their peers; (c) group discussion, or 
any other source of information that others 
are as consistent or more consistent with a 


1 This research was supported by the Educational 
Testing Service, The authors are indebted to Sidney 
J. Arenson, Lawrence Stricker, and William Ward for 
their comments on the manuscript. 
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ing the tenure of a National Institute of Mental 
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prints should be sent to Julia Carlson Rosenblatt, 
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keepsie, New York 12601. i 

3 This research was conducted during the tenure 
of a Social Science Research Council Foreign Area 
Fellowship and affiliation with Makerere University 
College, Kampala, Uganda. Appreciation is expressed 
to the government of Uganda for permission to con- 
duct the experiment in the secondary schools, 


cultural value, will result in greater conform- 
ity to that value. Thus, the risky'shift should 
occur when the cultural value favors ris]: and 
the subjects learn that they are not riskier 
than the others in the group. Conversely, a 
cautious shift should occur when the value 
favors caution. Several lines of evidence sup- 
port this hypothesis (e.g., Hinds, 1962; Lev- 
inger & Schneider, 1969; Madaras & Bem, 
1968; Myers, 1967; Teger & Pruitt, 1967; 
Stoner, 1968; Vinokur, 1969; Wallach & 
Wing, 1968). 

To date, however, no investigations have 
been conducted among cultural groups who 
place less of a positive value on riskiness. Ex- 
tensive anecdotal evidence and personal ob- 
servation on the part of one experimenter sug- 
gested that many African societies do not 
evaluate risk positively. Consequently, it was 
assumed that if this assertion could be docu- 
mented, further evidence concerning the cul- 
tural value hypothesis | could be obtained by 
exploratory experimentation in an African 
cultural setting. 

The most general hypothesis was initially 


entertained: If cultural values determine the : 


direction of the postdiscussion shift, and if 
the African subjects are more conservative 
than their American counterparts, the risky 
shift will be reduced or eliminated accord- 
ingly, Two decision situations were employed: 
the Kogan and Wallach (1964) choice-dilem- 
mas problems and a series of actual conse- 
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quence problems similar to those employed 
by Wallach, Kogan, and Bem (1964), 


METHOD 
Subjects 

Uganda. Two hundred and seventy-nine male, 
African, Form VI students (the final year of sec- 
ondary school and the thirteenth year of formal 
education) in the government-supported secondary 
schools constituted the sample.* Subjects were ac- 
quired in the following manner: There are 17 govern- 
ment schools in the country with Form VI students, 
Ten of these are in the Central Region of the coun- 
try, four of which are in the capital city of Kampala, 
Four are in the Eastern Region, one in the Northern 
Region, and two in the Western Region, For par- 
ticipation in the experiment, two of the six Central 
Schools, one of the four city schools, and two of the 
four Eastern Schools were selected. All Form VI 
- students in these schools were asked to participate 

by the experimenter and the Headmaster of each 

School. Approximately 296 declined. All. others (122 

Students who were female or of Asian origin were 
. $50 tested, but their data were mot considered) 
- Participated in at least one of the two experimental 
sessions. English is the only language of instruction 
for these students after the sixth year of schooling 
and was used exclusively in the experiment. 

United States. One hundred and eight male volun- 
leer students were recruited from fraternities and 
_ other campus organizations at Carthage College in 
Kenosha, Wisconsin,5 Organizations from which sub- 
Jects were recruited were paid $5 for each four-man 
Eroup that completed both sessions of the experiment, 


Procedure 


The subjects met in groups of four, except in 
Cases where one or more members of the scheduled 
group, without notice, did not appear for the testing 
Session. This occurred nine times for the Ugandan 
Sample and once for the American sample. In each 
group, subjects were previously acquainted with each 
other, having come from the same class (in no case 
larger than 50) in the Ugandan sample and the 
Same organization in the United States sample. 
Testing consisted of two sessions, separated by a 
. Binimum. of 1 week and a maximum of 10 days. 
Group composition was constant, except in the cases 
(described above) where a person who had partici- 
Pated in the first session did not appear for the 
Second. For the American sample, the Choice- 
E———-.. 


*An ethnic group breakdown of the African stu- 
dents is available on request. Ns were too small to 
Profit from analyzing ethnic group differences for 
* present report, 

>. Appreciation is expressed to Carolyn Arnold, 
Penney Peterson, and Bonnie Engle, who collected 
‘Me data, and to the Department of Psychology and 
Administration of Carthage College, who made it 
Possible to collect data on campus. 
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TABLE 1 
Cuorce-Ditemmas SITUATIONS 


———————MÁda( € ——— 


Central figures Decision area 
erase Ei 
1. Young engineer Company affiliation 
2. Middle-age accountant Heart operation 
3. Prospective graduate student University affiliation 
4. Chess player Playing strategy 
5. Successful businessman Political office 
6. Young couple Marriage 
7. Young father Stock investment 
8. Corporation president Foreign investment 
9. College senior, pianist Career choice 
10. Prisoner of war Escape 
11 


- Young physicist Research problem 


Dilemmas (CD) Questionnaire was administered the 
first week and the actual- or choice-consequence 
(CC) task administered the second week. For the 
Ugandan subjects, half were administered the tasks 
in the CD, CC order and the other half in the CC, 
CD order. 

Choice dilemmas, The Kogan and Wallach (1964) 
items and procedure were used. Of the 12 items, 1, 
involving the decision of a quarterback in a football 
game, was eliminated from the present experiment, 
since it was the only one describing a situation not 
understandable to the African subjects. The 11 items 
employed, classified with respect to the central 
figure and decision area, are presented in Table 1. 

Approximately one-third of the subjects, although 
meeting as a group, were required to make their 
decisions individually and without consultation with 
other group members. After making decisions for 
all 11 dilemmas, subjects were asked to review each 
one, to briefly write the reasons for their choices, 
and finally, to estimate the choice made by a ma- 
jority of their peers (described for the Ugandan 
subjects as all other Form VI students in the country 
and for American subjects as all other college stu- 
dents in Wisconsin). This condition will be hereafter 
referred to as CD-individual (CD-I). 

The remaining two-thirds of the subjects were re- 
quired to discuss each dilemma until they reached a 
consensus about the lowest probability that could be 
accepted for that situation. This condition will be 
referred to as CD-group (CD-G). 

Since group discussion took longer than individual 
choice, the number of dilemmas considered by the 
CD-I and CD-G groups differed. All CD-I groups 
completed all 11 dilemmas, All African CD-G groups 
completed a minimum of 4 items, with only 2 of 41 
groups completing more than 4. The large majority 
of African groups were able to complete 4 within 1 
hour, but in those cases where it took longer the 
group was required to continue until 4 problems 
had been completed. The American CD-G groups 
completed an average of 6 dilemmas in an hour. 
For the American subjects a time limit of 15 minutes 
per item was set, so that a minimum of 4 items 
could be completed in the 1-hour session. If group 
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consensus was not reached within the 15-minute 
period, individuals recorded their own decision and 
an average was taken as the group decision. This 
occurred only 6 times in 109 discussions, The items 
discussed by the CD-G groups and the order in 
which they were discussed were determined by a 
random-numbers table. 

Choice consequences. This task was similar to that 
used by Wallach et al. (1964), using only mathe- 
matics and sentence-completion problems, For the 
American subjects, the item pool for each type was 
identical to those used by Wallach et al. They con- 
sisted of items from the College Entrance Examina- 
tion Board examinations. For the Ugandan subjects, 
items were drawn from this same pool and from the 
Cooperative School and College Ability Tests.¢ The 
difficulty levels for the African subjects were deter- 
mined by pretesting a sample of items on a sample 
of 103 first-year students at Makerere University 
College enrolled in an introductory sociology course. 

Subjects were informed that they would be able 
to earn some extra money by solving four problems, 
two mathematics and two sentence completion. The 
amount of money they could earn would depend 
upon the difficulty of the problems they chose and 
the correctness of the solutions: solution of difficult 
problems yielded higher pay. Questions represented 
nine difficulty levels, ranging from a 10% to a 90% 
probability of failure. Prize money was set so that 
the expected value of winning (amount of prize 
multiplied by probability of success) was the same 
at each difficulty level. In an attempt to establish 
reasonably comparable incentives for both samples, 
rewards were determined in terms of presumed pur- 
Chasing power, For the American subjects the actual 
reward, in cents, for a correct solution to a problem 
at the nine difficulty levels was: 11, 12, 14, 16, 20, 
25, 33, 50, 100. For the Ugandian subjects the com- 
parable United States currency equivalents were 5.6, 
63, 7.0, 84, 9.0, 12.6, 16.8, 24.5, and 49.0, 

Before any choices were made, subjects were per- 
mitted practice questions at the 10%, 35%, 60%, 
and 85% difficulty levels for each type of problem. 
They were then given the correct answers to each 
Practice problem, and any questions of their own 
were answered. Each subject was then asked to select 
the difficulty level for one mathematics and one 
sentence-completion problem, He Was required to 
make that choice individually and without consulta- 
tion. Following these individual choices, treatment 
differences (randomly assigned to groups) were intro- 
duced. Control subjects simply repeated the indi- 
vidual-choice procedure for the remaining two items. 
Group-choice subjects were asked to discuss and 
make a group decision as to the difficulty level to 
be tried by each member of the group on each of 
the last two problems. Group-representative subjects 


were asked to make a group decision, as in the 
group-choice condition, and, in addition, to select 
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one member of the group to solve cach problem for 
the group. These subjects were iníormed that the 
entire group would be rewarded only if their repre- 
sentative was correct. Following the choice of the 
difficulty levels for the last two problems, the groups 
were given the problems to solve 


RESULTS 
Choice Dilemmas 


The basic data were the probability levels 
chosen by each subject or group, that is, the 
odds of success, based on chances in 10, that 
one would demand before advocating risk; 
the higher the score, the more conservative 
the decision. For the CD-I condition, mean 
probability levels for each item were obtained 
as the mean of the group means. The means 
for both CD-I and CD-G conditions are pre- 
sented in Table 2. 

For the Ugandan sample the 24 CD-I 
groups initially consisted of 21 four-man 
groups and 3 three-man groups. In 5 of the 
four-man groups, however, one subject was 
eliminated for misinterpreting the task, leav- 
ing 16 four-man groups and 8 three-man 
groups. Only those who indicated (in their 
written reasons for the probability choices) 
that they ue atng actual odds of suc- 
cess rather than setting probability levels for 
risk taking were judged to have misinterpreted 
the task. Of the 9 United States CD-I groups, 
7 were initially four-man groups, but elimi- 
nation for misunderstanding the task as 
above produced 6 four-man and 3 three-man 
groups. 

First, the United States and Ugandan CD-I 
decisions were compared. As expected, the 
mean of the Ugandan decisions was signifi- 
cantly more conservative than that of the 
United States sample. The overall mean for 
the 11 items was 6.97 for Uganda and 5.80 
for the United States (1 = 2.21, df = 20, p< 
.05). As reported in Table 2, 10 of the indi- 
vidual item comparisons showed less risk ad- 
vocated by the Ugandans, 6 significantly so. 
On only 1 item, the foreign-investment di- 
lemma, was there an opposite tendency. Thus, 


using initial individual decisions as a defini- - 


tion of the strength of the cultural value on 
tisk, the data support the notion that the 
Ugandans place a lower positive value on 
riskiness, 


CULTURAL VALUES AND THE RISKY Suirr 395 
TABLE 2 
^icE-DiteMMAS Irem MEANS FOR INDIVIDUAL, GROUP, AND CULTURAL VatvE Cmoicrs 
BY UGANDANS AND AMERICANS, AND TESTS OF DIFFERENCES 
Choice-dilemmas item 
= = 
E 3 4 5 6 7 8 9 10 "n 
| I 

RR 6.01 6.55 | 706 | 8.54 7.83 | 734 | 7.30 $.35 | 5,87 
6.47 5.44 6.08 7.12 8.67 8.65 8.50 7.21 $.27 | $12 

743 6.79 | 6.24 6.71 7.47 7.23 7.36 | 743 6.14 | 696 
6.83 440 | 4.44 $99 | 7.45 5.79 847 | 6.57 3.84 | 456 
6.11 4.50 3,18 5.30 7.67 | 8.30 618 4.40 4.06 4.88 
7.21 588 | 4.87 6.15 7.18 5.93 7.89 | 6.65 4.91 548 

| 

78 2.82%} 3.06% | 157 2,51* 348* | —1.82 1.24 1.86*| 1.58 
| —1.06 | —242* | 1.08 E 1875 | 158 | —06 | —.33 | —1.96* | —3.80* 
—146 | —5.31* | —71 —.60 1.07 -41 2.9*| —.25 | —2.47* | -2:20* 
1.19 89 .60 -M | —31 | —L35 | -200*|  .11 .09 E 
105 | —10 | 1.35 7 | —36 | -381*| 223*| 317° | —23 | —27 


` 1 for a description of items. 

individual decisions; » — 24 groups. 

© group decisions; xs range from 13 to 17, 

to the “majority of peers decision” estimate: x = 


c from 8 to 12, 


* value indicates a risky shift. 
* f < .05, two-tailed. 


Next, CD-I decisions were compared with 
the “majority of peers” or cultural value es- 
timates (hereafter referred to as CD-CV) 
Within each sample. For the United States 
data, the CD-CV decisions, averaged over the 
11 items, were significantly more conservative 
(D = 35, t = 2.20, df = 8, p < .05), as pre- 
dicted by the cultural value hypothesis, The 
individual item comparisons (see Table 2) 
also revealed the conservative trend.’ The 
majority (9 of 11) of the CD-CV decisions 
Were more conservative than the CD-I deci- 
sions, although only 3 were significant. For 
only 2 items, the marriage dilemma and the 
foreign-investment dilemma, were the differ- 
ences in the opposite direction (only 1 was 
Significant), This replicates Levinger and 
Schneider ( 1969), and these items are the 2 


"The results reported in Table 2 use groups as the 
Unit for the sake of consistency, The Uganda versus 
United States CD-I, Uganda CD-I versus CD-CV, 
and United States CD-I versus CD-CV comparisons 
Were made also using individuals as the unit. The 
Same pattern of significant and nonsignificant re- 
ts was found with one exception: The United 


States CD-I versus CD-CV comparison for Item 11, 
Senificant by groups, fails to reach significance by 
ndividuals, 


A. 


which have consistently failed to produce the 
risky shift. 

For the Ugandan sample, the CD-I versus 
CD-CV comparison over all 11 items was not 
significant (D —.10, t= 52, df = 23, p> 
.10). Of the 11 separate item comparisons 
(see Table 2), 6 produced a more conserva- 
tive CD-CV choice. Three of these, the same 
3 as for the United States sample, were sig- 
nificant. Of the other 5 items, the CD-CV 
choice was more risky, but only 1 of these, 
the marriage dilemma, was significant. Direct 
comparison between the Ugandan and United 
States samples revealed that the CD-I versus 
CD-CV differences were greater for the 
United States sample on 7 of the 11 items, 
suggesting a greater positive value on riski- 
ness. For only 2 of these, however, was the 
difference statistically significant.* 

Finally, the effect of group discussion was 
assessed. The CD-I and CD-G means for the 


5 These comparisons were made analyzing both the 
group and the individual as the unit of analysis. 
Items 3 and 6 resulted in significantly more con- 
servative differences (p < .05) in the group analyses, 
and Items 3, 4, and 6 approached significance (p < 
10) in the individual analyses. 
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TABLE 3 

Mean Dirricutry LEVELS CHOSEN ny UGANDAN AND 
Unrrep States GROUPS ON THE CHOICE- 

Consequence Items 


Group Uganda U. S. 
Control 
N | 23 8 
Math 1* | 6.82 8.12 
Math 2 7.02 8.19 
SCP 1 7.58 847 
SC 2 | 7.59 8.62 
Group choice | | 
N 21 8 
Math 1 6.99 7.65 
Math 2 7.00 8.38 
SC 1 7.62 7.97 
Sc2 8.05 8.38 
Group representative 
N 24 7 
Math 1 6.90 7.28 
Math 2 7.25 9.00 
SC1 7.40 7.61 
SC 2 7.70 8.57 


* Number refers to order of choice. All first choices were in- 
dividual choices. Treatments were introduced before the 
second choices. 

^ Sentence completion. 


Ugandan sample were virtually identical: 
6.97 and 7.04, respectively. Analyzing by 
item, for no item was there a significant risky 
shift, but two items resulted in significant 
conservative shifts (see Table 2). The rank- 
order correlation between initial risk level 
and amount of shift was 40, not significant. 
More surprising, the United States overall 
means of 5.80 and 5.27 for CD-I and CD-G, 
respectively, were not significantly different. 
Table 2 shows three significant risky shifts 
and one significant conservative shift. The 
rank-order correlation between initial risk 
levels and shift was —.32, not significant. 


Choice Consequences 


The basic data were the difficulty levels 


chosen by the subject or the group, expressed .- 


as the number of times out of 10 the problem 
was failed by the normative population. 
Hence, a higher score reflects a more difficult 
or risky choice. For the individual decisions, 
group scores were formed by computing the 
mean for the group. The mean difficulty lev- 
els chosen are reported in Table 3. Of the 68 
Ugandan groups, 62 were four-man and 6 
were three-man groups. All 23 United States 
groups were four-man groups. 
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As a check on the comparability of the 
treatment-condition groups, the individual 
initial choices (prior to the introduction of 
the treatments) were analyzed for each sam- 
ple. In neither sample was there an interac- 
tion between item type and condition nor a 
main effect for conditions. For the Ugandan 
sample there was a significant item difference 
(F—14.50, df —1/65, p< 01). Subjects 
chose more diíficult sentence-completion items 
than mathematics items. 

Since the two samples had received prac- 
tice on problems of presumably equivalent 
difficulties, it was possible to compare the 
initial choices of the two samples. A £ of 3.71 
for the mathematics items and 2.19 for the 
sentence-completion items (df — 89 in each 
case) were both significant, indicating that 
the Ugandan subjects were initially more 
conservative in their decisions. These differ- 
ences cannot be ive | to differences in 
practice-item difficulties, although analysis of 
the practice data revealed substantial devia- 
tions from the expected difficulty levels of 
90%, 65%, 40%, and 15% passing. For the 
mathematics items the percentages passing 
the four items were 96%, 80%, 42%, and 
26% for the United States sample and 93%, 
79%, 83%, and 55% for the Ugandan sam- 
ple. Since the practice items for the Ugandans 
were actually easier than they had been led 
to believe, the impact would have been to 
raise to the subsequent difficulty levels chosen 
and thereby reducing the differences between 
the two samples. 

For the SC items the percentages passing 
were 97%, 84%, 85%, and 37% for the 
United States and 95%, 59%, 73%, and 
30% for the Ugandan subjects, compared to 
the expected 90%, 65%, 40%, and 15%. In 
this case the 40% and 15% items were sub- 
stantially easier than expected for both sam- 
ples and should not have had a differential 
impact on the subsequent decisions made by 
the two samples. 

Because there was no initial interaction be- 
tween item type and treatment conditions, 
subsequent analyses were based on shift scores 
averaged over the two item types. In order to 
create proportional subgroups across condi- 
tions for an analysis of variance, 1 United 
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States group and 2 Ugandan groups from the 
control condition, 1 United States group from 
the group-choice condition, and 3 Ugandan 
groups from the group representative condi- 
tion were randomly eliminated, leaving 7 
United States and 21 Ugandan groups. The 
average shift scores are reported in Table 4. 
The interaction F of 2.21 for Samples x 
Treatments was not significant (p > .10). 
Both main effects, for treatments (F = 3.29, 
df= 2/78, p < .05) and for samples (F = 
6.06, df = 1/78, p< 01), were significant. 
Within the Ugandan sample none of the 
shifts differed significantly from one another. 
Comparison of the initial individual decisions 
with the later, posttreatment decisions within 
each condition also showed the former to be 
not significantly different from the latter. 

For the United States sample, on the other 
hand, the amount of shift for both the group- 
choice and group-representatiye conditions 
Was greater than that of the control condition 
(ts = 1.90 and 3.07, respectively, df = 70, p 
< .05), but the group-choice and group-rep- 


` resentative shifts did not differ from one an- 


other. Within conditions, as shown in Table 
4, only the group-representative shift differed 
significantly from its own control. 

Between samples the only significant differ- 
ence in the amount of shift occurred in the 
8roup-representative condition. The shift was 
greater for the United States than the Ugan- 
dan sample (£223.03, df — 78, 2 « .05). 
Thus, group discussion and decision making 
Produced a risky shiít for only the United 
States sample, and even for that sample the 
Shift in the group-choice condition was equiv- 
ocal in that it differed from the control con- 
dition but not its own control and was not 
Significantly greater than that of the Ugandan 
Sroup-choice condition. 


Discussion 


The findings obtained from the two experi- 
Mental tasks were essentially equivalent. Ini- 
tial decisions of the Ugandan subjects were 
consistently more conservative than those of 
the American subjects. This upholds the ini- 
tia] assumption of the investigation, that 
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TABLE 4 
MEAN .Surrrs IN Risk-TaKiNG LEVELS FOR THE 
UGANDAN AND UNITED STATES SAMPLES, 
AVERAGED OVER THE Two CHOICE- 
CONSEQUENCE Item Types 


Group | Uganda w = 21) | u.s. (N =7) 
Control | 10 42 
Group choice 22 59 


Group representative 


* p < 01, two-tailed test. 


there was less of a cultural value favoring 
risk within the Ugandan sample. The major- 
ity of peers estimates also support the as- 
sumption. Although the direct comparisons of 
United States and Ugandan CD-I versus CD- 
CV differences were significant for only 2 of 
the 11 items, there was a clear trend for a 
greater conservatism in the United States es- 
timates than in the Ugandan. In addition, 
within-sample comparisons resulted in signifi- 


cantly more conservative CV decisions for 


only the United States sample. 

The evidence regarding the effects of group 
discussion on decision making was somewhat 
equivocal but supportive of the cultural value 
hypothesis. As predicted, the Ugandans did 
not make riskier decisions in the group dis- 
cussion conditions. In the CD task the only 
significant CD-I versus CD-G differences were 
more conservative group decisions, and in the 
CC task neither the group-choice nor the 
group-representative conditions produced a 
risky shift. These findings must be viewed 
with some caution, however, since the compa- 
rable findings for the United States sample 
did not entirely replicate the results of most 
previous investigations which report a risky 
shift as a result of group discussion. In the 
CD portion of the experiment, there was 
significant risky shift on only 3 of the 11 
items. The previously reported positive rela- 
tionship between level of initial risk and 
amount of shift was not obtained. In the CC 
situation, the effect of the manipulation in- 
troduced in the group-choice condition was 
only marginally effective in producing a risky 
shift. 

Several considerations are relevant to this 
partial failure to replicate. First, the design 
of the present experiment resulted in compar- 
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ing CD-I and CD-G decisions which were 
derived from different groups of subjects, 
whereas the more common approach has been 
to employ a within-subjects design in which 
respondents first make individual decisions 
and then participate in group discussion. 
Whether an initial individual decision is a 
prerequisite to a significant postdiscussion 
shift, and why this should be the case, is un- 
clear, but there is some evidence (Davis, 
Hoppe, & Hornseth, 1966; Hunt & Rowe, 
1960; Nordhgy, 1962) that there is a smaller 
shift in a between-subjects design. 

Of course, one function of employing a 
between-subjects design, given the same num- 
ber of groups, is to reduce the power of the 
statistical test. Thus, failure to find a signifi- 
cant difference could be a statistical artifact. 
Coupled with the rather small number of 
groups in the United States sample (ranging 
from 7 to 12 depending upon the condition), 
this possibility is plausible. Therefore, the 
amount of shift obtained in the present in- 
vestigation was compared with that obtained 
by others. For the CD items Wallach, Kogan, 
and Bem (1962) obtained an average shift 
per item of .75 for males over the 11 items 
employed in this study. Wallach and Kogan 
(1965) reported an average shift of 32 for 
males over all 12 items (individual item data 
were not reported). Teger and Pruitt (1967) 
found an average shift of .53 for male groups 
over the 11 items. The procedures in each of 
these investigations were very similar to that 
of the investigation reported here, with the 
major exception of the use of the within-sub- 
jects design. The average shift for the United 
States groups of this investigation was ora 
value well within the range of that obtained 
by others. It should be noted that Teger and 
Pruitt also tested the amount of shift against 
a control group (a between-subjects analysis), 
but only for the 10 items predicted to shift 
toward risk. It is apparent that such an analy- 
sis of the present data would not have pro- 
duced a significant overall shift, since Item 4 
(the stocks-investment dilemma) has typi- 
cally shifted toward risk, but produced a 
large and significant shift toward caution 
here. In general, and with the notable excep- 
tion of the stocks dilemma, item-by-item com- 
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parison with the Wallach et al. (1964) data 
and the Teger and Pruitt data also reveal 
highly similar initial decision levels and 


shifts. Thus, it would appear that the present 
data are not atypical of other findings and 
should not be definitely interpreted as a 


failure to replicate. 
A similar comparison is possible with the 
CC task. Extracting from Wallach et al, 


(1964) the data for the mathematics and ' 


sentence-completion item types, the average 
shift was .80 for the control condition, 7.45 
for the group-choice condition, and 14.35 for 
the group representative condition. These dif- 
ferences compare closely with the 1.09, 5.62, 
and 13.39, respectively, of the present data 
and again suggest that the equivocal replica- 
tion in the group-choice condition should be 
interpreted cautiously. : 

Further reservations concern the appro- 
priateness or relevance of the test situations 
(particularly the CD items) for the Ugandan 
sample, and the appropriateness of the sec- 
ondary school sample for the cross-cultural 
comparison. By the time the subject has 
reached the final year of secondary schooling, 
all of the hypothetical decision areas of the 
dilemmas are plausible and, in most cases, po- 
tentially encounterable for him. Even so, the 
typical Ugandan is less likely than the aver- 
age United States respondent to encounter 
most of the situations. The effect of such 
lowered probability is unclear. It could be ra- 
tionally argued that it would both raise and 
lower risk levels. 

Even if the CD situations were equally en- 
counterable for Ugandans and Americans, the 
Specific items may evoke different values that 
are independent of risk. This is exemplified by 
one item in particular, the one in which a 
Corporation president must decide whether or 
not to build a plant in a developing country 
which has had a recent history of political 
instability. In the United States this item has 
usually produced relatively conservative indi- 
vidual decisions and has not always produced 
a risky shift. If there were any item for which 
one would expect the Ugandan students to 
have an initially different perspective, it 
would be this item. In fact, this was the only 
item on which the Ugandans initially made a 


| 
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significantly more risky decision. It is interest- 
ing to note that the shift on this item was 
risky for the United States sample and con- 
servative for the Ugandan sample. 

With regard to the appropriateness of the 
Ugandan sample, it should be recognized that 
to generalize from a sample of secondary 
school students to the Ugandan population is 
entirely unwarranted, since this group repre- 
sents a highly selected segment of that popu- 
lation. The rationale for employing the school 
students in this investigation was simple: to 
begin the cross-cultural exploration of the 
risky-shift phenomenon using situations and 
stimuli for which there was already a litera- 
ture base and to select a sample from the 
"other culture" which was as similar as possi- 
ble in age and education. Future research 
must include other more typical samples and 
other more culturally relevant decision situ- 
ations. In general, however, the results of this 
exploratory cross-cultural investigation must 
be interpreted as supporting a. cultural value 
explanation for postdiscussion shifts in de- 
Cision behavior. 
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The aífective-bonds hypothesis was formulated as part of the diffusion-oí- 
responsibility explanation of the risky shift. It predicts that affective bonds 
among group members diminish personal responsibility and, therefore, increase 
risk taking by groups. Counter to this hypothesis, high cohesiveness reduced 
risky shifts, These opposite findings were interpreted as nevertheless consistent 
with a responsibility-diffusion explanation. That is, instead of increasing risk 
taking by diminishing members’ sense of personal responsibility for a risky 
decision, strong affective bonds may make individuals less willing to blame 
their co-members for the negative consequences of an advocated action (i.e, 
increase personal responsibility) and, therefore, inhibit the risky shift. Addi- 


tional analyses impugned an alternative 


interpretation of the obtained cohesive- 


ness effect in terms of differential influence processes. 


When groups make decisions about risk- 
oriented problems for a fictitious person, they 
typically decide upon a riskier solution than 
they did when each individual member previ- 
ously considered the same problems (cf. Dion, 
Baron, & Miller, 1970). A frequent expla- 
nation of this phenomenon emphasizes dif- 
fusion of responsibility (Bem, Wallach, & 
Kogan, 1965; Wallach, Kogan, & Bem, 1962, 
1964). Its proponents first suggested that a 
unanimous group decision reduces each mem- 
ber’s feelings of personal responsibility for the 
potentially negative consequences of a risky 
decision and Consequently increases risk tak- 
ing. That is, a joint (unanimous) decision 
Psychologically implies common fate or shared 
responsibility. However, this emphasis no 
longer remains tenable, in that unanimous 
consensus for a group decision does not pro- 
duce a stronger risky shift than mere discus- 
sion without consensus (Wallach & Kogan, 
1965). Since Wallach and Kogan’s outcome 
Suggests that the strong sense of shared re- 
sponsibility implicit from group consensus 
was not critical, they searched for some other 


1 This research was facilitated by a National In- 
stitutes of Health Predoctoral Fellowship to the first 
author as well as a National Institutes of Health 
Special Fellowship and sabbatical leave from the 
University of Minnesota to the second author, 

? Requests for reprints should be sent to Kenneth 
Dion, who is now at the Department of Psychology, 
University of Toronto, Ontario, Toronto, Canada, 

? Now at the University of Southern California, 
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characteristic of group interaction that might 
diffuse responsibility. Therefore, they pro- 
posed the “affective-bonds” hypothesis: it is 
the affective bonds formed among group mem- 
bers during discussion that mitigate their 
sense of personal responsibility and increase 
risk taking by groups. 

Although several studies (Kogan & Wal- 
lach, 1967; Pruitt & Teger, 1969: Wallach, 
Kogan, & Burt, 1967) provide correlational 
Support for the affective-bonds hypothesis, 
they fail to confront it directly by manipulat- 
ing the strength of affective bonds among 
group members. In contrast, the present study 
tested the prediction that affective bonds in- 
crease risk taking in groups by experimentally 
manipulating group cohesiveness. 

A second purpose of the present study was 
to assess the impact of the social responsibil- 
ity norm upon group risk taking. Prior stud- 
ies primarily focus upon individual or per- 
sonal responsibility; that is, an individual’s 
feeling of liability for a decision whose conse- 
quences affect one’s self and one’s fellow 
group members, In contrast, social responsi- 
bility refers to a group’s sense of liability 
when its decision affects others—beyond the 
immediate group—who lack any voice in 
formulating that decision, Jones and Gerard 
(1967) suggested that social responsibility 
might counteract the diffusion of personal re- 
sponsibility among the individual members 
comprising a group and, therefore, inhibit 


]hos aurou dh on 


y shifts. Consistent with this notion, Ko- 
gan and Zaleska (1969) found a nonsignifi- 
fant risky shift when a group made bets de- 
termining another group’s outcomes on a 
Gambling task. Unfortunately, their study 
does not provide confirmation of this line of 
“thought, in that the experimental situation 
"Ted Kogan and Zaleska to interpret their re- 
Sults in terms of a reciprocity norm, The pres- 
Lent study more clearly tests the hypothesis 
that high social responsibility inhibits the 
Tisky shift, while low social responsibility 
Promotes riskiness in group decisions. 

| Alternatively, cohesiveness and social re- 
_ sponsibility may interact as determinants of 
the risky shift. Past research shows that high 
Cohesiveness increases responsiveness to social 
"norms. For example, Festinger, Schachter, and 
Back (1951) found positive correlations be- 
veen cohesiveness and conformity to group 
tandards. In experimental work groups, 
Berkowitz (1954) found that high-cohesive 
groups produced more than low-cohesive 


ow-productivity norm, high-cohesive groups 
Inhibited productivity to a greater extent 
than low-cohesive groups. These results argue 

at high cohesiveness will enhance whatever 
ectional tendencies are provided by group 
norms. If this latter assumption is correct, the 
Sllects of high cohesiveness upon group risk 
aking should vary depending upon the level 

the social responsibility norm. With high 
ocial responsibility, high cohesiveness should 
nhibit the risky shift to a greater extent 
low cohesiveness. Contrariwise, with 
Tow social responsibility, high cohesiveness 
‘Should produce a greater risky shift following 


Broup discussion than does low cohesiveness. 


METHOD 


lled in introductory psychology at the University 
Minnesota served as subjects and received points 
ard their final examination for participation. 
ibjects were randomly assigned to one of four 
perimental conditions in a 2 X 2 factorial design. 
were 28 four-person groups—7 groups in each 
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Measure of Risk Taking 


The measure of risk taking was 10* oí the u 
items from the Kogan and Wallach (1964, Appendix 
E) Choice-Dilemmas Questionnaire. Choice-dilemmas 
items depict a variety of hypothetical risk-taking 
situations in which a fictitious person must choose 
between a risky or a conservative course of action, 
Respondents are asked to indicate the lowest odds 
for success (e.g., 1, 3, 5, 7, or 9 out of 10) that they 
would consider acceptable in advising the fictitious 
person to attempt the risky alternative. 


Procedure 


Prior to appearing for an alleged experiment on 
group decision making, subjects completed a Group 
Experiments Questionnaire containing items on their 
academic and social background, personal interests, 
self-ratings of characteristic personality traits, and 
a checklist of personality traits considered desirable 
in others. Although this information was actually 
discarded, subjects were led to believe that the in- 
vestigators were attempting to assemble congenial 
discussion groups on the basis of responses to this 
questionnaire. In this way, the stage was set for the 
experimental manipulation of cohesiveness during 
actual sessions, 

Subjects were scheduled in groups of four. Upon 
arrival, subjects sat behind individual partitioned 
booths and received standard pretest instructions (cf. 
Wallach & Kogan, 1965) for completing the risk- 
taking measure. After subjects indicated their private, 
initial levels of risk taking on each item, the experi- 
menter collected their choice-dilemmas booklets, 

Cohesiveness manipulation. While the subjects re- 
mained seated in their booths, the experimenter pre- 
sented instructions for the cohesiveness induction. 
Subjects in the high-cohesiveness condition were told: 


First, let me give you some background concern- 
ing this study. We are currently conducting a 
series of research projects on group decision 
making. Remember the questionnaires you filled 
out? The reason for the questionnaires is that we 
had the idea of trying to select people who would 
form congenial discussion groups composed of 
persons who are highly similar in terms of aca- 
demic and social background as well as personality 
traits. On the basis of these criteria, we were able 
to compose only 10 out of 80 groups as model 
groups. In terms of academic and social back- 
ground, this group possesses the qualifications 
which have been set up to be extremely good at 
this task. Moreover, in terms of personality traits 
you listed for yourselves and those you find de- 
sirable in others, we were able to get a very con- 
genial matching in this particular case. You should 
get along very well with one another. 


On the other hand, the experimenter told groups 
in the low-cohesiveness condition: 


*Ttems 8 and 9 were the ones excluded. 
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First, let me give you some background concerning 
this study. We are currently conducting a series 
of research projects on group decision making 
Remember the questionnaires you filled out? The 
reason for the questionnaires is that we are trying 
to select congenial discussion groups composed of 
persons who are highly similar in terms of aca- 
demic and social background as well as personality 
traits. In some cases, such as with this particular 
group, conflicting time schedules prevent us from 
fulfilling this objective, I’m afraid that as a group, 
you're really quite different from one another in 
terms of expressed interests and personality traits— 
criteria we've found important in the past. Quite 
frankly, we're not quite sure how things will turn 
out. 


Following instructions for the cohesiveness in- 
duction, subjects were assembled around a discussion 
table and received instructions for the conduct of 
group discussions. Specifically, they were told to 
discuss and generate unanimous decisions for each 
item within a time limit of 5 minutes per item. 
Before allowing groups to engage in discussion, the 
experimenter added a final set of comments that 
constituted the induction of social responsibility. 


Social responsibility manipulation. Subjects in the 
high-social-responsibility condition were told: 


There is one more point that I should mention 
concerning your role as advisor. You will notice 
that for each item, you are asked to imagine 
that you are advising the person depicted in the 
hypothetical situation. In this case, since you're 
engaging in group discussion, you are giving him 
advice in the form of a group decision. You are to 
assume that the person will take the advice you 
generate as a group. That is, your group decision 
will be acted upon by that person, and he will 
carry out the course of action you suggest under 
the odds specified in your group decision. 


On the other hand, the experimenter told groups 
in the low-social-responsibility condition: 


There is one more point that I should mention 
concerning your role as advisor. You will notice 
that for each item, you are asked to imagine that 
you are advising the person depicted in the hypo- 
thetical situation. In this case, since you're engaging 
in group discussion, vou are giving him advice 
in the form of a group decision. Much as in 
any advisory situation, you are to assume that 
the person always retains the personal freedom to 
accept or reject the advice that you, as a group, 
provide him. That is, he is not necessarily bound 
to take the course of action under the odds speci- 
fied in your group decision, 


Discussion and posttest Phases. After receiving new 
copies of the choice-dilemmas booklets, the groups 
discussed and arrived at unanimous decisions for 
each item. This consensus decision constituted one 
postmeasure of risk taking. Subjects were then re- 


or 
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turned to their booths and indicated their final, 
private levels of risk taking—a second postmea 
of risk taking—on another fresh set of « 
dilemma booklets. After postmeasures of risk : 
subjects completed a postexperimental questi 
containing items to check the effectiveness 
cohesiveness and social responsibility manipulations 


RESULTS AND Discussion 
Manipulation Checks * 


Four items assessed group cohesiveness. On 
two items, the mean differences between his 
and low cohesiveness were in the expected 
direction but  statisticall mificant. 
" . 5. Toes, M 
High-cohesive groups expressed greater liking 
a great > 


persons 


sire to work with these sam 


other experiment than ohesive groups 
(F = 1.51, df = 1/24, ns, and F= 1.42, dj 
ES 1/24, respectively). More clear-cut dif- 
ference obtained with the other two co- 


hesiveness items. Specifically, high-co 
pared their discussion groups n 
favorably to other groups they had come in 
contact with (F = 3.22, df — 1/24, p < .05). 
Also, on a Guttman scale * (cf. Back, 1951), 
they indicated agreement to levels of greater 
personal intimacy with their fellow members 
than did low-cohesive groups (F = 3.95, df 
= 1/24, p < .05). On a composite index of 
these four items, high- and low-cohesive 
groups also differed significantly (F — 3.32, 
dj = 1/24, p< DA. 

Three different items assessed social respon- 
sibility. Groups in the high-social-responsibil- 
ity condition considered the fictitious person 
to be significantly more dependent upon the 
outcome of their group decision for determin- 
ing an appropriate course of action than 


?Since direction of outcome was predicted, the 2 
values reported for the manipulation checks are all 
one-tailed and were obtained by halving the # values 
from standard F tables. 

5 Green's (1956) index of consistency was em- 
ployed to determine whether this cohesiveness mea- 
sure was indeed a Guttman scale. Unlike the error 
of reproducibility coefficient, the index of consistency 
is unaffected by the popularities of individual items 
comprising a scale. According to Green, a scale pos- 
sesses Guttman properties if its index of consistency 
exceeds the criterion value of .50 in a possible range 
of scores from O to 1. Since the consistency index 
for the cohesiveness measure was .96, it is clearly a 
true Guttman scale. 


groups in the low-social-responsibility condi- 
lion (F= 16.62, dj = 1/24, p < .01). Like- 
Wise, they indicated significantly greater re- 
sponsibility for the fate of the person depicted 
in the hypothetical decision items (F= 
| 18.93, df = 1/24, p< 01). However, reports 
- en confidence that the group decision repre- 
_ sented advice in the best interests of the fic- 
titious person showed no difference (F <1) 
between high and low social responsibility. 
Finally, on a composite index of these three 
items, groups in the high-social-responsibility 
condition differed significantly from those in 
- the low-social-responsibility condition (F = 
14.04, dj = 1/24, p< 01). 


| Risk Taking 


Individuals’ risk scores on each choice-di- 
] item were first reversed so that a 
- high score indicated greater risk. (The lowest 
odds out of 10 that an individual would ac- 
Cept for a given problem were subtracted 
from 10.) Each individual subject's reversed 
Scores were then summed across the 10 items. 
| Group risk scores, whether pretest, consensus, 
or postconsensus, 
the summed risk-taking scores of individuals 


^ 
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The repeated-measures format of the design 
- Provides two measures of group shifts toward 
“tisk: pretest-consensus shift and pretest-post- 
Consensus shift. Whereas the consensus shift 
‘Primarily reflects overt compliance to group 
- influence, the postconsensus shift presumably 
indexes the private compliance remaining 
after direct influence, On the other hand, any 
differences in group risk taking between con- 
Sensus and postconsensus decisions reflect 
noncompliance. 

- Table 1 presents the mean pretest, con- 
j Sensus, and postconsensus scores as well as 
the various risky-shift indexes for each ex- 
‘Petimental condition. Analysis of variance of 
Pretest scores revealed no significant. differ- 
ces (F = 1.46, df = 3/23, ns). "Therefore, 
Indom assignment effectively ruled out initial 
lerences in group risk taking as an expla- 
on of the experimental effects. 
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TABLE 1 
Risk TAKING UNDER Eacu CoxnprTioN 
| | 
Mean Mean 
| Mean | Mean post- | Mean | one 
Conditions pretest come con- EE con- 
score | score pere shift "t 
High Co-High SR | 40.10 | 41.86 | 40.24 | 136 | es 
High Co-Low SR | 44.89 47.06 | 48.10 2.7 321 
Low Co-High SR | 45.64 | 50.28 | sos | 4.04 4.89 
Low Co-Low SR | 44.57 4942 | 49.53 4.85 4.96 


= Co and SR refer to cohesiveness and social responsibility, 
respectively, 

To assess the experimental effects, the 
group risk-taking scores were incorporated in 
a2X2*X 3 ANOVA with repeated measures 
(i.e., pretest versus consensus ver: postcon- | 
sensus) as the third "within? fatior (cf. ^ 
Winer, 1962). The results revealed an interac- - 
tion between cohesiveness and measurements * 
(F = 3.94, df = 2/48, p < 05). Simple-effects 
analysis showed that low cohesiveness pro- 
duced significant consensus and postconsensus 
increases in risk taking (F — 5.29, df — 1/ 
48, p < .05, and F= 547, dj = 1/48, p< 
:05, respectively), whereas high cohesiveness 
yielded only nonsignificant increases (Ez 
2.06, df = 1/48, ns; F=1.94, df= 1/48, 
75). For both high and low cohesiveness, there 
were no differences in risk taking between 
consensus and postconsensus decisions (F < 
1). On the other hand, there was no main 
effect for social responsibility (F — 1.19, df 
— 1/24, ns) nor any interactions with social 
responsibility (Fs < 1), 


Cohesiveness and Risk Taking 


The results of the aforementioned analysis 
directly contradict the affective-bonds hy- 
pothesis as originally stated, in that high co- 
hesiveness inhibited rather than increased the 
risky shift. Yet, one interpretation of the ob- 
tained cohesiveness effect does mesh nicely 
with an explanation in terms of diffusion of 
responsibility. That is, when members become 
more attracted to a group, their strengthened 
affective feeling might make them /ess prone 
to blame one another for the possible negative 
consequences of a risky decision. In other 
words, high cohesiveness might well be ex- 


* Given the significant interaction, the significant 
main effects for cohesiveness and measurements lack 
any interest. 
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pected to interfere with the diffusion of per- 
sonal responsibility. If so, it might also re- 
duce the risky shift—the result of the present 
study. (In accounting for their finding that 
in bystander intervention friends intervene 
more quickly in an emergency than strangers, 
Latané & Rodin, 1969, also suggested that 
affective ties inhibit diffusion of responsibil- 
ity.) Thus, although diametrically opposed to 
the affective-bonds hypothesis (Wallach & 
Kogan, 1965), this interpretation is neverthe- 
less consistent with an explanation of the 
risky shift in terms of responsibility diffusion. 
Further, this suggests that the affective-bonds 
hypothesis was simply ill conceived and not 
logically interrelated to the concept of dif- 
fusion of responsibility. 

An alternative interpretation, however, is 
that the obtained effect of cohesiveness upon 
group risk taking was due to differential in- 
fluence processes. One of the more firmly es- 
tablished findings in the cohesiveness litera- 
ture is that high cohesiveness increases influ- 
ence attempts among group members. Yet 
while most theorizing about the risky shift 
suggests that greater mutual influence among 
group members should increase risk taking 
(cf. Brown, 1965; Dion et al, 1970), one 
could construct theoretical arguments yield- 
ing decreased risk taking as a function of in- 
fluence. Whatever the case, if influence proc- 
esses were assessed, it would be possible to 
determine whether differential influence pro- 
duced the lower risk taking found under high 
cohesiveness or, on the other hand, whether 
the effect depends on some other process such 
as the inhibition of responsibility diffusion. 

Therefore, several analyses were performed 
to estimate the amount of social influence in 
the different experimental conditions. As in 
previous risky-shift studies (Flanders & This- 
tlethwaite, 1967; Teger & Pruitt, 1967; Wal- 
lach & Kogan, 1965), these analyses indexed 
influence processes in terms of reductions in 
the variability of risk taking among group 
members following discussion. Since instruc- 
tions required unanimous decisions in the con- 
sensus phase, consensus decisions necessarily 
had zero variability. Thus, the only relevant 
data for assessing social influence are pretest— 
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posttest changes in the variability oí risk- 
taking scores in groups. Consequently, the 
ranges and variances of individual risk-taking 
scores in the pretest and posttest phases were 
obtained for all groups to compute two sets of 
difference scores. These difference scores were 
used as measures of social influence in the 
experimental conditions. 

If high cohesiveness mitigated the risky 
shift by generating greater social influence, 
the reduction in variability following group 
discussion should be greater for high than 
for low cohesiveness. An ANOVA of the dif- 
ference scores between pretest and posttest 
ranges failed to show any effect for cobcsive- 
ness (F < 1). However, an ANOVA of the 
difference scores between pretest and posttest 
variances did reveal a significant interaction 
between cohesiveness and social responsibil- 
ity (F = 4.56, df = 1/24, p < .05). Simple- 
effects analysis showed that there were greater 
reductions in the variances of decisions for 
high-cohesive groups under high social re- 
sponsibility (F = 4.45, df = 1/24, p < .05), 
whereas with low social responsibility, low- 
cohesive groups exhibited nonsignificantly 
Breater convergence among individual deci- 
sions. If it is the greater social influence from 
high cohesiveness which deters increased risk 
taking by group members, then nonsignificant 
postconsensus shifts toward risk should have 
occurred only for groups in the high-cohesive— 
high-social-responsibility cell. Yet, as men- 
tioned earlier, reduced risky shifts on post- 
consensus decisions with high cohesiveness 
were obtained across both levels of social re- 
sponsibility. Thus, these analyses provide little 
systematic evidence that influence processes 
played any role in producing the obtained co- 
hesiveness effects in group risk taking. 


Social Responsibility 


As mentioned earlier, the results of the 
main analysis on risk taking showed no effect 
for social responsibility. However, an internal 
analysis was also performed with risk taking 
assessed as a function of perceived social re- 
sponsibility. Perceived social responsibility 
Was assessed by a median split of the com- 
posite scores on the social responsibility ma- 


nipulation checks.* Group risk scores for con- 
sensus and postconsensus decisions were ana- 
lyzed by means of 2 X 2 analyses of covari- 
ance with pretest risk-taking scores as the 
covariate (cf. Kiesler, Collins, & Miller, 
1969).° On consensus decisions, there was a 
main effect for perceived social responsibility 
(F= 2.79, df = 1/23, p<.10) which ap- 
proached significance. Providing some support 
for Jones and Gerard’s ( 1967) hypothesis, 
group decisions were less risky with high 
perceived social responsibility, Further, on 
postconsensus decisions, there was a signifi- 
cant interaction between cohesiveness and so- 
cial responsibility ° (F = 4.31, df = 1/23, p 
< .05). Analysis of the simple effects revealed 
that under high perceived social responsibil- 
ity, high cohesiveness produced less risk tak- 
ing than low cohesiveness (F = 7.22, df=1/ 
- 23, p< 01), whereas under low perceived 
Social responsibility, cohesiveness had no ef- 
fect (F < 1). Also, high perceived social re- 
sponsibility produced less risk taking than low 
peceived social responsibility in high-cohesive 
groups (F = 10.70, df = 1/23, p < .01) but 
not in low-cohesive groups (F < 1). Thus, 
the internal analysis provides some evidence 
that high perceived social responsibility deters 
increased risk taking—whether in consensual 
Broup decisions or in the subsequent, indi- 


"The three items which served as checks on the 
- Social responsibility manipulation were phrased as 
follows: (a) *How dependent were the persons de- 
Dicted in the hypothetical situations upon the out- 
Come of your group decisions for determining an 
Appropriate course of action?" (b) “To what extent 
- Was your group decision responsible for the fate of 
- the person depicted in the hypothetical situations?" 
(c) "To what extent are you confident that the 
Advice generated by your group decisions is in the 
t interests of the person depicted in the various 
4 hypothetical situations?" Subjects responded to these 
~ items on 5-point scales, with possible scores ranging 
T from 0 to 4 on each item. 

"The median split on social responsibility scores 
for the internal analysis produced unequal cell fre- 
quencies, Due to computational difficulties in re- 
Peated-measures ANOVA with unequal frequencies, 
an alternative, statistically powerful analysis was 
Performed—namely, unweighted-means ANCOVA 
With pretest as covariate, 

"" Given the interaction, the significant main effect 
yt perceived social responsibility on postconsensus 
“cisions lacks interest, 
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vidual decisions of persons in high-cohesive 
groups. 


Inhibitors of the Risky Shift 


In conclusion, the present study provides 
evidence that high cohesiveness and high per- 
ceived social responsibility depress the usual 
phenomenon of increased risk taking by 
groups. From a theoretical viewpoint, both 
cohesiveness (see Discussion) and social re- 
sponsibility (cf. Jones & Gerard, 1967) may 
produce this effect by inhibiting the diffusion 
of personal responsibility among group mem- 
bers. This conceptualization of the two varia- 
bles suggests investigating other variables that 
may also inhibit the diffusion of responsibil- 
ity and, hence, the risky shift. For example, 
Jones and Gerard (1967) proposed that vari- 
ous "organizational features," such as leader- 
Ship formation, role systems, and a division 
of labor, might counteract the diffusion of 
responsibility in groups. Similarly, status in- 
equities in groups should also limit oppor- 
tunities to diffuse responsibility onto others 
(cf. Wallach et al., 1964). Presumably, status 
differences among group members might re- 
duce their sense of “common fate" and re- 
strain interpersonal communication —thus in- 
terfering with responsibility diffusion. In sum, 
experimental investigation of these group 
properties might reveal additional factors that 
curtail the risky shift and lend support to an 
interpretation of this phenomenon as due to 
diffusion of responsibility. 
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Choice-dilemmas items were em 
for the shifts to risk or cauti 


ployed 


items received no support. Learning 
absence of group discussion did, ho: 
cautious items, although the shift in 


of the shifts for risky and cautious 


The familiarization theory concerning the 
influence of group discussion on risk taking 
Maintains that group discussion increases the 
subject’s familiarization with the choice- 
dilemmas items and thus enables him to make 
à better, more rational decision. Whether or 
not the decision is more rational, a risky de- 
cision for items which show a shift to risk 
and a conservative decision for items which 
typically shift to caution are a result of fur- 
ther scrutiny of the problem. Two studies 
(Bateson, 1966; Flanders & Thistlethwaite, 
1967) obtained a shift to risk simply by 
having individuals study the choice-dilemmas 
items more thoroughly and asking them to 
Prepare a brief listing important points to be 
Considered. Teger, Pruitt, St. Jean, and Haa- 
land (1970), however, attempted four separ- 
ate replications of Bateson’s study, and in 
each case were unsuccessful. 

A second theory to be considered is based 
on the assumption that a cultural value exists 
Which favors risk over conservatism in some 
Situations and conservatism over risk in others 
(cf. Brown, 1965). An implication of this 
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(r = —.97), indicating that the two types of 
the variables manipulated in this study. Possib| 
theories as to how risky and cautious items 
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to test two theories which could account 
h follow group discussion of the items. 
ion of increased familiarization with the 


the odds chosen by other subjects in the 
wever, 


lead to a shift for both risky and 
case was of less magnitude than 
discussion. Further, the correlation 
across conditions was very high 
items were affected similarly by 
le explanations for the shifts and 
differ are presented. 


each 


items 


theory is that information about the odds 
choices of other subjects, without actual group 
discussion, should produce a shift to risk or 
caution, depending on the item. No evidence 
of a significant shift was found by Wallach 
and Kogan (1965) under such conditions. 
However, Teger and Pruitt (1967) replicated 
Wallach and Kogan’s study, eliminating a 
requirement of the first study that group 
consensus on the odds to be chosen for each 
problem be reached, and found that the sub- 
jects’ final decisions were more risky than 
their original ones. The magnitude of this 
shift, however, was significantly less than 
when discussion was involved. 

To explore the relative validity of these 
two theories, the opportunity for further fa- 
miliarization with the problems and knowl- 
edge of others’ odds choices were manipulated 
independently. A traditional condition with 
group discussion of each item was employed 
to determine if any shifts which did occur 
were of comparable magnitude to shifts which 
occur after group discussion. Finally, a con- 
dition in which subjects simply responded to 
the choice-dilemmas items twice served as a 
control. 

This design includes replications of Bate- 
son's (1966) study and of Teger and Pruitt's 
(1967) work. The procedure differs from 
Teger and Pruitt's in that it employs written 
communication of information about the 
choices of others rather than face-to-face ex- 
change. Hence, any shifts in this condition of 
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this study cannot be attributed to a leader- 
ship phenomenon. Further, the present study 
included a condition where the same subjects 
were exposed to the manipulations of both 
Bateson and Teger and Pruitt. Thus, any 
shifts resulting from an interaction of these 
manipulations can be examined. Finally, both 
items on which shifts to risk typically occur 
and items which typically shift to caution 
were employed. Hence the similarities and 
differences of the behavior of risky and cau- 
tious items can be assessed. 


METHOD 
Subjects 


One hundred and thirty-nine volunteers from an 
introductory psychology course served as subjects in 
order to fulfill a course requirement of participation 
in experiments, Subjects served in one of four 
experimental conditions or in the control group in 
groups of five or six, Five groups were assigned 
randomly to each of the five conditions. 


Instrument 


The instrument was composed of six items: three 
which have been shown to shift to risk after group 
discussion and three which shift to caution. The 
order of the items was random. The risky items were 
taken from Kogan and Wallach’s original 12 (Wal- 
lach & Kogan, 1959), The cautious items were written 
from summaries which appeared in Stoner (1968). 
The central person in each problem is faced with a 
choice between two courses of action, one of which 
is more desirable and attractive than the other, 
The probability of attaining or achieving the more 
attractive alternative, however, is lower than the 
probability of attaining the less desirable alternative. 

Subjects were asked to place themselves in the 
position of advising the central character in each 
situation, They were asked to choose the lowest odds 
of success they would consider acceptable for the 
central character to adopt the more risky alternative, 
The options in intervals of 1/10 were presented 
ranging from 1/10 to 9/10. All subjects were ad- 
ministered the instrument once immediately upon 
entering the experimental room and again during 
the experimental manipulation. On the second ad- 
ministration of the instrument, instructions empha- 
sized that the study was not a test of memory or 
consistency of responding and that subjects should 
ignore their previous responses in completing the 
instrument. Subjects, however, were advised that 
they should not feel compelled to change from their 
previous responses, but that they should complete the 
instrument as if they had never seen it before by 
answering with what they felt was the best choice, 
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Procedure 


The subjects reporting for a particular experimental 
session met the experimenter in a central room. 
When the actual experiment was to begin, each sub- 
ject was assigned a cubicle and spent the remainder 
of the session alone, although each subject was 
aware that other subjects were participating simul- 
taneously, In the one condition where face-to-face 
group discussion was desired, the subjects completed 
the instrument initially alone in the cubicles, but 
were brought to the central room for the discussion. 


Conditions 


The five experimental treatments are summarized 
below: 

Familiarization (F). After the administration | 
a pretest, subjects were given an opportuni! 
familiarize themselves further with the choi 
mas items. Subjects were asked to write do A 
favoring both possible alternative courses of : 
for the central character for each problem, 
minutes were allowed for further considera 
each problem. At the end of each 3-minute 
subjects retook a single item of the original 

Knowledge of others’ odds choices (O). After the 
administration of a pretest, subjects began to answer 
questions on the Background Questionnaire, a ques- 
tionnaire which asked for detailed information con- 
cerning the SW. sanity background, educational 
history, previous jobs, etc. While taking this ques- 
tionnaire, subjects were interrupted every 3 minutes, 
informed in writing of others’ odds choices for a 
particular item, and asked to retake the item, giving 
whatever consideration they wished to the informa- 
tion about the odds choices of others. 

Familiarization and knowledge of others’ odds 
choices (FO). After the administration of a pretest, 
subjects were allowed further familiarization with 
the items and also were informed of the odds chosen 
by others in their group. Thus, a subject in this 
condition would familiarize himself with an item 
(as in Condition F) and would be informed of the 
odds chosen by others in their group for that item 
(as in Condition O). Familiarization and knowledge | 
of others’ odds choices were presented in counter- 
balanced order. Three minutes were allowed for 
further consideration of each problem. At the end 
of each 3-minute interval, subjects retook a single 
item of the original six, 

Group discussion (G). After the administration of 
a pretest, subjects met in groups of five or six and 
discussed each of the items in the instrument 85 
in the traditional studies in this area. After 3 min- 
utes of discussion of an item, subjects retook the 
items individually. No mention of a requirement for 
consensus in the group was made. 

Control (C). After the administration of a pre- 
test, subjects took the Background Questionnaire 
(as in Condition O) for 18 minutes. At 3-minute 
intervals during the taking of the questionnaire, sub- 
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jects were interrupted to retake a single item of 
the original six. 


Statistical Analysis 

An analysis of variance was performed for the 
factorial design involving conditions (C), type of 
items (T, risky or cautious), particular items nested 
within types (I), and groups of subjects nested 
within conditions (G). Factors C, T, and I are 
considered to be fixed, while G is random. 


RESULTS 


The dependent variable was calculated by 
subtracting the mean odds choice after the 
experimental manipulation from the mean of 
the initial responses. These means were com- 
puted over subjects in a group for a par- 
ticular item. Thus, negative scores indicate a 
shift to caution (higher odds choices), and 
positive scores indicate a shift to risk (lower 
odds choices). 

The data for each subject from the FO con- 
dition were analyzed to determine if the order 
of familiarization and knowledge of others’ 
odds choices influenced the shift. For the cau- 
tious items, the difference between the mean 
shift when familiarization preceded knowledge 
of others’ choices of odds and when the 
knowledge of odds choices preceded familiari- 
zation was 12 (f= .51, df — 33); for the 
risky items the difference was —.16 Cc 
~—.46, df = 33). Since the order seemed to 
have no effect, the data from the two sub- 
groups of this condition were pooled. 

The sums of the mean shifts per group and 
the results of the analysis of variance are 
Summarized in Tables 1 and 2, respectively. 
The main effect of type of item (T) (F= 
1.84, df = 1/20, p < .05) indicates that risky 
ànd cautious items do indeed shift in different 
directions, The hypothesis that the magnitude 
9f the shifts of risky and cautious items is 


TABLE 1 
Sum or tHe MEAN Snirrrs PER Group 
Condition 
Item 
type F 
1 (F) 2 (0) | 3 (FO) 4 (G) 5 (C) 

Risky | —4333| 1.867 | 2.833 6.583 | —1.730 
Cautious 1.233 | —4.964 | —4.334 | —11.167 | —.535 
E 
sh Note.—Negative shifts indicate shifts to caution, Positive 
Shifts indicate shifts to risk. 
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TABLE 2 
ANALYSIS OF VARIANCE SUMMARY TABLE 
Source | df MS | F 
Between Ss E 
C 4 -046 <1 
G/C | 20 624 
| 
Within Ss 
T | 1 4.063 7.84* 
CXT 4 2.663 5.14** 
T X G/C 20 518 
I/T 4 .600 1.24 
CXI/T 16 .169 «1 
I/T X G/C 80 485 
*p «.05 
* b <.01 


affected similarly by the experimental treat- 
ments leads to a predicted C x T interaction 
because of the scoring method in which shifts 
to risk and caution are indicated by positive 
and negative numbers, respectively. The C x 
T interaction was significant (F — 5.14, dj — 
4/20, p < .01). The nature of the interaction 
can be seen in Table 1. 

If the C X T interaction is an artifact of 
the scoring method and if risky and cautious 
items do behave similarly with respect to 
experimental conditions, then a significant 
negative correlation between the shifts for 
risky and cautious items with respect to con- 
ditions would result. The Pearson correlation, 
calculated across conditions employing the 
data in Table 1, is —.97. Thus, a little over 
94% of the variance attributable to the effect 
of the experimental treatments is common for 
risky and cautious items and hence is pre- 
dictable from one type of item to the other. 
Therefore, 94% of the sum of squares for the 
C X T interaction is common for risky and 
cautious items. The residual is 6% of the 
total sum of squares, or .6392. If this residual 
is significant, risky and cautious items are 
being affected differentially by the experi- 
mental treatments. If it is not significant, 
there is no reason to assume that the two 
types of items behave differently with respect 
to the conditions. To test the residual for 
statistical significance, the mean square must 
be obtained and tested with the F statistic, 
using the error term which was appropriate 
for the C x T interaction. The mean square 
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TABLE 3 
ORTHOGONAI Contrasts FOR Tur C X T INTERACTION 


Source 4f | MS Í F 
CXT |4 | 2.663 | 5.14** 
a. Familiarization (F-4+FO vs, 
0 4 C) 1| 116] <1 
b. Knowledge of odds choices | H 
(O + FO vs. F + C) 1 | 3.488 | 6.73* 
c. Interaction F X O (F +0 
vs. FO + C) 1| .209| «1 
d. Group vs. mean of other con- 
ditions (M of F + O + FO | | 
+ C vs, G) | 1 | 6,840 | 13.20** 
Error (T X G/C) 2| 518 | 
| 
*p <05. 
** 5 « 01. 


for the residual of the C X T interaction has 
3 degrees of freedom, 1 degree of freedom 
having been lost from the original 4 to ac- 
count for the predictable component of the 
variance common to risky and cautious items. 
The obtained F is 213/.518, less than 1. 

In order to determine the contributions of 
familiarization and knowledge of others? odds 
choices to the shifts to risk and caution, the 
sum of squares for the C x T interaction was 
partitioned into four orthogonal contrasts as 
shown in Table 3. Knowledge of the odds 
choices of others leads to a shift, but fa- 
miliarization with the choice-dilemmas items 
appears to have no effect. The interaction of 
these factors is not significant. The shift 
which occurs after group discussion of the 
items is significantly greater than the mean of 
the shifts for the other conditions. It appears 
that knowledge of others’ odds choices leads 
to a shift for both risky and cautious items, 
but in each case the shift is of smaller magni- 


tude than the shift obtained after group 
discussion. 


Discussion 


The high correlation (r = —.97) between 
the risky and cautious items indicates that 
risky and cautious items behave alike with 
respect to the variables manipulated in this 
study. The results indicate that increased 
familiarization with the choice-dilemmas 
items leads to no shift, whereas knowledge of 
the odds choices of others leads to some shift, 
but a shift of smaller magnitude than the 
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shift produced in the group-discussion condi- 
tion (comparing Condition O to Condition G 
with a contrast not orthogonal to those in 
Table 3, F = 11.93, df = 1/20, p < 01). 

That increased familiarization through 
further study of the choice-dilemmas problems 
leads to no shift in the expected direction for 
either risky or cautious items supports the 
contention of Teger et al. (1970) that the 
findings of Bateson (1966) and Flanders and 
Thistlethwaite (1967) are not reliable. 

That knowledge of others’ odds choices 
leads to a shift in the expected direction sup- 
ports some version of the value hypothesis. 
Apparently subjects strive to maintain a 
tion of relative risk or caution, depending or 
the item, with respect to other subjects. 
finding, however, is not consistently suppx 
in the literature. This study replicates 'eger 
and Pruitt (1967) and Clark and Willems 
(1969) by finding a shift in the knowledge of 
others’ odds choices condition. Bell and 
Jamieson (1970) and St. Jean (1970), how- 
ever, found only nonsignificant shifts in these 
circumstances. The latter, in fact, found mo 
shift at all when the information about others’ 
odds choices was presented in the form of 
messages as in the present study. Finally, 
Murdoch, Myers, and Smith (1970) used 
both risky and cautious items and found no 
significant shift in the knowledge of others’ 
odds choices condition. 

Knowledge of others’ odds choices does not, 
however, account for the full shift obtained 
in the group-discussion condition. Some addi- 
tional factor operating only when group dis- 
cussion is present may account for part of 
the shift. Perhaps information and arguments 
which affect a shift in the expected direction 
are presented in the discussion. This informa- 
tion and these arguments would not all be 
available to a single subject studying the prob- 
lem as in the familiarization condition of this 
study, Increased information and more argu- 
ments would not, however, lead to a shift in 
the expected direction unless the information 
and arguments presented predominantly fa- 
vored the expected direction of the shift. Per- 
haps the nature of the problems is such that 
more arguments favoring risk for risky items 


E ee ae S 


EFFECTS OF FAMILIARIZATION AND KNOWLEDGE oN Risk 


* and caution for cautious problems are avail- 


able. That is, when all the arguments are 
presented and the evidence weighed, perhaps 
the "best" decision is one which leads to a 
shift in the expected direction, 

Data collected in Condition F, when sub- 
jects were writing arguments favoring each 
of the two alternative courses of action for 
the central character in each item, are rele- 
vant to this point. According to this hypothe- 
sis, subjects should write arguments which 
predominantly favor risk for risky items and 
caution for cautious items. Further, having 
written these arguments, the subjects in Con- 
dition F should shift in the expected direc- 
tion. In each case the data reveal trends op- 
posite to those predicted above. The risky 
* items yielded a mean of 4.72 arguments per 
subject favoring the risky decision and 5.32 
arguments favoring the cautious action (t= 
1.12, df = 24, ms). For the cautious items, 
the mean number of arguments favoring risk 
and caution were 5.84 and 4.80, respectively 
(t — 2.34, df = 24, p < .05). These findings 
are less perplexing when one notes that both 
risky and cautious items showed tendencies to 
Shift in the unexpected direction for Condi- 
tion F (see Table 1). It seems plausible, then, 
that the direction of the shift is in part de- 
termined by the distribution of the arguments 
Presented. If this is the case, the proportion 
of risky arguments to total arguments pre- 
sented for a risky item should correlate posi- 
tively with the magnitude of the shift on the 
item for a particular subject. Similarly, for a 
Cautious item, the amount of shift should cor- 
relate with the proportion of arguments which 
favor the cautious alternative. These corre- 
lations between subjects would be predicted 
even though Condition F did not lead to an 
Overall shift for either type of item. The ob- 
tained correlations were not significant, but 
Were in the direction predicted (r = 13, df 
= 23, for the risky items; r = .16, df = 23, 
for the cautious items). The data suggest that 
the hypothesis that the shift to risk or caution 
I5 in part a function of the distribution of the 
arguments presented is worthy of examination 
in an experiment where the nature of the 
arguments presented is manipulated. 
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A possible explanation of why arguments 
and information presented in the group dis- 
cussion predominantly favor a shift in the 
expected direction is that a value placed on 
risk for risky items and on caution for cau- 
tious items biases the nature of the informa- 
tion exchanged in the group for each prob- 
lem. One would be quick to present an argu- 
ment favoring a shift consistent with the 
value held (cf. Brown, 1965; also, Teger & 
Pruitt, 1967). Further, these value-consistent 
arguments would be more salient and more 
readily acceptable to the group members. 
This idea, then, really combines the familiari- 
zation and cultural value theories: familiari- 
zation leads to gaining information from self 
and others, but the nature of this information 
is in part determined by the cultural value. 

The reasons why different items consistently 
elicit different values is not clear. Future 
studies attempting to determine just what fac- 
tors cause an item to be risky or cautious are 
indicated. One possibility is that cautious 
items involve decisions which are irreversible. 
Further, cautious items seem to carry mor 
consequences, both beneficial and adverse, for 
persons other than the central character (e.g., 
a wife or family) than do risky items. 

Although the present study sheds no light 
on the nature of the difference between risky 
and cautious items, it does show that the two 
iypes of items behave similarly; that is, the 
factors which affect the shift to risk for risky 
items lead to a shift to caution for cautious 
items. The implications are that any model 
which accounts for one type of shift should 
account for the other also, and that future 
studies attempting to elucidate the nature of 
the shifts to risk and caution should include 
work with cautious as well as risky items. 
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The phenomenon of individuals taking 
greater risk in groups than when acting alone 
has been labeled the risky shift and has been 
1] the subject of considerable research and theo- 

retical analysis. Jellison and Riskind (1970) 
have utilized social comparison theory to 
Interpret the research on the risky shift. 
Coupling the assumption that greater risk is 
perceived as an indication of greater ability 
with the assumption from social comparison 
theory (Festinger, 1954) that people prefer 
to locate themselves slightly higher in ability 
y than similar others, Jellison and Riskind 
` argued that people take greater risks in groups 
in an attempt to demonstrate that they are 
slightly higher in ability than the other group 
members. 

Jellison and Riskind found support for the 
hypothesis that the greater the level of risk 
another recommends, the greater will be the 
Perceived ability of the other. This hypothesis 
Was based on the assumption that ability is a 
Major determinant of how much risk a person 
will take. If the magnitude of the costs asso- 
ciated with failure at a task are held con- 
Stant, then the riskiness of a task can be 
Viewed as a joint function of the difficulty of 
the task in relation to the ability of the per- 
., 9n. The same task might represent a high 

tisk for a person of low ability and a low risk 

for a person of high ability. 


„>This research was partially supported by a Na- 
tional Science Foundation Undergraduate Research 
p Fe OWship awarded to the second author. 

* Requests for reprints should be sent to Jerald 
ellison, Department of Psychology, Mezes Hall 211, 
niversity of Texas at Austin, Austin, Texas 78712. 
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" ATTRIBUTION OF RISK TO OTHERS AS A 
FUNCTION OF THEIR ABILITY' 


AND JOHN RISKIND 


Yale University 


An experiment is reported which tested the hypothesis that people would at- 
tribute greater riskiness to another as a direct function of the other's ability. 

t The data supported the hypothesis and the idea that ability and risk taking 
are perceived as closely related. This relationship between ability and risk 
taking is basic to an interpretation of the vast literature on the risky shiít 
in terms of social comparison of ability. 


If people assume that individuals choose a 
level of risk which is a direct function of their 
level of ability, then people should expect that 
the higher the level of another's ability, the 
higher the level of risk the other will take. 
This can be easily tested by providing people 
with information about the ability of another 
and asking them to predict how much risk the 
other would take or recommend. In manipu- 
lating ability, it is important that the other 
of high ability not be perceived as more irre- 
sponsible or impulsive, because this factor, 
rather than ability, might account for the 
greater attribution of riskiness rather than 
ability.* 


METHOD 


The subjects were 40 undergraduate students in 
introductory psychology, with males and females be- 
ing approximately equally represented. Subjects were 
run in two sessions and were randomly assigned to 
one of the experimental conditions within each 
session. 

When the subjects for a session had assembled, the 
experimenter explained that the study concerned 
persons’ ability to estimate the behavior of another 
person on the basis of a first impression. Their task 
was to read a student’s results on the Watson-Glaser 
Critical Thinking Appraisal (WGCTA), form an im- 
pression of him, and then estimate the responses the 
person had previously made to the items on a per- 
sonal questionnaire, called the “choice-dilemmas” 
booklet. The experimenter then gave each subject a 
booklet containing the supposed results of a student's 
performance on the WGCTA. 


3 In another experiment utilizing the same manipu- 
lation of ability as that used in the present study, 
the high-ability person was rated as much more 
hard working than the low-ability person. Being 
hard working would not seem to indicate impulsive- 
ness or riskiness and might even suggest constraint 
and moderation. 
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Ability Manipulation 


The general ability oí the stimulus person was 
manipulated by his score on the WGCTA. The 
WGCTA was described as a test of creative and 
intellectual ability. The test itselí was described as 
consisting of five subscales (deduction, interpreta- 
tion, recognition of assumptions, analysis, and evalu- 
ation), which together measured "the basic thought 
processes involved in all everyday decisions." In thc 
high-ability treatment, the student scored above the 
95th percentile of a large national sample of college 
students, A score interpretation guide included with 
the test results indicated the following: 


A score in this range indicates very high scores 
on all five of the subscales on the test of critical 
thinking ability. A score in this range indicates a 
brilliant person of extremely high intelligence, 
creativity, competence, and intellectual ability. 


In the moderate-ability treatment, the student 
scored in the 56th percentile, A score in this range 
was described as indicating: 


A score in this range indicates only average 
scores on all five of the subscales. A score in this 
range indicates an average person of average in- 
tellectual abilities and creativity. 


In the low-ability treatment, the student scored 
in the bottom 7th percentile. The score interpreta- 
tion guide indicated: 


A score in this range indicates extremely low 
scores on all five of the subscales. A score in this 
range indicates a person of extremely low intelli- 


gence, creativity, competence, and intellectual 
ability. 


Measure of Risk 


mas booklet were to choose the lowest probability 
of attainment for recommending the high-risk action, 
The experimenter explained that the person’: 


5 J , 1 in 10 
++. 9 in 10, 10 in 10. Before the subjects made 


their estimates, the experimenter explained that the 


low odds indicated high risk and the high odds indi- 
cated low risk, 


RESULTS AND Discusston 


The measure of the dependent variable, at- 
tributed risk, was based on the sum of the 


TABLE 1 
MEANS or ATTRIBUTED Risk 


Ability of the other 


Low (13) Moderate (13) High (14) 
597; | 53.31 | 4036 3 
, the lower the amount of risk 
n each condition is given in 


number chosen for each of the 10 items (e.g., 
0in10—0,1in 10—1...10 in 10 — 10). 
The sum of the scores for the 10 items could 
range from 0 to 100, representing a range of | 
very high risk to very low risk, The means for 
the three experimental conditions are pre- 
sented in Table 1. As can be seen in Table 1, 
greater ability was related to greater attrib- i 
uted risk. An analysis of variance indicated 
that these differences were signi 
the .001 level (F = 1128, df = 2/38). A t 
test indicated that the difference between the 
high- and low-ability conditions was highly 


t beyond 


(t = 4.54, df = 25, p < .001).4 A 


similar | mparison between the high-ability | 
and derate-ability conditions was also sig- 
nificant (f= 3.00, df —25, p< 01). The 


difference between the moderate- and low- 
abilities conditions was not significant (£= 
1.56, df= 24, p < .15). 


The present experiment provides an under- 


| 
standing of the findings of Wallach and Wing | 
(1968) and Levinger and Schneider (1969), | 
and may be considered à conceptual replica- 
tion of those findings. In the Levinger and 
Schneider study, subjects completed a choice- 
dilemmas questionnaire for themselves and 
then estimated the risk preferences of a group 
of peers and finally the risk preference they 
would most admire. It seems reasonable to 
Suppose that subjects perceived themselves as 
being | 
tively low in abili y, and the most admired 
preference to be"that of someone high in 
ability, The results indicated that the sub- 1 
Jects expected their peers to take lower risks 
than themselves and that the most admired 
tisk preference was higher than their own. 


derate in ability, their peers as rela- 


* Significance level is based on two-tailed ¢ tests. 
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Given the high similarity in the results of the rec J. M, & Risxivp, J. A social comparison 
nas " ilities interpretation of risk taking behavior. 

present experiment and that of Levinger and wale ; 

Schneider, a strong case can be made for the Aree provo and Social Psychology, 

assumption that the level of risk attributed in du - 


: : : Levincer, G., & Scunewer, D. J. Test of the “risk 
the Levinger and Schneider study is based on is a value” hypothesis. Journal of Personality and 


differences in perceived ability. Social Psychology, 1969, 11, 168-169, 
Warracu, M. A., & Wine, C. Is risk a value? Jour- 
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EXPRESSION OF *WALTER MITTY-NESS" 
IN ACTUAL BEHAVIOR ' 


KENNETH L. HIGBEE ? 


Brigham Young University 


On the Choice-Dilemmas Questionnaire (CDQ)—a measure of hypothetical 
risk taking—Pruitt found that subjects tended to make cautious decisions, but 
to view their decisions as being risky. He labeled this effect a “Walter Mitty” 
effect. In Study I, investigating a possible methodological source of error in 
Pruitt's study, the Walter Mitty effect was replicated. Study II investigated a 
slightly different kind of "Walter Mitty-ness"—the tendency to anticipate being 
riskier than one sees himself as having been aíter the fact. Two-man teams 
(assigned on the basis of similar CDQ scores) participated in a complex 
decision-making task. Analysis of their behavior indicated that Walter Mitty- 
ness may be expressed in actual risk-taking behavior as well as hypothetical risk 
taking. It was also suggested that the CDQ (and other measures of hypothetical 
risk taking) might be viewed as a measure of a personality trait of Walter 


Mitty-ness. 


In a short story entitled “The Secret Life 
of Walter Mitty," James Thurber described 
a man who lived a cautious, mundane life 
with a domineering wife. Though Walter 
Mitty's actual behavior was cautious, he was 
characterized by his daydreams of daring, 
risky adventures. Is this kind of fantasizing 
also characteristic of real people? If so, is 
there a general characteristic of fantasied 
riskiness—a touch of “Walter Mitty-ness”— 
in all people? Or is Walter Mitty-ness a char- 
acteristic of some people and not of others? 
Concerning the latter question, Slovic (1962, 
1964) has suggested that it may not be appro- 
priate to view riskiness as a personality di- 
mension because of a lack of convergent 
validity among various risk measures, His con- 
clusion, however, refers to actual riskiness, 
and does not preclude the existence of a 
personality dimension of fantasied riskiness. 

Tf the answer to any of the above questions 
is “yes,” it may have some important prac- 
tical implications. For example, Alsop (1970), 


1 The author wishes to acknowledge the assistance 
of Larry Farmer in conducting Study I. The author 
is also indebted to Siegfried Streufert for his assist- 
ance and guidance in conducting Study II, which is 
based on a doctoral dissertation submitted to Purdue 
University. Support from the Office of Naval Re- 
search, Group Psychology Branch, is also gratefully 
acknowledged. } 

* Requests for reprints should be sent to Kenneth 
Higbee, Department of Psychology, Brigham Young 
University, Provo, Utah 84601. 
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in discussing members of radical groups, sug- 
gested that 

they have all been (for want of a better word) 
Waltermittyized. They live in a romantic, television- 
imitating world, in which a semiliterate North 
Carolina Klansman, for example, becomes in his own 
mind a fearless gunslinger ; a pasty-faced student with 
a fuzzy beard becomes Che Guevara or the young 
Lenin; or a young Negro becomes the conqueror of 
honky America... . Waltermittyization, because it 
romanticizes violence, has made them desperately 
dangerous [p. 100]. 


Using the Choice Dilemmas Questionnaire 
(CDQ), Pruitt (1969) attempted to deter- 
mine whether Walter Mitty-ness may charac- 
terize real people as a whole, as well as 
Thurber’s fictional character, Pruitt asked one 
group of subjects (Condition A) to rate each 
possible answer (1, 3, 5, 7, 9, and 10 chances 
out of 10) to each of 10 CDQ problems as 
either risky or cautious, by placing an “R” or 
a “C” next to each answer. Items 5 and 12 
of the original CDQ were not included. Sub- 
jects were then given a second CDQ booklet 
and asked to complete the 10 items in the 
usual manner (i.e., to make decisions about 
what course of action to recommend to the 
central person in each story). Most of the 
subjects chose probability levels which they 
had previously rated as cautious. t 

Another group of subjects (Condition C) 
made their decisions on the 10 items, then 
went through the same booklet again and 
rated each possible answer as risky or cau- 
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tious. Thus, they rated the answers after they 
had made their decisions as to which prob- 
ability level they would accept for each prob- 
lem. These subjects tended to rate the choices 
they had made as risky rather than cautious, 
In other words, subjects chose alternatives 
which, in a dispassionate moment, they had 
called cautious (Condition A); but com- 
parable subjects told themselves that their 
choices were risky when looking back on them 
(Condition C). Pruitt labeled this effect the 
"Walter Mitty" effect, after the hero of 
Thurber's story. Pruitt concluded that like 
Walter Mitty, people tend to behave cau- 
tiously, but to perceive their behavior as risky. 

The two studies comprising the present in- 
vestigation are concerned with three aspects 
of the Walter Mitty effect. First, could the 
results of Pruitt’s study be attributed in part 
to the fact that subjects in Condition A made 
their ratings and their decisions in two differ- 
ent booklets, while those in Condition C used 
the same booklet for the two tasks? Second, 
can a kind of Walter Mitty effect similar to 
that found by Pruitt on a measure of hypo- 
thetical risk be found in actual behavior? 
Third, can Walter Mitty-ness be regarded as 
a personality dimension (ie., are individual 
lifferences in various measures of fantasied 
tiskiness correlated with one another)? Study 
I is concerned with the first question, and 
Study II is concerned with the last two 
questions, 


Srupy I 


. In Condition A of Pruitt’s study, the sub- 
Jects rated each alternative as risky or cau- 
tious in one CDQ booklet. They then received 
4 second CDQ booklet in which to make their 
decisions concerning which alternative to rec- 
ommend for each problem. However, in Con- 
dition C, the subjects made their decisions, 
then went back through the same booklet 
again and rated each alternative as risky or 
, Cautious. In other words, Condition C dif- 
fered from Condition A not only in the re- 
Verse order of the rating task and decision 
task but also in that the subjects performed 
the two tasks in the same booklet. Thus, they 
had their decisions right in front of them 
When they were rating each of the alternatives. 
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This may have reminded them which alter- 
natives they had chosen, and thus increased 
their tendency to rate their preferred choices 
as risky. 

If the decisions made by the subjects in 
Condition C were made less salient by having 
the ratings done in a separate booklet, would 
these subjects still have tended to rate their 
preferred decisions as risky? And if the 
ratings made by the subjects in Condition A 
were made more salient by having the de- 
cisions made in the same booklet, would these 
subjects have been more likely to choose alter- 
natives which they had rated as risky? Or 
might Pruitt's Walter Mitty effect be attrib- 
uted in part to a methodological difference 
between Condition A and Condition C rather 
than to a real effect? In this study, an attempt 
was made to replicate Pruitt’s results, with 
the inclusion of two additional groups to in- 
vestigate the above questions. 


Method 


The method was similar to that used by Pruitt. 
Subjects were required to perform two tasks: (a) 
rate each alternative to each of the 10 CDQ situ- 
ations as either risky or cautious, by placing an 
"R" or a *C" next to it, and (b) make decisions 
as to which alternative they would recommend for 
each of the 10 situations. 

A 2X2 factorial design was used—Task Order 
(ratings-decisions, decisions-ratings) X CDQ Booklets 
(different, same). Group I made their ratings in one 
booklet, then their decisions in a second booklet 
(Pruitt’s Condition A). Group II made their de- 
cisions in one booklet, then their ratings in a second 
booklet. Group III made their ratings, then their 
decisions, in the same booklet. Group IV made their 
decisions, then their ratings, in the same booklet 
(Pruitt's Condition C). Each group contained seven 
males and seven females enrolled in a course in 
introductory psychology. 


Results and Discussion 

The mean number of preferred decisions 
which were rated as "risky" by subjects in 
each of the four groups may be found in 
Table 1. 

The results showed a Walter Mitty effect 
similar to that found by Pruitt. Subjects in 
the ratings-decisions groups (I and III) 
tended to rate fewer of their preferred choices 
as risky than did subjects in the decisions- 
ratings groups (II and IV). A three-way 


ve 
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TABLE | 


MEAN Nowoxs or Prerexeen DECISIONS 
PresCkiVED as Risky 


Task order 
CDQ booklets IN 
Ratingsdecisions | Decisions-ratings 
Different 3.07 | 4.79 
š 5.43 


Same 3.29 


analysis of variance was computed—Task 
Order (ratings-decision, decision-ratings) x 
CDQ Booklets (different, same) X Sex (male, 
female). Since sex showed no significant main 
effect or interactions, the data for males and 
females were combined for the two-way analy- 
sis depicted in Table 1. The difference between 
the two task orders was significant (F = 9.42, 
dj = 1/52, p < .01). Thus, subjects tended to 
choose alternatives which they had previously 
called cautious (Groups I and III), but 
viewed their behavior as risky when looking 
back and evaluating it (Groups II and IV). 

It made no difference whether the decisions 
and the ratings were made in the same booklet 
or in different booklets (F < 1). In addition, 
there was no Task Order x CDQ Booklet 
interaction (F < 1). Thus, the Walter Mitty 
effect obtained in the present study (and by 
Pruitt) is apparently not significantly affected 
by whether the rating task and the decision 
task are done in separate booklets. 

The possibility remains that the difference 
in perceived riskiness between subjects in the 
two task orders was affected by differences in 
actual riskiness. That is, subjects in Groups 
II and IV may have perceived their behavior 
as being more risky because their decisions 
actually were more risky on the 10 CDQ 
items than were the decisions of subjects in 
Groups I and III. However, an analysis of 
variance of the scores on the 10 CDQ items 
revealed that the mean score for Groups II 
and IV (X — 53.3) did not differ significantly 
(F <1) from the mean for Groups I and IIT 
(X = 51.7). Thus, even though subjects in 
the two task orders differed in the perceived 
riskiness of their decisions, they did not differ 
in the actual riskiness of their decisions. The 
Walter Mitty effect appears to be a real 


Kenneru L. Hicsee 


tendency for subjects to behave 
while perceiving their behavior to | 


tiously 


risky. 
Stupy II 


The Walter Mitty effect found in Study I, 
and by Pruitt (1969), involved a paper-and- 
pencil measure of hypothetical risk taking. 
One question of interest in Study II is 
whether a form of Walter Mitty-ness may be 
found in actual behavior in a complex de- 
cision-making situation. A second question of 
interest is whether individual differences in 
the strength of such a characteristi iv be 
related to riskiness on the CDQ. Since the 
CDQ is a measure of hypothetical riskiness, 
it may be that a CDQ score is a better index 
of how risky a person thinks he would be than 
it is of how risky he really is. 

In addition, it may not be entirely appro- 
priate to call the effect found in Study I, and 
by Pruitt (1969), a Walter Mitty effect. 
Subjects in these studies tended to behave 
cautiously and view their behavior as being 
risky. Walter Mitty was indeed a cautious 
person, but he seemed to realize that his 
behavior was cautious. In fact, one reason 
Walter Mitty daydreamed of daring adven- 
tures was apparently to escape from his cau- 
tious, henpecked existence. He dreamed of 
being more daring than he saw himself as 
being in real life. Thus, Walter Mitty-ness 
may be viewed as a tendency to anticipate 
being riskier than one sees himself as having 
been after the fact. This is the kind of Walter 
Mitty-ness investigated in Study II. 


Method 


One hundred and twenty male students partici- 
pated in two-man teams in the Tactical and Negoti- 
ations Game (TNG). Thirty teams were composed 
of two high risk takers and 30 were composed of 
two low risk takers, according to CDQ scores. The 
TNG is a complex decision-making task developed 
by Streufert, Castore, and Kliger (1967) and Streu- 
fert, Kliger, Castore, and Driver (1967). It is an 
experimentally controlled simulation, Subjects are 
told that they are playing the game against another 
team of subjects, and that the experimenter is acting 
as judge. In reality, the strategy of the other "team" 
is preprogrammed. All information messages received 
by the subjects are constant across all teams. The 
two men in cach team serve as co-commanders for 
a mythical country involved in a simulated small- 
scale internation conflict. In attempting to resolve 


i 


, 


Es! 


the conflict, teams make decisions concerning mili- 
tary, economic, intelligence, and diplomatic matters, 

The TNG has been shown to produce perceptions 
tlent to the induction of independent 
Higbee & Streufert, 1968). Further 
this kind of procedure, and data indi- 
validity of the programmed manipulation 
1 received by subjects, are presented 
reufert (1968), Streufert et al. (1967), 
ert and Streufert (1969). 

Before beginning the TNG, subjects completed a 

Pregame scale on which they indicated on a 7-point 
sale how risky they anticipated their team would 
be during the game. The teams then played the TNG 
for five consecutive 3-hour periods (each period 
Was 6 months according to converted clocks on their 
Walls), During each of the five periods, teams made 
Written decisions on decision forms. In addition, 
they received information from the experimenters, 
concerning enemy activity and the results of their 
| Own decisions, on report forms. 
I After each of the five playing periods there was 
| 4 brief intermission, during which the subjects com- 
Pleted an interim report form, The interim report 
form contained several scale, among which were 
two of interest for the present study. On these two 
Seales the subjects responded to two questions by 
Checking a 7-point scale ranging from “very low 
Tisk” (1) to “very high risk” (7). The two questions 
Were “How risky were your team’s decisions this 
Past half year?” and “How risky do you think your 
team will be next period?” The first question pro- 
by vided an index of the subjects’ perceived riskiness 
Mor the previous period, and the second question 
Provided an index of their anticipated riskiness for 
the next period. 

In addition to the measures of perceived risk and 
anticipated risk, an index of actual risk was obtained, 
All military decisions were judged as risky or non- 
risky according to a set of objective criteria. Military 
lecisions were of prime interest, because extrapola- 
tions from laboratory research on risk taking (es- 
pecially concerning the risky shift) to real-world 
decisions have been centered in the area of military 
Policy (cf. Kogan & Wallach, 1967). Data on the 
Other kinds of TNG decisions were also collected, 
Dut have not yet been analyzed. The number of risky 
Military decisions made by each team served as a 
Measure of actual risk (other measures were also 
Tomputed, but are not necessary for this study—see 
Aligbee, 1970), 
us, the data of interest for Study II include 
(a) anticipated riskiness, as measured for the entire 

ame by the pregame scale, and before each period 
Y the interim report form; (b) perceived riskiness, 
Measured after each period by the interim report 
stm; and (c) actual riskiness, as measured by an 
Objective index of risk. 


Validation for 
tating face 
of inform 


Results and Discussion 


The first question of interest is whether 
“ete was any evidence of Walter Mitty-ness 
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TABLE 2 


MEAN Pexceiven Risk, ANTICIPATED. Risk, AND 
AcrvaL Risk rox Each CDQ CaTEGOkY 


High-CDQ| Low-CDQ 


Itn risk takers | risk takers 


Perceived risk (interim report 


forms) 4.26 4.10 
Anticipated risk (interim report 

forms) 4.96 4.05 
Anticipated risk (pregame 

scale 442 3.93 
Actual risk (no. risky decisions 

per period) 1.27 1.05 


in the subjects’ behavior. Specifically, was 
there a consistent difference between perceived 
riskiness for last period and anticipated riski- 
ness for next period? That is, did subjects 
anticipate being more risky in the future than 
they had been in the past—as Walter Mitty 
dreamed of being more daring than he actually 
was in real life? 

The mean perceived and anticipated risk 
levels for all five periods combined (as mea- 
sured by the interim report forms) are con- 
tained in Table 2, both for high-CDQ risk 
takers (low scorers) and for low-CDQ risk 
takers (high scorers). The results indicated 
that subjects did tend to anticipate being 
more risky each coming period than they felt 
they had been during the last period. This 
was true both for high-CDQ risk takers (F — 
39.10, ? < .001) and for low-CDQ risk takers 


TABLE 3 


Mean PrncErvED Risk rog Last PERIOD AND MEAN 
Anticrpatep Risk ror Next PERIOD ror Eacr 
CDQ Carecory, By PERIODS 


Period 
Risk 
1 2 3 A 5 
Low CDQ risk takers 
Perceived | 3.2014 | 4.02.4} 4.45 440 | 443 
Anticipated | 4.62 f | 4.58 4.78 | 4.60 | 4.68 
High CDQ risk takers 
Perceived | 3.124 | 4.351, | 4.55% fay 4.67 
Anticipated |4,70f | 5.10f | 5.15f | 5.12 4.75 


* Difference between perceived risk and anticipated risk is 
significant (p < .05). 
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TABLE 4 


Mean Nuuser or Risky Decisions Mane ty 
Att Teams pure Eacn Perion 


^ 


Period 
ltem 


LI 2 3 * 5 


42 | 140 | 140 | 143 | 1.15 


No. decisions 


(F = 34.52, p < 001). The mean perceived 
and anticipated risk levels for each of the five 
playing periods are presented in Table 3. 

The results thus indicate that all subjects, 
regardless of CDQ score, consistently tended 
to anticipate being more risky next time than 
they were last time. This effect may be termed 
a “just-wait-until-next-time” effect. It is some- 

t what analogous to Walter Mitty’s daydreams, 
! in which he dreamed of being more daring and 
adventurous than he actually was in real life. 

If the just-wait-until-next-time effect were 
based on accurate perceptions of reality, then 
we would expect the subjects’ actual risk 
levels to increase each period. That is, if sub- 

i jects anticipated being more risky each pe- 
> riod, then acted in accordance with their 
anticipations, we would expect an increase in 
actual riskiness throughout the five periods. 
However, the mean number of risky decisions 
did not significantly (p < .05) increase after 
Period 1 (see the means presented in Table 4). 
Thus, the just-wait-until-next-time effect 
was apparently not based on the subjects’ 
accurate appraisal of their behavior, but 
rather appears to be a kind of Walter Mitty- 
ness—a tendency for subjects to think that 
they will be more risky in the future than 
they have been, without actually becoming 

more risky in their behavior. 

As was mentioned previously, Slovic (1962, 
1964) has suggested that riskiness may not be 
a general personality trait, since different 
measures of the construct lack convergent 
validity, However, Kogan and Wallach (1964, 
1967) have suggested that demonstration of 

validity for particular types of people, rather 
than for people in general, would be sufficient 
to indicate the existence of a construct, They 
found, for example, that a trait of “riskiness” 
may be applicable to people who are high in 
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“motivational disturbance” (anxiety and de- 
fensiveness) . 

The findings of the present study on Walter 
Mitty-ness suggest another possible modifica- 
tion of Slovic’s conclusion. Whether or not 
there is a personality trait of actual riskiness, 
there may be a trait of imaginary, or fan- 
tasied, riskiness. The existence of such a trait 
can be determined from the relationship be- 
tween CDQ scores and anticipated riskiness 
in the TNG. 

The results indicated that there was no 
difference between high-CDQ groups and low- 
CDQ groups in actual riskiness (F — 1.30). 
In addition, there was no difference between 
the CDQ categories in perceived riskiness 
(F « 1). The means upon which these analy- 
ses were based are contained in Table 2. How- 
ever, there was a tendency for high-CDQ 
groups to differ from low-CDQ groups in 
anticipated riskiness (the mean anticipated 
risk levels for each CDQ category are con- 
tained in Table 2). On the pregame scale, 
high-CDQ risk takers anticipated a signifi- 
cantly higher level of risk than did low-CDQ 
tisk takers (F = 4.46, p < .05). On the in- 
terim report forms, high-CDQ risk takers also 
tended to anticipate a higher level of risk 
than did low-CDQ risk takers, but this effect 
was not quite as strong (F = 3.23, p < 07). 
The results were at least consistent in indi- 
cating that while there was no difference be- 
tween the CDQ groups in actual risk or in 
perceived risk, there was a difference in antici- 
pated risk. High-CDQ risk takers tended to 
anticipate being more risky than did low-CDQ 
risk takers. 

Further evidence for this conclusion may be 
obtained by examining the correlations be- 
tween CDQ scores and the various risk mea- 
sures Actual risk showed a nonsignificant 
correlation with CDQ scores (r = —.15, ns), 
as did perceived risk (r = —.10, ns). How- 
ever, the correlation between anticipated risk 
(pregame scale) and CDQ scores was signifi- 


* In calculating the correlations for actual risk, the 
risk score of each two-man team was paired with 
the mean CDQ score of the team (N — 60, df= 58). 
However, since the perceived and anticipated risk 
Scores were obtained for each individual, they were 
paired individually with CDQ scores (N — 120, df — 
118). 
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int (r = —.25, p < .01), and the correlation 
e anticipated risk (interim report 
) and CDQ scores fell just short of sig- 
cance (r — —.16, correlation of .164 re- 
ed for $ < .05). These correlations paral- 
the results of the analyses of variance dis- 
cussed above, and give some numerical idea 
of the strengths of the relationships discussed. 
The relationship between anticipated risk 
Land CDQ scores makes sense when we con- 
sider the similarity of the CDQ task to antici- 
"pated risk in the TNG. In both cases the 
“subjects are making judgments regarding hy- 
pothetical risk. The CDQ score indicates how 
risky a subject anticipates he would be in 
Certain situations. Similarly, the scales used 
in the present study indicated how risky sub- 
ects anticipated they would be in the TNG 
Situation. 
— As described above, high-CDQ risk takers 
lended to anticipate being riskier than did 
low-CDQ risk takers, but they were not 
- actually riskier, nor did they perceive them- 
Selves as having been riskier when looking 
"back on their behavior. This relationship be- 
een CDQ scores and anticipated risk sug- 
gests that the CDQ may be a more appro- 
i priate indicator of anticipated riskiness than 
of actual riskiness in complex decision-making 
_ behavior (even though CDQ scores have been 
found to be related to actual riskiness in sim- 
Ple tasks—cf. Wallach & Kogan, 1965). 
_ Thus, the CDQ (and possibly other mea- 
ures of hypothetical risk taking) might be 
viewed as a measure of Walter Mitty-ness. 
Subjects who advise risky behavior in the 
- CDQ situations may be more prone to think 
t they would be risky in real-life situations, 
but may not actually be more risky than sub- 
jects who advise cautious behavior on the 


.. Contrary to the present results on Walter 
itty-ness, Cameron and Myers (1966) found 
‘ho difference between imaginary and actual 
Tiskiness in a simple betting task. The im- 
Portant difference between studies such as 
eirs and the present study is probably the 
mplexity of the task. For example, Higbee 
d Streufert (1969) found that the relation- 
‘Ship between riskiness and perceived control 


tn a complex task (TNG) may be different 
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from that often found in simple laboratory 
betting tasks, Similarly, contrary to findings 
that CDQ scores are related to betting or 
other simple risk-taking tasks (cf. Maehr & 
Videbeck, 1968; Wallach & Kogan, 1965), 
the results of the present study indicated that 
CDQ scores were not related to riskiness in 
a complex task. Such differences suggest the 
potential importance of studying actual riski- 
ness in complex situations when possible. 

In summary, two conclusions are indicated 
by the results of this research: First, Walter 
Mitty-ness seems to characterize people not 
only in hypothetical risk-taking decisions 
(CDQ) but also in actual risk-taking be- 
havior in a complex decision-making situation 
(TNG). The type of Walter Mitty-ness found 
on the CDQ involves cautious decisions which 
are perceived as risky. The type of Walter 
Mitty-ness found in the TNG involves one's 
anticipating that he will be more risky than 
he sees himself as having been when looking 
back on his behavior. The second conclusion 
indicated by this research, somewhat more 
tenuous than the first, is that the CDQ may 
be a more appropriate measure of how risky 
a person thinks he will be in real life than it 
is of how risky he actually will be. Thus, the 
CDQ (and other measures of hypothetical 
risk) may be a measure of a personality trait 
of Walter Mitty-ness. 
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self as significantly more risky than othe 


a difference between S-O and O-S orde: 


Perhaps the most successful explanation of 
the “risky shift” effect of group discussion 
has been Brown’s (1965) value hypothesis. 
According to this hypothesis, a cultural value 
for risk leads individuals to want to be at 
least as risky as their peers. In the course of 
discussion, some individuals find that they are 
not so risky, compared to other group mem- 
bers, as they thought they were and wanted 
to be; these individuals then shift to more 
risky options. 

Important support for the value hypothesis 
has come from a number of studies where 
subjects answered the Kogan and Wallach 
( 4064) risk dilemmas first for themselves and 
then for “others,” Brown (1965), Levinger 
and Schneider (1969), Stoner (1968), and 
Wallach and Wing (1968) all found that con- 
sistent with the value hypothesis, subjects see 
others as less risky than themselves. 

Although accepted as support for the funda- 
mental assumption of the value hypothesis, 
the self-other (S-O) difference is susceptible 
to alternative explanation. Perhaps, for in- 
Stance, the S-O difference is caused by some 
Species of demand effect created by experi- 
Menter instructions or repeated exposure to 
the risk dilemmas. If set is behind the S-O 
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was more than twice the difference in the O-S order. The 
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Subjects answered the risk dilemmas for themselves and for “others” in two 
different orders: self-other (S-O) and other-self (O-S). Both orders showed 


T, but the difference in the S-O order 
authors argue that 
ts cannot be explained by the value 


hypothesis without additional assumptions. 


difference, then reversing the usual S-O order 
to O-S—having subjects answer first for others 
and then for self—might eliminate or even 
reverse the usual S-O difference. On the other 
hand, the value hypothesis implies no dis- 
crepancy in results between the S-O and the 
O-S order. Tn either order, the value hypothe- 
sis predicts that subjects’ responses will re- 
flect the same motivation to see themselves 
as more risky than others, 


METHOD 


Six items known to produce reliable risky shifts 
were chosen from the original Kogan and Wallach 
Choice Dilemmas Questionnaire (Kogan & Wallach, 
1964). Items 1, 4, 6, 7, 8, and 9 were used. Two 
questionnaire forms were prepared, each form pre- 
senting the six items twice. Form S-O asked each 
subject to answer all six items for himself (standard 
instructions), and then to answer all six items again 
with “the minimum odds of success that most people 
would demand.” Form O-S asked each subject to 
answer the six items first for “most people” and 
then again for himself. 

The two forms were randomly mixed and passed 
out in two undergraduate psychology classes. A total 
of 45 females and 20 males returned completed ques- 
tionnaires from the two classes: 29 of the S-O form 
and 36 of the O-S forms. 


RESULTS 


The minimum-odds (out of 10) responses 
were multiplied by 10 to get a risk score on 
each item, where a lower risk score implied 
greater risk. Each subject got a self-risk score, 
the sum of the six items answered personally, 
and an other-risk score, the sum of the six 
items estimated for others, Table 1 presents 
the means, across subjects, of self- and other- 
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TABLE 1 


Mean Inpivipuat Risk Score on Sıx ITEMS, 
FOR SELF AND Orner, BY CONDITION 


Condition | Self 


| Other | Difference 
S-O order* 24 | 334 | 60 
O-S order^ 28.3 | 30.7 24 
sN = 29, n 
bN = 36. 


risk scores, for the two forms of the question- 
naire. Table 2 indicates that others were per- 
ceived as significantly less risky than self for 
both questionnaire orders. 

Table 2 also indicates a strong trend for 
the S-O difference to be larger than the O-S 
difference. In fact, the S-O difference is more 
than twice the O-S difference. It is interesting 
to note that the greater difference for the 
S-O order is largely a tendency to see others 
as more cautious in the S-O order than in 
the O-S order (t = 1.25, df = 63, ns). 


Discussion 


Others were seen as significantly less risky 
than self in both the S-O and O-S orders. 
This result is consistent with the “risk as 
value” interpretation of the well-established 
S-O difference. On the other hand, the very 
strong trend for the O-S difference to be less 
than the usual S-O difference is not predicted 
by the value hypothesis. Furthermore, this 
Strong trend in our results ought to be taken 
seriously, because it echoes a similar result in 
a study by Pruitt (1969). 

Pruitt asked each subject for his own choice 
on each dilemma and also for a judgment, on 
each dilemma, as to which of the alternative 
choices were “risky” and which “cautious,” 
Pruitt found that subjects selected decisions 
they had labeled as cautious when judgment 
about riskiness of the options preceded per- 
sonal choice, but labeled approximately the 
same decisions risky when the judgment came 
after the choice (the “Walter Mitty effect”). 
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'TABLE 2 
DIFFERENCES IN RISK TAKING ON Six ITEMS 


Comparison difference * | S 
Self vs. other in S-O 6.0 | 28 | 4.49088 
Self vs. other in O-S 24 35 1.75% 
S-O vs. O-S difference 1.88* 


3.6 | 63 


* p <.10 (two-tailed). 
** p « .05 (one-tailed). 
*** > € .01 (one-tailed). 


In both the Pruitt study and the present 
study, subjects made decisions and gave in- 
formation reflecting on the riskiness of the 
decisions. In both studies, the information 
made the decisions appear riskier when the 
information followed the decision than when 
the information preceded the decisions, These 
common results are not predicted by the value 
hypothesis, at least without additional as- 
sumptions, because nothing about the value 
hypothesis suggests that the motivation to 
appear relatively risky is contingent on order 
effects, 
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their peers. However, 


The most common explanation advanced to 
account for the risky-shift phenomenon is the 
"risk-as-value" hypothesis. As formulated by 
Brown (1965), this interpretation makes the 
key assumption that moderate risk is a 
strong cultural value, so that most individuals 
come to view themselves as being at least as 
willing as their peers to take risks. When a 
group of such persons meets and discusses 
Matters of risk taking, those whose initial 
private positions were actually less risky than 
the other group members are made to recog- 
nize their relative cautiousness; subsequently, 
they recommend greater risk than earlier, 
Presumably to restore their perception of 
themselves as relatively risky. On this view, 
the principle function of the group discussion 
is to allow group members to compare their 
Positions on the issue to those of other people. 

There is considerable evidence that most 
People do, in fact, perceive themselves to be 
at least as willing as their peers to take risks. 
Hinds (1962) found that male graduate stu- 
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Two experiments tested whether the risky-shift 
cultural value of risk and by the knowledge 
others. As the risk-as-value hypothesis predicts, a significant risky shift was 
found only for subjects who perceived themselves to be 
a significant risky shift occurred even in discussion 
groups when subjects could not reveal their 
tion of this finding to the risk-as-value hypothesis is discussed. 
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phenomenon is mediated by the 
of the specific preferences of 


at least as risky as 


preferred risk levels. The implica- 


dents in industrial management consistently 
guessed that others would choose more cau- 
tiously than had the subjects themselves. 
Brown (1965) found in an undergraduate 
class of 30 and in a graduate seminar of 16 
students that not 1 subject guessed that oth- 
ers would answer a choice-dilemmas problem 
more risky than himself. Wallach and Wing 
(1968), Levinger and Schneider (1969), and 
Willems (1969) have found similar results, 
Thus, the data unequivocally reveal a strong 
and pervasive tendency by persons of both 
sexes to view themselves as being at least as 
risky as their peers. 

From this risk-as-value hypothesis, a pre- 
diction may be made concerning individual 
differences in susceptibility to the risky shift: 
Subjects who perceive themselves to be more 
cautious than their peers should not change 
toward risk; those who perceive themselves 
as relatively risky should show strong and 
consistent shifts. This follows from the as- 
sumption that the former subjects do not 
value risk, while the latter subjects do value 
it. Experiment I was designed to test this 
hypothesis. 


EXPERIMENT I 
Method 


Subjects. A total of 67 male undergraduates at the 
University of Kansas participated in the experiment, 
All were volunteers and received research credit for 
their participation. 

Procedure. Risk taking was measured by the 
choice-dilemmas instrument developed by Kogan and 
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TABLE 1 


PRETEST AND POSTTEST MEAN Scores, SUMMED ACROSS 
THE SiX ITEMS, FOR THE THREE 
CONDITIONS IN EXPERIMENT I 


| Cautious Ss 


Test Risky Ss Control Ss 
Pretest 32.58 | 4300 35.04 
Posttest 26.67 41.63 33.48 


Wallach (1964), which was described to subjects as 
an opinion questionnaire, In order to maximize the 
risky-shift effect, only the six items that had pro- 
duced the greatest amount of shift in other studies 
were used (Items 1, 3, 4, 6, 7, and 11). 

Subjects participated in groups of four, sitting in 
chairs arranged in a circle. In an initial session, sub- 
jects gave only their private responses to the choice 
dilemmas and then recorded the recommendations 
they felt the majority of other people like themselves 
would make. Subjects were then dismissed and were 
rescheduled for a subsequent experiment approxi- 
mately 8 days later. Three conditions were estab- 
lished. 

Discussion, risky subjects. Twenty-four subjects 
were selected at random from those who, on at 
least four of the six items, had previously perceived 
themselves to be at least as risky as their peers. They 
were divided at random into six groups of four sub- 
jects each, Instructions for this condition were 
identical to those given by Clark and Willems 
(1969), Subjects were told that their first responses 
had constituted a practice session to familiarize 
them with the items; they were then asked to dis- 
cuss each item for approximately 5 minutes. After 
the groups had discussed all six items, subjects once 
more responded privately to the Choice Dilemmas 
Questionnaire. 

Discussion, cautious subjects. This condition con- 
sisted of those 16 subjects who, on at least four of 
the six items, had initially perceived themselves to 
be more cautious than their peers. They were di- 
vided at random into four groups of 4 subjects each, 
All other experimental manipulations were identical 
to the preceeding conditions. 

Control subjects. These 27 subjects consisted 
prindpally of those who did not meet the criterion 
for the other two conditions; that is, those who 
were neither riskier nor more cautious on four of 
Six items. Subjects in a large assemblage were told 
that their first recommendations had been for prac- 
tice to familiarize themselves with the items, and 
that by reconsidering each item they might be more 
confident about their judgments. They then re- 
sponded once more to the Choice Dilemmas Ques- 


tionnaire. 
Results 


A 2X2 factorial design with repeated 
measures on the second factor was employed 
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to analyze the data. Since the number of sub- 
jects in the two levels differed, an unweighted- 
means solution was used (Winer, 1962, pp. 
374-378). Table 1 summarizes the pretest 
and posttest mean scores for each of the con- 
ditions. As may be seen in Table 2, there was 
a significant main effect for experimental con- 
ditions, confirming that risky subjects were 
significantly riskier overall than cautious sub- 
jects. Clearly, then, subjects who perceived 
themselves to be relatively cautious were ac- 
tually more cautious than their peers, and 
those who perceived themselves to be rela- 
tively risky were so, in fact. 

The main effect for pretest-posttest re- 
sponses verified that subjects? posttest re- 
sponses were riskier than their pretest scores, 
indicating significant risky shifts. More in- 
teresting, however, is the interaction between 
conditions and pretest-posttest responses. The 
Newman-Keuls test showed that this inter- 
action occurred because risky subjects shifted 
significantly toward risk, whereas cautious 
subjects did not. 

The result of a £ test, which tested the av- 
erage change scores within the control group, 
yielded insignificant results (t= 1.94, df = 
26). 


Discussion 


It is evident, then, that the results support 
the risk-as-value hypothesis, for greater shift 
toward risk occurred among subjects who per- 
ceived themselves to be at least as willing 
as their peers to take risks. A question re- 
mains as to the processes by which these dif- 
ferences between groups were mediated. Ac- 
cording to the risk-as-value hypothesis, the 
principal function of the group discussion is 
informational: the subject discovers that he 
is not as risky on some items as are others 
in his reference group, and he changes his 
judgment in order to return the relationship 
between his own position and the perceived 
group norm to its optimal level. According to 
this strict interpretation of the hypothesis, a 
mere exchange of information with others, 
without any discussion of the reasons for their 
judgments, should be as effective an induce- 
ment to shift toward risk as is the personal 
involvement in group discussion. 


Risx-As-VaLuE HYPOTHESIS 


In experiments which have tested the effects 
of exchanging judgments without verbal justi- 
fication for them, a significant risky shift has 
occurred, but one which is likely to be smaller 
in magnitude than is the typical discussion 
conditions. Thus, Kogan and Wallach (1967) 
found significantly greater shifts among sub- 
jects who discussed the choice dilemmas than 
among subjects who listened to tape-recorded 
discussions of other groups. Similarly, Teger 
and Pruitt (1967) observed greater risky shift 
in the typical discussion condition than in the 
information-exchange condition without dis- 
cussion. 

Teger and Pruitt (1967) interpreted this 
outcome by arguing that the usual discussion 
procedure not only provides information about 
other subjects’ positions on these items, but 
also increases the saliency of the norm of 
riskiness, That is, in the arguments that are 
given in discussions of the choice dilemmas, 
those who are objectively less risky become 
increasingly aware of the value that others 
place on risk and, therefore, are more sus- 
ceptible to the effects of information about the 
group’s judgments. 

Experiment II was undertaken in an at- 
tempt to sort out these influences, knowledge 
of others' risk preferences and knowledge of 
arguments only, by comparing the typical 
information-plus-discussion condition with two 
Other conditions: one where subjects ex- 
Changed judgments without discussion, a sec- 
ond where they discussed the alternatives 
Without expressing their own preferences. If 
the risky shift is solely mediated by the 
knowledge of others’ risk preferences, the 
information- -exchange and regular-discussion 
conditions should yield shifts of the same 
Magnitude. From Teger and Pruitt's inter- 
pretation we would expect larger shifts toward 
tisk in the information- plus-discussion condi- 
lion than in the information- exchange condi- 
tion. If, as Madaras and Bem (1968) postu- 
lated, the risky shift is solely mediated by 
the preponderance of arguments in the favor 
of the risky alternatives, we would expect no 
risky shift in the information- -exchange con- 
dition and a risky shift of the same magnitude 
in the two discussion conditions. 
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TABLE 2 
Summary OF ANALYSIS OF VARIANCE 
FOR EXPERIMENT I 
Source SS df MS F 
Between Ss 
Conditions (A) 766.55| 1 | 766.55 | 25.84** 
Error between 237.38| 8 29.66 
Within Ss 
Pretest-posttest (B)| 63.12] 1 63.12 | 15.32** 
AXB 2451| 1 2451| 5.95* 
Error within 3295| 8 4.12 
* 
=b So 
EXPERIMENT II 
Method 


Subjects. A total of 120 male undergraduates par- 
ticipated in randomly assigned groups of 5, with 8 
such groups (or 40 subjects) in each of the three 
experimental conditions. 

Procedure. The procedure was very similar to the 
one used in the preceding experiment. Subjects first 
made their own recommendations on the Choice 
Dilemmas Questionnaire and then recorded the rec- 
ommendations they felt most people like themselves 
would make. Only those subjects who perceived 
themselves to be at least as risky as their peers on 
four of the six items were asked to return for a 
second meeting, held approximately 8 days after 
the first administration of the questionnaire. 

Three experimental conditions were established: 

Information exchange. Subjects were told that their 
first recommendations had been for practice. In- 
structions similar to those given by Teger and Pruitt 
(1967) were employed, and subjects in each group 
shared their recommendations by holding up cards 
for each item so that other group members could 
see them. To assure full exchange of information, 
the responses to each item were presented three 
times, each time without verbal discussion (cf. also 
Clark & Willems, 1969). 

Group-discussion condition. The procedure was 
identical to the discussion, risky-subjects condition 
in Experiment I. 

Arguments-only condition. The procedure was 
identical to the group-discussion conditions, except 
that subjects were not allowed to reveal their pre- 
ferred risk level probabilities on any item during 
the discussion. The subjects were instructed to dis- 
cuss the positive and negative aspects of the various 
alternatives to each item, without indicating the odds 
they had selected. Observation of the group discus- 
sions confirmed subjects’ adherence to the instruc- 
tions, not to mention numerical probabilities. 


Results 


A 3 X 2 factorial design with repeated mea- 
sures on the second factor was used to analyze 
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TABLE 3 


Pretest AND Posrrest Mean Scores, SUMMED ACROSS 
THE Six ITEMS, ror THE THREE ÉXPERIMENTAL 
CONDITIONS IN EXPERIMENT II 


Test 


| exchange 
Pretest | - 3135 3030 | 30.45 
Posttest | 25.50 24.80 | 28.55 


the data. Pretest and posttest mean scores are 
presented in Table 3. As can be seen in Table 
4, there was a significant main effect for the 
difference between pretest and posttest scores, 
indicating significant risky shifts. However, 
there was also a significant interaction, indi- 
cating that the magnitude of shift differed 
among conditions. The Newman-Keuls test 
showed that both the group-discussion and 
arguments-only groups shifted significantly to- 
ward risk, and there was a nonsignificant 
tendency for information-exchange groups to 
shift toward risk. The final means of both of 
the group-discussion and the arguments-only 
conditions were significantly riskier than that 
of the information-exchange condition. 


Discussion 


The results of Experiment II appear to 
be inconsistent with Brown’s information- 
exchange per se interpretation of the risk-as- 
value hypothesis, for if risk is a value, and 
subjects perceive themselves to be at least as 
willing to take risks as their peers, they should 
need to reevaluate their own positions only 
when discovering that others had chosen 
riskier alternatives than they. Thus, a sig- 


TABLE 4 


SUMMARY OF ANALYSIS OF VARIANCE 
FOR EXPERIMENT IT 


Source SS df MS F 
Between Ss 569.59 | 23 
Conditions (A) 30.59| 2| 15.30 «1 
Error between 539.00 | 21 | 25.67 
Within Ss 509.12 | 24 
Pretest-posttest (B) | 235.85 | 1 | 235,85 24.29** 
AXB 7 69.42 | 2| 3471| 3.57* 
Error within 203.85 | 21| 9.71 


* p «.05. 
**p «01. 
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niñcant risky shift should not have occurred 
in the arguments-only condition. The hy- 
pothesis can be rescued, however, by assum- 
ing that listening to and participating in a 
discussion of arguments in favor of risk, even 
without clear commitment from each subject 
to an alternative, provides subtle but unequi- 
vocal information about each participant’s 
choices. In this way, each discussant can learn 
that his peers are riskier than he had thought. 
There is evidence that such inferences about 
others’ judgments were actually drawn. Sub- 
jects indicated before and after discussion of 
the choice dilemmas what they thought the 
majority of people like themselves would 
mark for each item. In both the group-dis- 
cussion and the arguments-only conditions, 
subjects changed their estimates to represent 
other people as riskier than they had initially 
been (p < .01); in each case estimates in the 
two conditions did not differ significantly from 
each other either before or after discussion. 
Thus, the data do suggest that the mediation 
of the cultural value of risk through the 
specification of probabilities is still tenable. 
Nevertheless, a more adequate test is needed 
to substantiate this view. 

Even if subjects can infer reliably the risk 
positions of others from listening to their 
arguments, the failure to find significant 
shifts in the information-exchange condition 
is embarrassing for the information-exchange 
per se interpretation of the risk-as-value hy- 
pothesis. The exchange of information should 
have been enough to show the more conserva- 
tive persons that they were not as risky as 
most others in the group; those subjects 
should then have shifted toward risk. Yet, this 
was not what happened. Since Teger and 
Pruitt (1967), Clark and Willems (1969), 
and Willems and Clark (1969) have found 
significant risky shifts in information-exchange 
conditions, knowledge of others’ risk prefer- 
ences can not be completely ruled out as a 
mediator of the risky shift. The present re- 
sults, however, favor the “knowledge of 
others’ arguments” interpretation of the risk- 
as-value hypothesis. It remains to be seen 
whether this process is independent of or 
operates concurrently with the exchange of 
specific risk levels, 


The results of these two experiments indi- 
ate that the shift occurred principally, as 
risk-as-value hypothesis predicts, among 
ects who perceived themselves to be rela- 
risky. However, the shift occurred even 
s subjects heard no direct statements 
about others’ preferences, and was greater in 
each discussion condition than in the informa- 
'tion-exchange without-discussion condition. 
_ For the information-exchange per se interpre- 
ation of the risk-as-value hypothesis to re- 
n a plausible explanation of these results, it 
st be demonstrated that subjects can infer 
ably from the group discussion alone the 
eferred positions of the others. Meanwhile, 
interpretation of Madaras and Bem 
968) and of Teger and Pruitt (1967), that 
group discussion engages the value of risk, 
addition to providing information about 
lerence-group judgments, is strongly sup- 
Ported. It may turn out, in fact, that the 
engagement of this value alone, without in- 
formation concerning others’ preferences, 
would be sufficient to produce the shift to- 
ward risk. 
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RISKY SHIFT AS A FUNCTION OF GROUP MEMBERS' VALUE 
OF RISK AND NEED FOR APPROVAL' 
HELMUT LAMM, EDITH SCHAUDE, axo GISELA TROMMSDORFF 


University of Mannheim, West Germany 


Eighty-eight female subjects were assigned to dyads homogeneously composed 
with regard to their valuation of risk acceptance (low or high) as well as their 
need for approval (low or high). The dyads discussed each of a set of 
hypothetical decision situations involving risk taking and, at the end of each 
discussion, indicated the risk levels acceptable for them. High-value dyads 


shifted significantly more toward risk, relative to their prior decisions (2 X 2 


analysis of variance), As intended, the study demonstrates that risk as a 
value plays a causal role in the group risky-shift effect. Other results are 
brought to bear on the parsimony of recent formulations of the value theory 


of risky shift. 


“Value theory” explains the group risky- 
shift phenomenon as a consequence of the 
positive value typically placed on risk accep- 
tance, in conjunction with group discussion as 
a vehicle for attaining the value (Brown, 
1965). 

Levinger and Schneider (1969) have shown 
the first supposition—that risk is a value—to 
be true. When asked about the level of risk 
acceptance that they considered as most ad- 
mirable, college students indicated higher lev- 
els of risk than they had accepted in their 
own previous decisions. (It was anticipated 
that the same would be found in the present 
study.) Additional support comes from a 
study by Madaras and Bem (1968), who 
found more positive evaluations (ratings) be- 
ing made of risk-accepting than of cautious 
stimulus persons. 

Even with the evidence just noted, it re- 
mains to be demonstrated that the positive 
value of risk acceptance is a causal factor 

1 This research is part of the program of the Re- 
search Center for Social and Economic Psychology 
(Sonderforschungsbereich) at the University of 


Mannheim and was financed by the Deutsche Fors- 
chungsgemeinschaft. 

* Requests for reprints should be sent to Helmut 
Lamm, Institut fur Sozialwissenschaiten, Universitat 
Mannheim, 68 Mannheim, West Germany. 

3No consideration will be given here to the re- 
ported finding of cautious shifts on a few items 
(hypothetical decision situations) of the kind used 
in the present study as a measure of risk taking. 
Just as it can be argued that value theory is able 
to account for shifts toward lower risk levels, so 
the argument of the present study can be applied, 
by appropriate inversion, to the (less general) phe- 
nomenon of a group shift toward caution. 
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underlying the group risky-shift effect. The 
present study is intended to establish this 
causality by showing that groups of subjects 
placing high value on risk acceptance manifest 
larger risky shifts than groups characterized 
by lower valuation of risk. 

Just as the existence of the nexus between 
the value of risk and the shift toward risk has 
not been empirically established, so its nature 
—the social-motivational process bringing 
about the risky-shift outcome—has not been 
investigated conclusively. A formulation that 
is attractive by its applicability beyond the 
risk-taking domain has been proposed by 
Levinger and Schneider (1969), who found 
that subjects typically consider themselves as 
more willing than their peers to accept risk. 
This finding has been reported in other recent 
studies (Ferguson & Vidmar, 1970; Stoner, 
1968; Wallach & Wing, 1968), and the pres- 
ent study was designed to confirm it with 
German subjects. As Levinger and Schneider 
(1969) suggested, discussion provides a new 
(more risky) reference point to group mem- 
bers, who are thus “freed to make a new 
compromise between their ideal and the 
newly perceived point of reference, and their 
choices can shift closer to their ideal [p. 
168]." That the reference point (estimate of 
peers’ risk acceptance) indeed changes toward 
higher risk levels as a result of discussion has 
been shown by Ferguson and Vidmar (1970). 

Two questions may be raised in regard to 
the above considerations. First, is the shift in 
reference point—the “eye-opener” effect—es- 
sential to the validity of the value theory? 
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Brown (1965, p. 705) himself has suggested 
another mediating mechanism, invoking the 
bias in discussion content presumably taking 
place arguments are selected and formu- 
lated by discussion participants. (This propo- 
sition can be subsumed under the biasing ef- 
fect of values on the perception of ambiguous 
stimuli. In the present case, the risk value 
would bias the cognitions—the discussion 
points—elicited by the choice-dilemmas situ- 
ations.) It is hoped that the present study will 
shed some light on the above question. 
Second, what motivations underlie the 
desire, acted out through group discussion, to 


move closer toward ideal positions (higher 
risk levels)? Conceivably, the approximate 
attainment of ideal standards serves a need 
for social recognition or approval. To explore 
this possibility, the present study investigates 
the role of subjects’ needs for social approval. 
According to Crowne and Marlowe (1964), 
“the studies of the approval motive suggest a 
set of self-reflexive attitudes—a self concep- 
tion—in which an idealized version of the 
self . . . is maintained and defended [p. 190].” 
It may be hypothesized that groups of sub- 
jects characterized by high approval needs, 
compared to groups with low approval needs, 
would shift more markedly toward higher 
risk acceptance, because they would thereby 
move toward more highly regarded positions.* 
Confirmatory results on this hypothesis would 
Suggest that one motivational factor involved 
in the group shift mechanism is the desire for 
esteem. The inclusion of need for approval as 
an experimental variable was frankly explora- 
tory, particularly in view of its complexity as 
à personality construct (see Crowne & Mar- 
lowe, 1964, p. 201) and in view of Kogan and 
Wallach's (1967) finding of a. negative asso- 
ciation between need for approval and risky 
shift (p < .09), 


*It is assumed here—but not crucial to our main 
argument—that subjects, on decisions they. them- 
Selves consider as ideal (deserving of highest re- 
gard), believe the ideal preferences of their peers to 
e in the same—more risky—direction. This assump- 
lion is easy to test, To have pertinent data collected 
from the subjects of the present experiment would 
have made them possibly too sensitive to the pur- 
Poses of the study. 


METHOD 
Subjects 


One hundred and fifty-five female students at the 
University of Mannheim (Germany) volunteered to 
participate. On the basis of the preexperimental 
measures (see below), 88 subjects were selected to 
take part in the experiment proper, for which they 
were paid. 


Risk-Taking Instrument 


The choice-dilemmas task (Kogan & Wallach, 1964, 
Appendix E) was used to measure risk taking. Items 
6 and 12 were omitted, and the remaining items were 
translated and adapted to fit the German cultural 
context (see Lamm & Kogan, 1970). Each item 
presents a hypothetical decision situation as might 
occur in everyday life, in which a protagonist has to 
choose between a relatively unattractive but safe 
course of action and a more attractive but risky 
alternative (e.g, carrying on with a heart ailment 
versus undergoing an operation). The level of 
risk taking is indicated by the minimum odds of 
success considered necessary for choosing the risky 
alternative. The 10-point risk-taking scale ranges 
from a 10% to a 100% chance that the risky al- 
ternative would be successful. 


Preexperimental Session 


The preexperimental session constituted in essence 
a replication of the three choice-dilemma measures 
obtained by Levinger and Schneider (1969)—(a) 
own decisions, (b) estimate of peer decisions, and 
(c) ideal decisions. Between 10 and 20 subjects 
participated in these sessions at a time. All assign- 
ments were carried out individually. (a) For each 
item, the subjects indicated the minimum odds they 
considered necessary before recommending the choice 
of the risky alternative. (b) They were then asked 
to go through the choice dilemmas again and indi- 
cate, for each item, the response they thought would 
be given by the majority of German students of 
their own sex. In other words, they noted down the 
minimum odds of success presumably chosen by most 
of their peers. (c) Finally, they were asked to go 
through the items a last time and indicate the deci- 
sions for which they had the highest regard (“Thre 
höchste Anerkennung finden wiirde’’) ,5 


5 The present study used a weaker term than 
Levinger and Schneider's (1969) “most admirable,” 
in order to reduce the possibility of a semantic de- 
mand factor. More concretely, the term “admirable” 
would, by its mere connotational meaning, seem to 
be less applicable to a cautious decision than would 
the phrase “deserving of high regard." Thus, the 
present measure of risk as a value is, if anything, 
more conservative than the one of Levinger and 
Schneider (1969), 
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TABLE 1 
MEANS FOR INITIAL AND EXPERIMENTAL SAMPLES As CHARACTERIZED 


BY CONTROL AND INDEPENDENT VARIABLES 


Whole sample> 


value 1 ¢ Initial sample 
Variable eae | (oA ms (N = 88) N -1 

Own decisions* 48.27 49.87 

Low need for approval | 48.50 48.59 

High need for approval 47.27 48.64 DE ME 
Peer minus own 7.54 7.30 

Low need for approval 6.00** 6.45* 

High need for approval 7.54** 9,82** "s i: 
Value (own minus ideal)* 11.28 10.75 

Low need for approval 2.82 18.32*** 

High need for approval 4.64°* 19,36°** 3 1 
Need for approval* y 15.01 15.08 

Low need for approval 11.36 11.27 

High need for approval 18.77 18.64 


* Twenty-two subjects low and 22 subjects high in need for approval, 


* Experimental sample. 
* Higher scores indicate lower risk levels. 


å No significant effects (F < 1) for either independent variable. 


* Independent variable. 

* p <.05 (two-tailed £ test). 
** p <.01 (two-tailed £ test). 
*** p € 001 (two-tailed / test). 


As a last assignment, the subjects filled out a 
German translation of Crowne and Marlowe's (1964) 
need-approval (or social-desirability) questionnaire. 


Independent Variables and Assignment to 
Experimental Conditions 


The value attached to risk taking was operation- 
alized as the extent to which the subjects placed the 
ideal at a riskier level than their own decisions. On 
the basis of this own-versus-ideal discrepancy score, 
the 155 subjects of the total sample were classified, 
by median split, as low or as high on the value 
dimension (see Table 1). 

Again by a median split on the total distribution, 
the subjects were classified as low or high on need 
for approval as measured by the Crowne-Marlowe 
scale (see Table 1). 

Since higher risk valuation was associated with 
lower own risk levels (r= 29, df = 153, p < 01), 
and since group risky shift could not be assumed to 
be independent of own (prior) risk levels, the selec- 
tion of subjects from the total sample was re- 
stricted by the requirement of keeping own risk deci- 
sions equal over the four experimental conditions 
(see Table 1). 


Procedure 


The dyadic discussions took place a few weeks 


b. after the preexperimental session. Upon arrival at 


the laboratory, each Subject received an instruction 
sheet and a booklet containing the choice-dilemmas 
descriptions. 

„The instructions asked the two participants to 
discuss each of the familiar decision situations, and 
at the end of a discussion, indicate their decisions 
(minimum odds of success considered necessary for 
choosing the tisky alternative), For the latter, they 


could draw on the considerations brought out dur- 
ing the discussion. Though each member should 
make her own decision, the exchange of views could 
of course lead to a consensus. It was further pointed 
out that the discussions would be tape recorded to 
be played later to a group of coeds, They could be 
sure of anonymity throughout those replays. (The 
instructions revolving around the tape recordings 
were intended to provide a rationale for the dyadic 
discussions. In addition, the mention of a future 
audience was intended to induce a sense of a larger 
group presence.) 

The subjects were asked to discuss each item 
thoroughly before making their decisions but to take 
no more than 5 minutes for each item, The female 
experimenter noted down the decisions announced at 
the close of a discussion. If the latter had to be 
broken off because the time limit was reached, the 
experimenter asked the participants to state their 
decisions in alternating order. 


RESULTS 


Own, Peer, and Ideal Decisions and Their 
Interrelations in the Initial Sample 


Table 1 shows that the subjects ascribed 
more cautious decisions to their peers, rela- 
tive to their own decisions (5 < .001). As is 
further evident from Table 1, the decisions 
held in highest regard (ideal decisions) were 
located at higher risk levels than the subjects’ 
own decisions (5 < .001). 

The following correlational results were ob- 
tained by computing a correlation coefficient 
for each of the 10 items, resulting in an 
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"average" r after appropriate z transforma- 
tions. Significant positive correlations (p < 
01 or better) were found between own and 
peer, own and ideal, and peer and ideal deci- 
sions (r = .41, r = .52, r = .26, respectively). 
Further, a significant negative correlation was 
found between peer-minus-own and own- 
minus-ideal decisions (r= —.34, p< 01), 
indicating that the more the subject assumes 
she is riskier than her peers, the smaller the 
distance she indicates between her own actual 
and her ideal decisions. In other words, the 
more the subject thinks she excels her peers 
in risk acceptance, the more she considers 
herself as being in line with her ideal. (It 
may be of interest to note that this correla- 
tion did not approach significance when com- 
puted on total choice-dilemmas scores.) 

None of the correlations between need for 
approval and the variables mentioned above 
(based on total choice-dilemmas scores) ap- 
proached significance. 


Results Involving Risky Shift 


As is shown in Tables 2 and 3, subjects 
characterized by a higher value of risk (higher 
discrepancy between own and ideal, more 
risky decisions) manifested larger risky shifts 
aiter dyadic discussion than those in the low 
value category (F = 4.46, p < .05), confirm- 
ing the principal experimental hypothesis. 

With the exception of the low-value, high- 
need-approval condition, the risky shifts were 
Significant for the various dyadic types ($ < 
05 or better; see Table 2), 

In addition to the analyses presented in 
Tables 2 and 3, correlations were performed 
between the own-peer discrepancy score and 
tisky shift. The correlation was not signifi- 
cant when based on total choice-dilemmas 
Scores. But when performed on each of the 
items, the 10 coefficients yield an average 7 of 
—91 (p < .01, after appropriate z transfor- 
‘Mations). In other words, the more pro- 
Tounced the tendency to see oneself as riskier 
P the peer group, the smaller the shift 
toward higher risk levels. This result, sur- 
Prising at first glance in its apparent contra- 

iction to the value-theory prediction, be- 
Comes more plausible when one considers, on 
* one hand, the positive correlation (r — 
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TABLE 2 
MEANS AND STANDARD DEVIATIONS 
on Dvapic Risky Surrr 
T. ae — 


| Low value | High value 


| X | sp | x. 3n 


209 | 9.90 |845" | $39 
3.77" | 4.69 |5.27* | 7.70 


——— 
Low need for approval 
High need for approval 


Note.—Eleven dyads per cell. 
* Total choice-dilemmas shift, per individual, 
* p € .05 (two-tailed), 

** p € .01 (two-tailed). 


32, df — 86, P « 01) between own pre- 
dyadic decisions and risky shift (ie. more 
cautious subjects shifted more markedly) and, 
on the other hand, the negative correlation 
(r = —.39, df — 86, p< .01) between own 
predyadic decisions and relative underestima- 
tion of peers’ risk acceptance (ie, the 
more cautious subjects were less likely to 3 
believe their peers to be more cautious than 
themselves). When own predyadic decisions E 
are partialed out, the resulting partial corre- 

lation between relative underestimation of 
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peers’ risk acceptance and risky shift becomes 4 
negligible. When based on dyadic scores (V if 
= 44), the correlation is negligible even 


j 
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without partialing out subjects’ own predyadic 
decisions. The above correlational patterns 
are the same (though at lower levels of sig- 
nificance) for each of the four experimental 
conditions (22 subjects in each case). 


a 


DISCUSSION 


The finding of Levinger and Schneider 
(1969)—that subjects think they accept more 
risk than their peers and that, on the other 
hand, they accept less risk than they consider 
ideal—is replicated here with German sub- 
jects, thus confirming the validity of the cen- 
tral and a subsidiary assumption contained in 


TABLE 3 
ANALYSIS OF VARIANCE 


b 
Source df MS F m 
Value (A) 1 170.05 4.46" A 
Need for approval (B) 1 6.19 16 ! 
AXB 1 48.64 1.27 
Error 40 38.17 
*» «.05. 


434 


the value theory of the risky-shift phenome- 
non, 

More important, the present study provides 
a direct and successful test of the proposition 
that the risky-shift effect is indeed a function 
of risk as a value.* This causal relation has 
heretofore not been empirically demonstrated. 

No evidence was found concerning any ef- 
fect on risky shift of the subjects’ underesti- 
mation of their peers’ (relative to their own) 
risk acceptance. However, with mere two-per- 
son groups, the present experiment may have 
provided a rather inefficient design for uncoy- 
ering any effect of the peer-estimation factor: 
it is doubtful that a single peer-group mem- 
ber’s display of higher than expected risk 
acceptance has sufficient impact to change 
one’s reference point in the risky direction. 
It may be concluded, then, that the present 
design has been too weak to allow the social- 
comparison factor (peer estimation) of the 
value theory to come into play. Stronger 
experimental procedures (larger sized groups 
and direct manipulation) are needed to in- 
vestigate that factor as a possible contributor 
to the risky-shift effect. On the other hand— 
and more important at this point—consider- 
ing that the value of risk did have the pre- 
dicted effect on risky shift in the present 
study, one is led to conclude that the social- 
comparison component (underestimation of 
peers) is not necessary for value theory to 
be valid. 

The approval motive had no effect on risky 
shift. In explanation, consider that the need 
for approval may motivate subjects to present 
themselves as in line with normal standards 
(see Crowne & Marlowe, 1964, p. 202) as 


ë Vidmar (1970) has shown that greater hetero- 
geneity (discrepancy) among group members’ initial 
choice-dilemmas decisions is associated with greater 
risky shift. To ascertain whether the present risky- 
shift differences were not caused by such differential 
heterogeneity, an approximative index of hetero- 
geneity was obtained for each of the four sets of 
experimental dyads by computing, for each item, 
the standard deviation of own decisions over the 
respective 22 individuals. The low- and high-value 
conditions did not differ on that index. The average 
(per item) standard deviations were 1,95, 2.01, 1.94, 
and 197 for the low-low, low-high, high-low, and 
high-high value and need-approval conditions, re- 
spectively, 
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well as with ideal standards oí behavior 
(Crowne & Marlowe, 1964, p. 190; Pervin & 
Lilly, 1967). In the present experimental sit- 
uation, these two orientations may be seen as 
represented by the respective measures of 
peer-own and own-ideal position discrepan- 
cies. Obviously, the two tendencies here act 
in mutually opposite directions. T! if in 
the present study the subjects high, in com- 
parison to those low, in need for approval 
strive more strongly to match ideal standards 
(higher risk levels), the effect is neutralized 


by their stronger concern over closeness to 
the presumed normal position (lower risk lev- 
els). If the latter consideration is valid, high 
approval needs should be associated with 
stronger shifts in sufficiently large groups, 
where group discussion demonstrably leads to 
the displacement of the perceived social stan- 
dard toward higher risk (Ferguson & Vidmar, 
1970), thus “taking care” of subjects’ con- 
cern over closeness to the normal position, 
Yet, Kogan and Wallach (1967), using five- 
person female groups, found an indication of 
smaller shifts by groups of subjects high in 
need for approval. (These authors explained 
their finding through the affective inhibitions 
presumably characterizing persons with 
strong approval needs.) 

Concerning the role of the approval mo- 
tive, it may thus be concluded that the pres- 
ent study—together with Kogan and Wallach 
(1967)—casts doubt on the desire for social 
esteem as a possible motivational source un- 
derlying subjects’ shift toward more aspired 
and risky stands. Rather, it appears to be 
self-esteem motivation that disposes subjects 
toward greater commitment to their aspired 
risk levels. The latter proposition is supported 
by a near-significant correlation (r = .25, df 
= 42, $ « 10, two-tailed) between self-es- 
teem and risky shift across the 44 experi- 
mental dyads. (No other correlation with 
self-esteem approached significance.) The role 
of self-esteem in risky shift appears to be 
worth more focused investigation. 


7A German yersion of Cohen’s (1953) self-esteem 
scale, measuring self- versus ideal discrepancies, was 
administered at the end of the experimental session. 
Thus, an interesting possibility remains that risky 
shift raises participants’ self-esteem. 


The implications of the present study may 
be summarized as follows: (a) It is demon- 
strated conclusively that the group risky-shift 
effect is attributable, at least in part, to the 
Positive value of risk. The principal argu- 
ment of value theory is thus confirmed by 
experimental evidence. (6) There is indirect 
evidence in the present study suggesting that 
the social-comparison aspect of value theory 
—the fact that subjects typically think they 
Surpass their peers in risk acceptance—is less 
essential to the validity of value theory than 
has been widely assumed. (c) Tentative evi- 
dence suggests that shifting toward risk in a 
group discussion helps subjects sustain their 
self-esteem rather than gain social esteem. 
This is presumably because they are moving 


closer to a position which they personally 
value. 
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EFFECTS OF GROUP DISCUSSION ON ESTIMATES OF 
CULTURALLY APPROPRIATE RISK LEVELS' 
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Brown's value hypotheses explanation of the risky shift effect assumes that 
subjects' estimates of culturally appropriate risk levels on life dilemma-type 
problems change as a result of group discussion. Two experiments, using risk- 
evoking and caution-evoking items, confirmed this critical assumption. On risk 
items the estimates shifted in a more risky direction, and on cautious items 
estimates shifted in a more cautious direction. Furthermore, the magnitude of 
these shifts was related to the relative initial risk positions of the group 


members. 


One major interpretation of the “risky 
shift" effect is Brown's (1965) value hypothe- 
sis, which implies that a form of social com- 
parison is the dynamic mechanism behind the 
shift. This interpretation seems to rest on two 
basic assumptions: (a) On risk-evoking items 
subjects estimate themselves to be among the 
most risky members of the population, and on 
caution-evoking items they estimate them- 
selves to be among the most cautious; (5) 
group discussion, by revealing where others 
actually stand, causes the subjects’ estimates 
of the risk level preferred by a majority of 
others to shift in a risky direction on risk- 
evoking items and in a cautious direction on 
caution-evoking items. The first assumption 
has received considerable empirical support 
from several authors (Levinger & Schneider, 
1969; Stoner, 1968; Wallach & Wing, 1968) 
who have compared the individual's first de- 
cision (I) to his estimate of the level of risk 
typically taken by the majority of people like 
himself (M,). However, the second assump- 
tion, which can be operationally defined by 
a change in M., has not been tested to date. 
Moreover, an equally plausible alternative 
hypothesis (also pointed out by Levinger & 
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Schneider, 1969) is that instead subjects will 
reject their fellow group members’ pref- 
erences as unrepresentative of the population, 
For example, the subject might say to him- 
self, *For some reason I have gotten into a 
group with people who are really deviant from 
my estimate of the cultural norm but it must 
be theirs and not my estimate which is wrong." 
In this event no change in M,s would result 
and doubt would be cast on the validity of 
Brown's hypothesis. 

Brown's hypothesis states that subjects 
alter their risk preference because group dis- 
cussion makes them aware that they do not 
epitomize the value evoked by the item. In 
other words, they still want to be at least as 
risky as, or more risk than, others in the 
population. If this is the case, subjects’ new 
risk preferences should on the average be 
more extreme than their new estimates of 
what a majority of others would prefer, just 
as their initial preferences are usually more 
extreme than their initial majority estimates. 1 

Furthermore, Brown's hypothesis implies | 
that the magnitude of these shifts will not be 
equally distributed among all group members, 
but rather the degree of shift will be related | 
to a member's initial risk preferences relative | 
to the other members of the group. On risk- 
evoking items, for example, the less risky & 
member is initially, the greater the discrep- 
ancy he will see between his original estimate 
of how others would answer the item (Me) 
and the actual preferences of the members of 
his ‘group. This will lead him to shift both 
his M, and his personal preference in a risky 
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direction to fit the newly perceived norm. The 
more risky group members, on the other hand 
will discover they are more risky than others, 
just as they had expected, and will not feel 
to change. On caution-evoking 
items, the dynamics presumably operate in 
the opposite direction, causing an initially 
more risky group member to shift his M, and 
personal preference in a cautious direction. 
Vidmar (1970) and Vidmar and Burdeny 
(1969) have verified these hypotheses with 
respect to personal preferences, but the cru- 
cial hypothesis with respect to M, shifts has 
not been tested to date. 

Consequently, two experiments were con- 
ducted to test the following hypotheses: (I) 
On risk-evoking items subjects’ cultural norm 
estimates (Mes) shift in a more risky direc- 
tion after group discussion, and on caution 
evoking items these norm estimates shift in a 
more cautious direction; (II) after discussion, 
subjects still feel that they are more risky 
than others on risk-evoking items and more 
cautious than others on caution-evoking items; 
(III) the magnitude of the differences pre- 
dicted in Hypotheses I and II is related to 
the individual members' initial risk position 
relative to the other group members. 

In describing both experiments, “‘I,” refers 
to the first decision made individually by each 
subject, “G” refers to the unanimous group 
decision, and “Ta” refers to the final decision 
made individually by each subject. 


, 


compelled 


EXPERIMENT I 


This experiment was performed in order to 
test the hypothesis concerning norm estimate 
change. To avoid the possibility of test-retest 
sensitivity due to obtaining two M,s, the sub- 
jects were randomly assigned to one of two 
independent conditions, one measuring M.s 
before group discussion and one measuring 
Mss after group discussion. Since random as- 
signment implies that two groups will be ini- 
tially not different, the Mes of the first condi- 
tion could be compared to those of the second 
condition. The design was as follows: Condi- 
tion A (five risky and five cautious items): 
lj; M, G, Iz; Condition B (five risky and 
five cautious items): I4, G, I2, Me. 
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Method 
Subjects and Assignment to Groups 


The subjects were 56 male introductory psychology 
students from the University of Western Ontario 
subject pool. They were randomly assigned to one 
of four sessions. (Two sessions were run for each 
condition.) The condition presented in a particular 
session was randomly determined. Within each ses- 
sion subjects were assigned to 4-man groups to dis- 
cuss the items. Condition A had six groups (24 sub- 
jects) and Condition B had eight groups (32 sub- 
jects). 


Experimental Paradigm and Instructions 


The two experimental conditions were modifica- 
tions of the standard risky-shift paradigm (Kogan & 
Wallach, 1967). The major difference was the inser- 
tion of the M, instructions (adapted from Wallach 
& Wing, 1967) which were as follows: 


Now that you have finished making your recom- 
mendations, I would like you to go back over the 
questionnaire again for a different purpose. This 
time please indicate what vou think the majority 
of your fellow students here at the University of 
Western Ontario would mark for each item. For 
each item write an “M” in the space that cor- 
responds to the choice that you think the ma- 
jority of your fellow students would make. You 
can put the *M" in the same space you checked 
for yourself or in a different space. 


In Condition A the M. instructions were inserted 
immediately after the T; decisions, whereas in Condi- 
tion B they were inserted after the I. decisions. 


Life Decisions Questionnaire 


Five caution- and five risk-evoking items were 
selected from among those employed by other re- 
searchers. They were randomly asigned positions in 
a 10-item questionnaire and are summarized with 
their source in that order: 


1. The captain of a losing football team must, on 
the last play of the game, decide between a play that 
will almost certainly lead to a tie and a more risky 
play which will win the game if successful, but lead 
to certain defeat if unsuccessful (risk: Kogan & 
Wallach, 1964). 

2. A man and his wife, who is in her first preg- 
nancy, discover that a complication has arisen; they 
must decide between an abortion immediately to 
save the mother (she could have no more children) 
and letting the pregnancy continue, which could 
kill both mother and child (caution: Stoner, 1968). 

3. A graduate student attempting to get his doc- 
torate must decide between a little-known college 
where he would almost certainly get his degree and 
a well-known school which fails many of its students 
(risk: Kogan & Wallach, 1964). 
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TABLE 1 


Mean I, — G, I; — I; asp I, — M, Sutrrzs, AND 
Tests or Sioxaricance rox Conxprrions A 
AND B rox Born Risky aw Cavrious 
Irems: Exreximent I 


Con- 


type Score | dition | atte | Y | f 
Risky L-G A. 58 5 410° 
B» E! 7 3.87* 
IL-l | 4.105261] 287] "eye 
Be | | 7| 435° 
Cautious | I; - G A^ | —.18 5 — 57° 
| Lia —51 | 7 | —1.50° 
| - Rh Ae -45| 5| —.23 
| | Bp —.56 7 | —3.04° 
Risky L-M,|A | —1.07 | 12 4.7748°* 
I,—M,| B 34 
Cautious | L — Me | Ab 39 | 12 551% 
L-M.|B | —.02 


* Positive values indicate a risky shift; negative values in- 
dicate a cautious shift. 

> Positive score indicates that subjects estimated others to be 
more risky than themselves; negative score indicates that sub- 
jects estimated others to be more cautious than themselves. 

* Tests of significance ; difference from zero. 

: Boag be Conditions A and B. 

$ <.05. 


4. A successful businessman must decide whether 
or not to run as congressional candidate for the mi- 
nority party in his district. However, his chances of 
success are not large, the campaign would involve 
financial sacrifice, and his wife and children already 
feel deprived of his company (caution: Rabow, 
Fowler, Bradford, Hofeller, & Shibuya, 1966). 

5. A married man with children must decide 
whether or not to invest his life insurance in a stock 
which he has heard might triple in value. However, 
if a new product does not catch on, the stock will 
certainly drop (caution: Nordhgy, 1962). 

6. A couple considering whether or not to get 
married have had a number of arguments. Consul- 
tation with a marriage counselor has indicated that 
a happy marriage while possible is certainly not 
assured (caution: Kogan & Wallach, 1964). 

7. A married man with one child and a secure but 
not exceptional job must decide between staying or 
accepting a better paying job of higher potential 
from a struggling young company (risk: Kogan & 
Wallach, 1964). 

8. A tournament chess player must decide between 
an assured stalemate and a risky play which, if suc- 
cessful, would win the game, but if unsuccessful, 
would result in certain defeat (risk: Kogan & Wal- 
lach, 1964). 

9. A man about to fly overseas on his vacation is 
troubled by a severe abdominal pain. Now at the 
airport he must decide between boarding the plane 
and going to the hospital, which would ruin his va- 
cation plans (caution: Stoner, 1968). 
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10. A married physicist who has just bec ven 
a S-year university appointment must ce 
tween a series of short-term but relatively 
portant studies and a long-term potentia! 
tant study which might leave him with : 
show for his 5 years (risk: Kogan & Wall 


Results 
Group and Individual Shijts 
The risk-evoking and caution-evoking items 
were analyzed separately. “Group” scx 
the I,, Iz, and M, decisions were obtained by 
taking each subject's average response to the 
five risky (or cautious) items and then aver- 


res for 


aging these scores over the four group mem- 
bers. The G decisions were also averaged over 
the five risky (or cautious) items. To deter- 


mine whether statistically significant shifts 
occurred in Conditions A and B, ¢ tests of 
the difference from zero were performed on 
the group (I; — G) and the mean individual 
(I; — Iz) difference scores. The results, pre- 
sented in Table 1, indicate that risk-evoking 
items yielded statistically significant shifts in 
all instances, but that the caution-evoking 
items yielded only one statistically significant 
shift. Examination of the individual cautious 
items shows that all shifts were in the ex- 
pected direction. This latter result is con- 
sistent with previous research (Stoner, 1968), 
which indicates that statistically significant 
cautious shifts are more difficult to obtain 
than significant risky shifts. 


Majority Estimate Shifts 


In order to examine the first hypothesis 
(majority estimates are different if taken be- 
iore or after discussion), a difference score 
(I; — M.) was calculated for each group. 
These I, — M, scores were used instead of the 
M, scores themselves, because any differences 
in T, scores between Condition A and Condi- 
tion B resulting from sampling error likely 
meant some initial differences in the subject's 
norm estimates. The use of the T, — M. 
scores, however, acts as a rough covariance 
technique to account for these initial discrep- 
ancies. 

The lower part of Table 1 presents the 
T; — M, differences by conditions and the t- 
test comparisons between Condition A and B 
lor both risk- and caution-evoking items. On 


ing items, group members estimated 
discussion that others were more cau- 
than themselves, whereas group mem- 
whose opinions were solicited after group 
discussion estimated that others were more 
Tisky than themselves. The difference is sig- 
Mificant at the .001 level (two-tailed). On 
tion-evoking items, subjects in Condition 
tended to estimate others to be more risky 
they themselves, whereas subjects in 
Condition B estimated others to be slightly 
more caut'ous. The differences between Con- 
ditions A and B were not significant for cau- 
lion-evoking items, although considering the 
all cautious shifts obtained, this result 
might be expected. 
- In order to examine the second hypothesis, 
that subjects feel they still epitomize the 
ue after discussion, mean T, — M, differ- 
Scores were calculated for the risky and 
cautious items of Condition B. These mean 
difference scores were significantly different 
from zero both for the risky items (&— 
—.52, t — 2.71, df = 7, p < 05) and for the 
Cautious items (X = .62, 1 = 3.15, df = 7,5 
X 02). This suports Hypothesis II, since, 
€ven after discussion, subjects thought they 
mselves were riskier than others on risk- 
voking items and more cautious than others 
caution-evoking items. 


Relative Risk Position Analysis 


The third hypothesis stated that changes in 
individual members’ risk preferences (in- 
ted by I, — I» shift scores) and their ma- 
ity estimates (I; — M, scores) would be 
related to their relative risk position in the 
oup. Since subjects’ risk preferences differ 
from item to item (see Vidmar, 1970; Vidmar 
t Burdeny, 1969), it was necessary to con- 
ct average item shiít scores for each rela- 
risk position in the group (from Py, the 
t risky member, to P4, the most cautious 
nber). Thus, the I, decisions on each item 

e used to arrange the preferences of each 
toup member from most risky to most cau- 
s. The I, — I; and the I, — M, shift 
es for each position were then averaged 
Wer the five risky or the five cautious items 

each group. The means of these “averaged 
by position” shift scores for Conditions 


TABLE 2 


Meas I, — I; AND I, — M, Scouss by Rxrertvk 
Risk Posrrow yor Risky axp Cavriovs 
Irrws or Conpmons A axp B: 

Expruiment I 
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Item | 


Con- 
me A | Position h = te h -Me 

Risky | A | Ps (most risky) -6!]| -265 
Ps 00 | —Las 

Py -55 -50 

P. Veer cautious) 1.80 2 

I B Pi (most risky) -2:0| -190 

P; 56 a 

P Lu 1.27 

Ps (most cautious) 2.13 2.20 

Cautious A Pi (most risky) | -160 -83 
P -37 -.40 

P 40 AS 

Pa (most cautious) 87 143 

| B | Pi (most risky) -262 | -2.77 

P: -37 E: 

| 3 21 1.20 

| | Ps (most cautious) E] 1.55 


ere score indicates cautious shift; positive, a risky 
shift. 
* Negative score indicates others were estimated as more 


cautious. Positive score indicates others were estimated as 
more risky than self. 


A and B are presented in Table 2. A 2 x 2 x 
4 (Item Type x Condition x Risk Position) 
analysis of variance (with item type as a cor- 
related measure) was performed on both the 
Iı — I» and I; — M, shift scores. The results 
are sumamrized in Table 3. 

Considering the individual shift scores first, 
item type is significant beyond the .001 level. 
The significant Item Type X Condition inter- 
action suggests that asking subjects for M,s 
before group discussion (Condition A) tends 
to slightly suppress individual shifts; Table 
2 shows that the magnitude of both risky and 
cautious shifts is greater in Condition B. The 
risk position factor is significant beyond the 
-001 level and accounts for 44.7% of the ex- 
perimental variance? Examination of the 
means for the risk-evoking items in Table 2 
shows that as predicted, the more cautious a 
member had been initially, the more he 
shifted toward a riskier alternative. This rep- 
licates the results of the Vidmar and Burdeny 
(1969) experiment, including the finding that 
the most risky member (Pi) in fact shows a 
tendency to shift in a cautious direction. This 
cautious shift of the most risky members was 
also suggested by Brown (1965) and Mar- 


*Controlled variance estimates throughout the 
paper are calculated írom the statistic "omega- 
squared" (Hays, 1963; Vaughan & Corballis, 1969), 
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TABLE 3 
SUMMARY OF ANALYSIS Or VARIANCE FOR I; — I; AND I; — M, SCORES: EXPERIMENT I 
CHNCIOESEGRENE | h-h | l: — Me 
Factor 4 | — ET 
MS F MS F 
ioa o Uses | an 502 
Item X Condition 1 4.81 5.96* | 19.87 
Item X Position 3 2.18 2.70 | A6 
Item X Condition X Position M .22 27 | 96 
Error 48 | 81 | sM 
| 
| | 
Condition (A) | 1 .02 | ts 03 | 8.84 
Position (B) $5* | 34.78 5171*^ 65.88 
AXB 3 52 78 440 
Error 48 | .67 1.18 


quis (1962), who emphasized that moderate 
and not extreme risk would be favored. The 
shift patterns were essentially reversed for the 
cautious items, even though the differential 
shifting was not as pronounced. That is, the 
more risky a member was initially, the greater 
was his cautious shift; the initially more cau- 
tious members (P and P4) in fact showed a 
slightly risky shift. 

Considering the majority estimate differ- 
ences, the significant main effects of item type 
and condition and significant Item X Condi- 
tion interaction which are summarized in Ta- 
ble 3 indicate that as expected, group discus- 
sion influenced subjects’ M,s. The significant 
position main effect, which accounts for over 
58% of the experimental variance, indicates 
that subjects’ I, — M, discrepancies were dif- 
ferentially effected by group discussion, de- 
pending on their relative risk position in 
the group. Table 2 shows that on risk- 
evoking items, initially riskier members 
(Pi, Po, Ps) estimated before discussion 
(Condition A) that others would be more 
cautious than themselves. The most cau- 
tious member (P4) apparently believed 
that others would be slightly more risky than 
himself. When estimates were obtained after 
group discussion (Condition B), only the ini- 
tially most risky member (P) estimated oth- 
ers to be more cautious than himself. The 
other members estimated others to be more 
risky than themselves. Contrasting the P, 
members’ estimates between Condition A and 


j 


B shows that the members in the latter condi- 
tion estimated others to be considerably more 
risky instead of just slightly more risky. On 
caution-evoking items, the patterns tend to be 
reversed. However, the initial (Condition A) 
discrepancies between personal choice and 
estimates of others’ choices are not as pro- 
nounced as those for the risky items. This 
may suggest why cautious shifts are generally 
not as large as risky shifts. 


EXPERIMENT II 


Experiment II asked four questions. Was 
the failure to obtain strong shifts on cautious 
items due to the fact that both risky and 
cautious items were interspersed on the same 
questionnaire and the risky items somehow 
dampened the cautious shifts? Would we be 
able to draw the same conclusion about ma- 
jority estimate shifts if all subjects made Mes 
both before and after group discussion instead 
of before or after discussion as in the inde- 
pendent groups design of the preceding experi- 
ment? Experiment I used “conscripted” males 
as subjects. Would we get the same results 
with male and female paid volunteers? Fi- 
nally, would a postexperimental questionnaire 
indicate why various subjects shifted the way 
they did? 

Method 
Design and Procedure 


Subjects answered either the five risky or the five 
cautious items contained in the “Life Decisions 


Questionnaire," Standard L, G, and Is instructions 
were used with each item condition. Each subject, 
however, made majority estimates immediately after 
I, (called M.,) and immediately after Is (called M..). 
The postexperimental questionnaire (described be- 
low) was administered after M... 


Subjects and Assignment 


Subjects were 48 female and 36 male volunteers 
recruited during the summer session at the University 
of Western Ontario. Occupations of student, teacher, 
secretary, bank clerk, and housewife were represented. 
Subjects received $2 for participation and were given 
à chance to obtain a $25 door prize. They were ran- 
domly assigned to 4-man groups of homogenous sex: 
Five male and five female groups worked on the 
Tisky items; four male and seven female groups 
worked on the cautious items. 


Postexperimental Questionnaire 


This questionnaire was given while the subjects 
still had the Life Decisions Questionnaire on which 
they made the G, Is and M., decisions. Instructions 
were as follows: 


Please answer the following questions for EACH 
problem on the Life Decisions Questionnaire. You 
. may refer to your answers on your copy of that 
questionnaire but please do not change any of your 
answers now. 


-. Four questions were asked about each item. Ques- 
‘tion (a) asked whether the subject’s personal pref- 
erence after group discussion was (more risky, re- 
mained the same, more cautious) than before group 
discussion. The other three questions were accompa- 
Ried by a 5-point Likert-type scale. Question (b): 
“My initial risk preference was" (“overly risky," 1, 
through “overly cautious,” 5). Question (c): “I had 
anticipated that the risk preference of the majority 
Of the other group members would be” (“more 
risky,” 1, through “more cautious,” 5). Question 
(d): “On this problem it was desirable to be 
LP among the" (‘most risky members,” 1, through 
“most cautious members," 5). 


Results 
Group and Individual Shifts 


The T, — G and I, — I; shift scores were 
Obtained the same way as in Experiment I. 
These scores are summarized in the top 
Dart of Table 4. All shifts are again in the 
“pected direction, although the I; — Ie 
Shifts on the cautious items fell short of sta- 
Ustical significance. 


lajority Estimate Shifts 


Average majority estimate shifts per item 
Were calculated in the same way as the I, — 
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TABLE 4 


MEANS AND TESTS OF SIGNIFICANCE FROM ZERO FOR 
Ty — G, TI; — Ts, Me, — Ma, I; — Ma, AND 
I; — Me, SCORES FOR Risky AND CacTIOUS 
Irrws: ExrkurukNT II 


IL—————————————— 


Score | Item type Shift df Li 
L-G | Risky ór | 9 248* 
| Cautious —46 |10| —220* 
L-I: Risky 57 | 9 3.57 
Cautious —.28 | 10 | —1.01 
Ma — Me, | Risky | 1.20%] 9] 610% 
Cautious —.27> | 10 2.10* í 
L- Ma Risky —146° | 9| —6.69** t 
Cautious | —.07° | 10 -19 a 
| 
1-9 MSS pay -—8*|o|-ase 7 
Cautious —.1* | 10| —1.16 


* Positive score indicates risky shift; negative score indi- 
cates a cautious shift. 

b Positive score indicates subjects thought others were 
more risky than estimated before discussion. Negative score 
indicates subjects thought others were more cautious than 
estimated before discussion, 

* Positive score indicates subjects thought others were 
more risky than themselves: negative score indicates subjects 
thought ETT were more cautious than themselves, 

> «.05. 


Iz scores. The group Mea, — M,, scores and 
tests of significance from zero are summarized 
in the middle rows of Table 4. On risk-evok- 
ing items, group members estimated the cul- 
turally appropriate risk level to be signifi- 
cantly more risky than they had anticipated 
before discussion. On the caution-evoking 
items, members estimated that the cultural 
norm was significantly more cautious than 
they had anticipated before discussion. 

The group Iz — Me, scores and the tests of 
difference from zero, presented at the bottom 
of Table 4, replicate the findings of Experi- 
ment I for the risk-evoking items but not for 
the caution-evoking items. Although subjects 
estimated that their new preferences were 
more risky than the newly perceived pref- 
erences of others on the risk-evoking items, 
they appeared to see no significant difference 
between themselves and others on the cau- 
tious items. 

On risk-evoking items at least, it seems 
that subjects show a considerable change in 
their estimates of the appropriate cultural 
norm (Me, — Mes; see Table 4) and at the 
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same time still think they epitomize the rele- 
vant value as compared to other people (Iz 
— M; see Table 4), at least on risk-evoking 
items. This suggests that there may be a con- 
siderable difference in the degree to which 
subjects see themselves as more extreme than 
others before as opposed to after discussion. 
This question was examined by comparing the 
subjects’ I, — Me, scores (perceived self-other 
discrepancy before discussion) to their I9 — 
Me scores (perceived self-other discrepancy 
after discussion) by correlated £ tests. On the 
risk-evoking items subjects did not see them- 
selves as being as risky in relation to others 
as they originally had (¢ = 247, df = 9, p< 
.05), whereas on caution-evoking items there 
was no significant difference between subjects’ 
perceived self-other differences before and 
after discussion (4 = .53, df = 10, ms). 


Relative Risk Position Analysis 


The mean I; — Is, Ma — Mo, and I; — 
Mes shift scores for each relative risk position 
are summarized in Table 5, and a 2 X 4 X 2 
(Item Type X Position X Sex) analysis of 
variance on each score is summarized in Ta- 
ble 6. There was no effect due to sex composi- 
tion for any of the scores. The I; — Is scores 
are consistent with the pattern obtained in the 
first experiment. The Ma, — Me, scores are 
consistent with Prediction III. That is, on 
risk-evoking items, the magnitude of ma- 
jority estimate shift (in a more risky direc- 
tion) was inversely related to members’ ini- 
tial risk positions. On caution-evoking items, 
the magnitude of shift (in a more cautious 
direction) was directly related to the mem- 
bers’ initial risk positions. The I, — Mes 
scores, evidencing a strong position main ef- 
fect, were consistent with Hypothesis III. On 
risk-evoking items, the more risky an indi- 
vidual was initially, the more cautious he 
estimated others to be in relation to himself. 
The least risky member (P4) estimated that 
others were just slightly more cautious than 
himself. On caution-evoking items, the ini- 
tially more risky members (P, and P2) 
thought others were more cautious than them- 
selves, whereas the initially more cautious 
members (Pa and P4) thought others were 
more risky than themselves, 
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Postexperimental Questionnaire 


Scores on the questionnaire were 
lated in a manner similar to the pr 
scores. That is, members’ responses on | 
item were arranged according to initial ri 
position. These scores were then averaged over 
items to obtain an average item response by 
position for subjects who had risky items and 
subjects who had cautious items. A 2X4 
(Item X Position) analysis of variance was 
performed on each question. 

The first question showed an item type 
main effect (F = 10.0, df = 1/68, p < .005) 
and a position main effect (F = 5.27, df= 
3/69, p < .005), which indicate that on risky 
items subjects were aware they shifted in a 
risky direction and on cautious items they 
were aware they shifted in a cautious direc- 
tion. The second question asked the subject 
whether his initial risk preference was overly 
risky or overly cautious (anchored by 1 and 
5, respectively). The means for the P, through 
P4 members on risky items were 2.6, 2.9, 3.1, 
and 3.4, respectively. On the cautious items: 
the means were 2.5, 2.8, 3.4, and 3.6. The 
position effect was significant beyond the .001 
level (F = 19.14, df = 3/68) and accounted © 
for 40.0% of the variance. Thus, on both risk- 
and caution-evoking items, initially risky: 
subjects felt that they had been too risky, 
while the initially more cautious subjects felt 
they had been too cautious. The third ques- 
tion asked whether the subject had anticipated 
that the risk preference of the other group. 
members would be more risky or more cau- 
tious. The position effect was not significant, - 
but the item-type effect went beyond the .001 
level (F — 33.25, df — 1/68) and accounted 
for 27.4% of the variance. The means were 
3.6 and 2.9 for the risky and cautious items, 
respectively. Thus, regardless of relative risk 
position, subjects indicated that prior to dis- 
cussion they had anticipated that on risk items 
others would be more cautious than them- - 
selves and on cautious items others would be 
slightly more risky than themselves. The 
fourth question asked whether it was desirable 
to be among the most risky or among the. 
most cautious members. Again there was no 
position effect, but item type reached the .001 
level (F — 100.17, df — 1/68) and accounted 


f 
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for 55.29 of the variance. The mean was 2.5 
for risky items and 3.7 for cautious items. 
This latter finding is quite important for 
Brown’s (1965) hypothesis, since it supports 
the notion that regardless of initial risk posi- 


tion, subjects felt it was desirable to be 
among the most risky people on risk-evoking 
items and among the most cautious people on 
caution-evoking items, 


DISCUSSION 


Two experiments using different ways of 
obtaining majority estimate shift Scores, us- 
ing risk- and caution-evoking items with the 
same groups or with independent groups, and 
using conscripted subjects or paid male and 
female volunteers, support the critical assump- 
tion of Brown's (1965) value hypothesis that 
group discussion will result in changes in sub- 
jects’ estimates of culturally appropriate risk 
levels. The postexperimental questionnaire of 
the second experiment supports the value hy- 
pothesis explanation of the cognitive dynam- 
ics behind the shifts. 

Both experiments supported the assumption 
that subjects will still perceive themselves to 
be more extreme than others even after group 
discussion. After the discussion in Experiment 
I, subjects having risk-evoking items felt that 
their preferences were more risky than others’ 
Preferences, while those having caution-evok- 
ing items felt their preferences were more 
cautious than the preferences of others. This 
finding was replicated on the risky items of 
Experiment IT, but was not replicated on the 
Cautious items. On cautious items, subjects 


_ did not perceive any difference between their 


| 


9wn positions and those of others. This find- 
ing is not inconsistent with the small signifi- 
tance of the cautious Ma, — Me, difference 
(see Table 4), as opposed to the much 
Stronger M,, — Mes difference on risky items. 

Finally, the magnitude of Me, — M,. shifts 
and the Is — Me, differences, as well as indi- 
vidual shifts (T, — Is), was related to the ini- 
tial risk position of the group members. More- 


Over, the shift patterns associated with risk 


Position were consistent with the patterns im- 
Pied by Brown's hypothesis, even when the 
Averaged-over-group-member shifts were small 
°F nonsignificant (i.e., the cautious items). 


APPROPRIATE Risk LEVELS 


It should be noted here that although the 
present data on majority estimate shifts are 
consistent with Brown’s notion that group dis- 
cussion results in a change in perceived norms, 
which in turn results in changed personal 
preferences, the data are subject to an alter- 
native causal interpretation. This other expla- 
nation is that some other factor in the group 
discussion changes the subjects’ personal 
preferences, and these changed personal pref- 
erences in turn cause changes in the percep- 
tion of the culturally appropriate norm. That 
is, the I, — I, shift, or even some third varia- 
ble, causes the Me, — Me, shift instead of 
vice versa. The authors prefer the original 
interpretation, however, on both a priori and 
empirical grounds. Presumably, majority esti- 
mate shifts are based on the “objective” real- 
ity of actually being exposed to other group 
members’ preferences, whereas the individual 
shifts are based on the subjective preferences 
of each member and hence are prey to more 
variability across subjects. Examination of 
Table 4, for example, shows that in fact the 
Mea — Me shifts are larger and more robust 
than the I, — I» shifts. It seems reasonable 
to assume, therefore, that the majority esti- 
mates are the more basic phenomenon.* * 

The magnitude of both individual prefer- 
ence and majority estimate shifts was gen- 
erally smaller for cautious items than for 
risky items. This finding is consistent with 
the data reported by Stoner (1968). Exami- 
nations of the I, — M,, difference scores in 
Experiment I (see Table 2) indicated that on 
cautious items the discrepancy between the 
subject's own initial preference and his esti- 
mate of how others would respond was consid- 
erably smaller than it was for risky items. 
The data for Experiment II are similar. For 
example, the mean for the group I, — M, 


*Steiner (personal communication) has additional 
evidence to implicate changes in perceived cultural 
norms as the causal factor behind the shift. By 
having an experimenter provide subjects with a 
false norm about how others responded to an item, 
he obtained shifts in the subjects personal pref- 
erences. If the false norm was more risky than 
the subjects’ initial preferences, there was a risky 
shift, and if the false norm was more cautious than 
the subjects’ initial preferences, there was a cautious 
shift. 


“~~ a 


CE 


IH Duncan A. FERGUSON AND NEIL VIDMAR 


TABLE 5 


Mean I, — Ian, Ma — Mop, AND I; — Mo, SmirTS 
BY Posrrion FOR Risky AND CAUTIOUS 
Treas; EXPERIMENT II 


| 2 
epe | "pendens |i Ie | Ma — Mat | Ds — Mage 


Risky P. —32 39 | —1.70 
(most risky) 

P: A4 1.28 —.83 

Ps EI 147 —41 

Pi 243 1.67 -6 

(most cautious) 

Cautious —1.56 —.58 —1.28 
fost risky) 

—59 —.39 —.43 

5 24 —.26 .18 

LU 16 67 


Tapak cautious) 


* Positive sign indicates a risky shift; a negative sign indicates 
a cautious shij 

* Positive sign indicates that after group discussion, sub- 
jects indicated they felt others were more risky than originally 
estimated; a n ive sign DIS subjects felt others were 
more cautious than originally e: ted. 

. Mie sign indicates Subjects felt others were more risky 
t ves; negative sign ea subjects felt others 
were ES cautious than themselves, 


scores for risky items was —1.46 (SD — .69) 
and that for the cautious items was .07 (SD 
=.71). If the I, — M, discrepancy is the 
major factor behind group and individual 
shifts, and the evidence implies it may be, 
this may explain why cautious shifts are 
harder to obtain and generally smaller than 

y shifts when they are obtained. Note 
that we still have not explained wy the dis- 
crepancy is smaller for cautious items, Per- 
haps cautiousness is a weaker value than riski- 
ness. Or perhaps it is because the instructions 
for the life dilemmas problems cue subjects 


toward greater riskiness. Recently, Clark and 
Willems (1969), using risky items only, de 
leted the instructions to choose the lowest 
probability and merely asked subjects to 
choose the probability of success they would 
consider acceptable to recommend the alter- 
native in question. Groups under this condi- 
tion did not show a risky shift. The Clark 
and Willems procedure should be repeated 
with both risky and cautious items, and sub- 
jects’ Mes should also be obtained. 

In conclusion, this study provides positive 
evidence in favor of Brown's (1965) value 
hypothesis explanation of the  risky-shift 
effect. While it does not refute alternative 
hypotheses like Kogan and Wallach's (1967) 
diffusion of responsibility hypothesis, Collins 
and Geutzkow’s (1964) and Marquis’ (1962) 
leadership hypothesis, or Bateson's (1966) 
familiarization hypothesis, neither explanation 
could have made similar predictions. Re- 
cently, Madaras and Bem (1968) and St. 
Jean (1970) have proposed a version of the 
value hypothesis that is different from 
Brown's. Instead of a social comparison 
mechanism, they proposed that the dynamic 
involves relevant information in the form of 
arguments or suggestions elicited during 
group discussion in accordance with the par- 
ticular value engaged by the item. While the 
present research makes no critical test be- 
tween the two hypotheses, it does lend more 
support to Brown's version, since social com- 
parison does seem to be occurring. 


TABLE 6 


SUMMARY OF ANALYsIS OF VARIANCE FOR I, — Iù, M, — M.,, AND I; — Me, SCORES: 
ExrzRIMENT II 


I -Ik Ma —Me; I: — Me 
Factor df 
MS F MS F MS F 
Item type (A) 1 12.29 16.14*** 5412 81.14*** 6.96 8.08% 
Position (B) 3 27.91 36.66*** 2.68 4.03* 11.15 12.95*** 
Sex ©) 1 38 50 1.36 2.05 4T 89 
AX 3 1.46 1.92 1.56 2.35 1 36 
A e 1 2.50 3.28 142 2.13 46 58 
PAS 3 39 38 1.48 2.22 12 14 
AXBXC 3 90 1.19 17 26 70 08 
Error 68 16 6 .86 
mD SOE 
p < 001. 
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GROUP CONSENSUS AND CULTURAL VALUES As 
DETERMINANTS OF RISK TAKING' 


ROBERT S. BARON,? KENNETH L. DION, PENNY H. BARON,’ 
AND NORMAN MILLER + 


University of Minnesota 


Subjects discussed risk- or caution-oriented choice-dilemmas items. During 
discussion, subjects confronted an artificial group consensus either riskier or 
more cautious than their initial decision. Brown's cultural value explanation 
of the risky shift predicts greater conformity to the group consensus when the 
group deviates from the subject in a direction that coincides with the value 
orientation elicited by the item (risky or cautious). This prediction was not 
confirmed. A group consensus riskier than the subject's initial position pro- 
duced a risky shift; likewise, a more conservative group consensus produced 
a cautious shift. Yet contrary to Brown's cultural value explanation, a 
congruity between group consensus and item orientation did not produce 
larger shifts. Supplemental data, however, indicated (a) that during discussion 
subjects were aware of the culturally valued level of risk and (6) when they 
conformed in a “nonvalued” direction, they did so despite their desire to 
deviate in a culturally valued direction. These supplemental data support the 
cultural value explanation, while the risk-taking data suggest a limitation of 


that explanation's generality. 


The cultural value explanation (as articu- 
lated by Brown, 1965) assumes that in cer- 
tain situations our culture values (and re- 
wards) risky behavior, while in others it 
values cautious behavior. Thus, when a situ- 
ation permits a choice, risky and cautious be- 
havior will differ in their degree of social 
desirability. According to the cultural value 
hypothesis, when subjects individually make 
decisions in such situations, they attempt to 
conform to the dominant value; that is, they 
want to behave in a socially desirable manner. 
Therefore, they make decisions that they be- 
lieve will deviate in a culturally valued direc- 
tion from a given reference group. 

One can think of at least two psychological 
processes that underlie such behavior. First, if 
a given type of behavior (e.g., risk taking) is 
valued in a particular situation, an individual 
may try to exhibit more of that behavior 
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three authors as well as a National Institutes of 
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Miller, who is now at the Department of Psychology, 
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fornia 90007. 
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than others in order to view himselí more 
favorably and/or gain more reward. li one 
has more of a valued trait than others, these 
outcomes follow naturally. But in addition, 
risk and caution (as well as a variety of 
other personal characteristics) can only be 
defined in relative terms. That is, in many 
situations one must rely on social comparison 
processes; one can only define oneself as risky 
or cautious if one is riskier or more cautious 
than others. Both of these processes point to 
the fact that the desire to deviate from 
others in a culturally sanctioned direction is 
essentially an “image maintenance process 
[Madaras & Bem, 1968, p. 353].” 
Numerous studies show that individuals 
modify their initial decision about risky or 
cautious behavior after discussing the problem 
with others. In accord with the cultural value 
explanation, the evidence clearly shows that 
at least with choice-dilemmas items, the direc- 
tion of shift on a given item corresponds to 
the value that is dominant in that item. For 
example, on items that reliably produce à 
risky shift, subjects typically choose an initial 
(prediscussion) risk level that they think is 
riskier than those of others (Hinds, 1962; 
Levinger & Schneider, 1969; Pruitt & Teger, 
1967; Stoner, 1968; Wallach & Wing, 1968), 
and the opposite occurs on items that re- 
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produce a cautious shift (Levinger & 
1969; Pruitt & Teger, 1967). One 
hardly expect individuals to willingly 
from others unless such deviation 
irable. Similarly, several investigators 
e reported that individuals select riskier 
than their own when asked to indicate 
vel of risk taking they most admire. As 
d, this outcome also occurs only on 
that reliably produce a risky shift 
vinger & Schneider, 1969; Pilkonis & 
a, personal communication). Moreover, 
adaras and Bem (1968) reported that for 
same (risk-oriented) items, people de- 
$ l as risk takers are rated more favor- 
ably on semantic differential scales than 
people described as risk rejectors. 

Thus, risk is indeed valued on items pro- 
ducing a risky shift, whereas caution is valued 
m items producing cautious shifts. This evi- 
dence alone, however, is not sufficient to con- 
the cultural value explanation of the 
shift. In order to confirm cultural value 
theory (as presented by Brown, 1965), one 
nust demonstrate that the primary motivation 
f subjects during group discussion is to ad- 


ar in his discussion of a situation in which 
fisk is culturally valued. 


Those who find themselves (during the discussion) 
low the mean . . . of the group discover that 
are failing to realize the ideal of riskiness that 
may have thought they were realizing. Conse- 
ently they feel impelled to move in a risky direc- 
both in accepting the decision of the group and 
Changing their private opinions. Subjects at or 
ve the group mean feel no such impulsion, they 
relatively risky just as they meant to be [p. 701]. 


cle: 


_ If Brown's analysis is correct, it leads to 
the interesting prediction that those who devi- 
e in a culturally valued direction are less 
vulnerable to an opposing conformity pres- 
üre than those who deviate in a nonvalued 


mity literature as well. 

- To investigate this question we factorially 
Banipulated (a) whether risk or caution is 
dominant cultural value in the hypo- 
ical problems to be discussed and (5) 
ther an artificially established group con- 
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sensus deviated from subjects’ initial position 
in a risky or cautious direction, The con- 
formity literature, as well as social influence 
theory, clearly predicts a main effect for the 
direction of group consensus. However, 
Brown’s cultural value explanation of group 
risk taking predicts a main effect for value 
orientation of the decision problem. Thus, 
when the group consensus is riskier than sub- 
jects’ initial position, a risky shift should oc- 
cur with decision problems eliciting both value 
orientations, but this effect should be strong- 
est on risk-oriented problems. When the group 
consensus is more conservative than subjects’ 
initial position, a cautious shift should occur, 
but this effect should be strongest on caution- 
oriented problems. In short, those deviating in 
a culturally valued direction should resist op- 
posing conformity pressures more than those 
deviating in a nonvalued direction (regardless 
of whether risk or caution is culturally pre- 
ferred in the particular situation). 

Deviating moderately in a culturally valued 
direction from others allows one to be de- 
fined (or define oneself) as a person who 
possesses more of a valued trait than others. 
Tn contrast, those who deviate in a nonvalued 
direction are more likely to change their 
decision. For them, conforming means shifting 
toward opinions that are more socially desir- 
able. 

Wallach and Mabli (1970) presented data 
that conform to these predictions. Their sub- 
jects individually completed the 10 choice- 
dilemmas items that consistently produce a 
risky shift. Then, groups of subjects discussed 
the items. Wallach and Mabli varied group 
composition so that groups contained either 
two initially risky subjects and one initially 
cautious subject or two cautious subjects and 
one initially risky subject. As predicted, ini- 
tially cautious subjects shifted toward risk 
after discussion, regardless of whether they 
constituted a minority or a majority of their 
group, while initially risky subjects showed no 
shift, regardless of whether they were in the 
minority or the majority. However, if we ac- 
cept their own interpretation of the data, then 
it follows that the risky minority’s failure to 
conform stemmed from an absence of any 
opposing conformity pressure rather than an 
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ability to resist such pressure. In short, it re- 
mains unclear whether or not such resistance 
does indeed occur. 

On the other hand, even if we take these 
data as indicating resistance to opposing con- 
formity pressures, a methodological criticism 
weakens such an interpretation of the data. 
One variable in their study was a subject 
variable, that is, the initial riskiness of sub- 
jects. Whenever one uses a subject variable, 
it is possible that observed differences between 
levels of that subject variable reflect some in- 
dividual difference correlated with the subject 
variable. For example, advocates of a leader- 
ship explanation of the risky shift suggest 
that risk-prone individuals exert greater in- 
fluence during group discussion than cautious 
individuals (Collins & Guetzkow, 1964; 
Marquis, 1962). If so, this could explain why 
initially risky subjects in Wallach and Ma- 
bli’s study were not influenced during discus- 
sion, whereas initially cautious subjects were. 
Alternatively, risk-prone people might be 
particularly independent, and cautious people 
might be particularly submissive, thereby 
producing the observed differences in conform- 
ity. Of course, had Wallach and Mabli also 
included items that elicited values of caution 
and obtained opposite effects with risk- and 
caution-oriented items (i.e, on caution-ori- 
ented items, no change among cautious sub- 
jects and a cautious shift among risky sub- 
jects), these alternative explanations would 
not be plausible. However, given the ambig- 
uities inherent in their design, we remain 
basically ignorant about whether one’s ability 
to resist an opposing conformity pressure is 
greater when one deviates from the group in 


a culturally valued as opposed to a culturally 
disvalued direction. 


Metuop 
Design 


The value orientation of risk-taking items (risk or 
caution oriented) and the direction (more risky or 
more conservative) in which group consensus devi- 
ated from naive subjects’ pretest scores were manip- 
ulated in a 2X2 factorial design. In each experi- 
mental condition, measures of risk taking were ob- 
tained at three temporal points: prior to group dis- 
cussion (pretest), at the end of group discussion 
(consensus), and after group discussion (post- 
consensus). 


Em 
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Decision-Making Materials 


Eight decision problems were used, six from Ko- 
gan and Wallach’s (1964) Choice Dilemmas Question- 


naire and two from those developed by Stoner 
(1968). The two Stoner items and two of the choice- 
dilemmas items were caution oriented; that is, they 
typically produce a cautious shiít. The remaining 


four choice-dilemmas items were risk oriented. The 
specific content of the eight items, their source, and 
the expected direction of group shift (ie. their 
“orientation”) are summarized below: 


1. An electrical engineer is faced with an oppor- 
tunity of joining a new company with an uncertain 
future which, if it is successful, could offer greater 
chance for advancement than his present, more se- 
cure position offers (Kogan & Wallach; risky). 

2. A college senior is choosing between attending 
a high-prestige university, which may be too rig- 
orous for him to receive his doctorate, and a lower 
prestige university from which he is certain to obtain 
his degree (Kogan & Wallach; risky). 

3. A chess player must decide whether to take & 
maneuver that might bring victory if successful or 
a defeat if unsuccessful (Kogan & Wallach; risky). 

4. A research physicist is choosing between two 
research projects. One project is a series of easy 
short-term problems of minor importance, while the 
Other is a diffücult long-term problem of major 
importance (Kogan & Wallach; risky). 

5. The president of a light metals corporation 


must decide whether to build a plant in the United - 


States for a moderate return on his investment or in. 
a politically unstable country where a higher return 
is possible (Kogan & Wallach; cautious). 

6. A couple must choose between allowing à 
complicated pregnancy to continue, with danger to 
the mother's life, or having the pregnancy termi- 
nated (Stoner; cautious). 

7. A couple must decide whether or not to get 
married given that they have some sharp differences 
of opinion (Kogan & Wallach; cautious). 

8. A man about to embark on a vacation trip ex- 
periences severe abdominal pains and must choose 
between disrupting his vacation plans in order to see 
a doctor or boarding an airplane for an overseas 
flight (Stoner; cautious). 


The standard choice-dilemmas response format 
(Kogan & Wallach, 1964) was inserted after each 


item. It presents several different probabilities that - 


the risky alternative will succeed and asks the sub- 
ject to indicate the lowest odds he would accept and 
still advise taking the risky action. 


Subjects 


Eighty-seven undergraduate males enrolled in in- 
troductory psychology at the University of Minne- 
sota served as subjects and received points toward 
their final examination for participation. From among 
50 subjects who were pretested on the eight decision 
problems, 33 were assigned to one of the four ex- 
perimental conditions on the basis of their scores on 
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the two sections (risky items versus cautious items) 5 
© From among 37 subjects pretested at another time, 
30 were selected on the basis of their pretest scores 


to serve as "regression controls" for subjects in the 
experimental conditions. These selection procedures 
and their justifications are explained in a later sec- 
tion. 

Procedure 


Subjects were recruited for an experiment in group 
decision making. Initially all subjects indicated (a) 
their initia] pretest levels of risk taking on all eight 
items and (b) the levels of risk taking they expected 
an average member of a self-selected reference group 
(eg. 200 persons like you) to advise. 

l A week after the initial testing, subjects in the 

experimental groups participated in a group discus- 
- sion. Each session contained one naive subject and 

three confederates who played the role of naive sub- 

jects. The group discussion involved only one type 

item—cither “risk oriented" or “caution oriented.” 
] Upon arriving at the group decision-making session, 
subjects in the experimental conditions received a 
booklet containing either all four risk-oriented items 
or all four caution-oriented items, along with their 
initial responses to them. The experimenter had the 
subject reacquaint himself with these items and his 
previous responses to them. Then he gathered the 
subject and confederates around a discussion table 
and provided standard instructions (cf. Teger & 
Pruitt, 1967) for the conduct of group discussions. 
He instructed them to discuss each of the four items 
in turn and to try to arrive at a unanimous decision 
i on each within a time limit of 5 minutes per item, 
- emphasizing, however, that a unanimous decision 

Was not required. A discussion of the four items 

followed these instructions. The decisions reached by 

subjects during discussion provided a first post- 
measure of risk taking. 

Manipulation of group consensus. On three of the 
four items discussed, the accomplices systematically 
took stands that deviated from the naive subject’s 
pretest level of risk taking. On the item positioned 
third (Item 3 or Item 7 in the list above), the 
accomplices agreed with the subject's own predilec- 

_ tion. The direction of deviation (more risky or more 
conservative) on the three critical items constituted 
the manipulation of group consensus. More specifi- 
cally, two of the three accomplices deviated two 
scale positions from the subject's pretest score. For 
*xample, if the subject advocated attempting the 
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i 
—. "Two subjects were later eliminated from the data 
analyses because of suspicion concerning the nature 
1 of the experiment. 
®The “regression control" subjects were run ap- 
Proximately 14 years after the experimental subjects. 
However, the close correspondence of their mean 
Tesponse on the two types of items argues strongly 
“that the two groups of subjects were comparable. 
e mean scores of experimental subjects were 15.45 
"hd 9.02 for risk- and caution-oriented items, re- 
SPectively; those for the control subjects were 15.59 


nd 9.21, respectively. 
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risky alternative of a given item with odds for suc- 
cess of 5 out of 10, two of the accomplices espoused 
odds of either 1 out of 10 (in the more risky con- 
dition) or 9 out of 10 (in the more conservative 
condition). The third confederate played a “slider” 
role; initially, his preferred odds differed from the 
subject’s by only one scale position, but he ulti- 
mately moved to the position advocated by his 
fellow accomplices after discussing the item. This 
slider role was alternated among the three accomp- 
lices in order to mitigate suspicion that might arise 
from a uniform series of actions on the part of the 
accomplices. Finally, to control the content of dis- 
cussion to some extent, in advocating particular 
levels of risk taking, the confederates presented a 
set of standard prepared arguments. One danger of 
the group consensus manipulation, however, was that 
it might undermine the naive subject's initial esti- 
mate of what was most valued in a given situation. 
For example, if one’s associates persist in being 
cautious on a risk-oriented item, naive subjects might 
question whether risk was indeed valued in the 
particular situation depicted by that item, thereby 
weakening their tendency to deviate in what is typi- 
cally a culturally valued direction. To minimize this 
danger, the standard arguments used by confederates 
incorporated statements that endorsed the value pre- 
sumably dominant on a given item. For example, 
for a subject discussing risky items and facing a 
cautious consensus, if his premeasure score on Item 
2 was 3 out of 10, a confederate might say, “I agree 
that he really ought to try for the good school. 
Thats the thing to do. But I'd only suggest it if 
the odds are 7 out of 10 for success.” 

Postdiscussion phase. After discussion of all items, 
subjects indicated their final, private levels of risk, 
thereby providing a second postmeasure of risk tak- 
ing. Last, a postexperimental questionnaire as- 
sessed influence and cohesiveness, evaluation of the 
group and group discussion, and suspicion, 

Subject selection. As indicated, the "group con- 
sensus" manipulation required that the confederates 
deviate from the naive subjects by two scale posi- 
tions on three of the four items discussed. Since the 
response format only allowed response alternatives of 
1, 3, 5, 7, or 9, this rule meant that subjects se- 
lected to face a cautious group consensus necessarily 
had to have chosen premeasure risk levels of 1, 3, 
or 5 on the three critical items within one of the two 
sets (risk-oriented or caution-oriented items). Like- 
wise, subjects selected to face a risky consensus had 
to have chosen premeasure risk levels of 5, 7, or 9 
on the three critical items within one of the two 
sets. Within these restrictions, the rule for assigning 
subjects to the four experimental conditions was 
simply that of maximizing the number of pretested 
subjects that could be used. Thus, the decision on 
whether to assign a subject to a discussion of risk- 
or caution-oriented problems also simply reflected a 
rule of discarding as few pretested subjects as possi- 
ble. 

Obviously, some such subject selection was neces- 
sary to permit the construction of a group consensus 
among the confederates that differed substantially 
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from the subjects’ own premeasure tendency. How- 
ever, as a consequence, the subjects within each 
experimental group constituted an extreme, atypical 
selection. Those exposed to a cautious group con- 
sensus—whether selected to discuss risk- or caution- 
oriented items—would tend to be riskier than the 
average subject on either set, but particularly so 
for those who could be selected to discuss caution- 
oriented items. Contrariwise, those exposed to a 
risky group consensus—whether selected to discuss 
risk- or caution-oriented items—would tend to be 
more cautious than the average subject, but particu- 
larly so for those who could be selected to discuss 
risk-oriented items. In other words, the type of 
consensus to which subjects were exposed (risky or 
cautious) was confounded with subject selection 
such that those exposed to a risky consensus tended 
to be more cautious than average, whereas those 
exposed to a cautious consensus tended to be riskier 
than average. 

Control for subject selection. Since the experi- 
mental variables predict shifts toward group con- 
sensus for the subjects in all conditions, the subject 
selection described above presents a rival explana- 
tion for the predicted experimental effects. In any 
situation where subjects are selected so as to be 
atypically more extreme than the group from which 
they were selected, regression toward the mean offers 
a rival explanation for less extreme scores on a sec- 
ond measure—precisely the same direction of effect 
predicted from the experimental treatments. There- 
fore, regression control subjects were included to 
provide a base line of shift against which the shifts 
produced by the treatments could properly be evalu- 
ated. Toward this end, the 30 regression control 
subjects were selected from a group of 37 additional 
subjects who were pretested on all eight problems, 
Their administration paralleled exactly that of the 
experimental subjects. 

The pretest scores of the control subjects were 
then matched against the pretest scores of the ex- 
perimental subjects. On this basis 30 control subjects 
were assigned to one of four control groups, so that 
for each experimental group there was a control 
group whose pretest scores corresponded in direction 
and extremity to those of the experimental group. 
When perfect matching was not possible, to be con- 
servative the more extreme of the available control 
subjects were used. As a result, the pretest means of 
the control groups were often more extreme than 
those of the corresponding experimental groups (see 
Table 1). 

One week later, control subjects were given one of 
their original pretest booklets? This booklet con- 
tained either all risk-oriented or caution-oriented 
items, depending upon which control group the sub- 
ject was assigned to. Control subjects were then told: 


We had you complete these items in order to 
allow you to familiarize yourself with the problems 
and to give you some idea where you might stand 
on each. Up until now you have had no guide- 
lines by which to make your decisions. Possibly 


T This was the procedure that had been followed 
with experimental subjects. 
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by reconsidering each problem after having seen if 
before, you will have a better idea of the ki 
of decision you would like to make. Note that 
there is no need to feel bound by what you 
marked before, Whether or not you change OF 
how much you change is not important. What is 
important is that you reconsider each answer 
carefully. à 


These instructions were designed to minimize the 
need to maintain consistency and indeed could b 
viewed as a mild demand for change. In other word 
this constitutes a conservative procedure in that 
the base line of change against which experimental 
treatment effects were assessed was, if anything, 
greater than that due solely to regression. 


RESULTS AND DISCUSSION 
Check on the Value Orientation Manipulation 


Supporting the contention that the r 
and caution-oriented items elicited differen 
values, pretest scores on the former were sig 
nificantly riskier than on the latter (¢ = 9.78 
df = 49, p < .001).* Moreover, subjects esti- 
mated that their pretest scores on the four 
risk-oriented items would be riskier than the 
most frequent decision of 200 people like 
themselves? (¢= 3.31, dj—49, p< .005, 
one-tailed), and there was a nonsignificant — 
tendency for the opposite effect to occur on 
the four caution-oriented items (4 = .85, df 
= 1/49, p < .20, one-tailed). As mentioned 
above, such tendencies suport the assumption 
that risk- and caution-oriented items engage 
widely held values favoring risk and caution, 
respectively. 


Risk Taking 


Individual’s risk scores on the risk-dilemma 
items were first reversed so that a high score 
indicated greater risk. (The lowest odds out 
of 10 that an individual advised for a given” 
situation were subtracted from 10.) Each sub- 
ject’s reversed scores were then summed 
across the three items on which subjects devi- } 
ated from the group. This yielded a risk- f 
taking index for the pretest, consensus, and j 
postconsensus phases, respectively. E 

The repeated-measures format of the design 
provides two measures of shift toward risk or 
caution: pretest-consensus shift and con- 


8 All p values reported are two-tailed unless speci- 
fied otherwise. ^ 
? Data reported in this section are based on the 
responses of the 50 subjects initially screened fo 
assignment to the experimental cells (see Footnote 5i 
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Risk-oriented items 


Others Others more 
Riskier Cautious 
Fic. 1, Theoretical predictions according to Brown's (1965) cultural value theory. 


sensus-postconsensus shift. The former re- 
flects changes due to both overt compliance 
to group pressure and covert or internalized 
changes in judgment. The postconsensus mea- 
Sure, completed in private, should be less 
Susceptible to the effects of group pressure. 

Figure 1 depicts the effects predicted from 
a combination of conformity pressures and 
Brown’s cultural value theory. Though group 
consensus should induce subjects to shift 
toward the consensus position, their willing- 
hess to shift should be greater when the con- 
Sensus position is consonant with the value 
ordinarily elicited by the item. In analysis of 
Variance terms, these expectations should yield 
two main effects. 

In our prior criticism of Wallach and Mabli 
(1970) we stated that the leadership expla- 
nation of risky shifts points to person- 
ality differences between risky and cautious 
subjects. Specifically, risky subjects are hy- 
pothesized to be less susceptible to influence 
than cautious subjects. As previously noted, 
the procedure confounds subject selection 
with the manipulation of item orientation 
Such that those who received caution-oriented 
items were riskier than those who received 
tisk-oriented items. This subject manipula- 
tion therefore permits a test of the leadership 
hypothesis as well. Of course, the truth or 
falsity of this hypothesis could be quite inde- 
Pendent of the other predictions. However, if 
We add it to the predictions above, it would 
alter Figure 1 such that the line labeled “risk- 
oriented items" would have a steeper slope 
than the line labeled “caution-oriented items,” 
m that as a consequence of the selection pro- 
Cedures the subjects in the former conditions 


were more cautious than those in the latter, 
In other words, in terms of the present design, 
the leadership explanation in conjunction 
with the other predictions, predicts an inter- 
action as well as the two main effects. 

Table 1 presents the mean pretest scores 
as well as shifts in risk taking for each ex- 
perimental condition. In the two-way analy- 
sis of variance of the pretest-consensus differ- 
ences (ie., shift scores), the only test ap- 
proaching significance was the highly signifi- 
cant group consensus main effect (F = 
136.89, df=1/27, MS error = 9.37, p< 
001). As can be seen from Table 1, a risky 
consensus invariably induced subjects to in- 
crease their risk taking after discussion, re- 
gardless of whether the discussion involved 
risk- or caution-oriented items. Conversely, a 
cautious consensus led subjects to reduce their 
risk taking, regardless of whether the discus- 
sion involved risk- or caution-oriented items. 
All of these shifts were significantly different 
from zero (smallest 7 — 4.96). Most impor- 
tant, the magnitude of these changes did not 
differ as a function of value orientation. This 
is reflected in the absence of a value orienta- 
tign main effect (F = 13, df = 1/27, p> 
.50). That is, the amount of change toward 
the group consensus was unaffected by 
whether or not the position advocated by the 
group was consonant with the value orienta- 
tion elicited by the decision problem. Last, 
there is no support for any personality differ- 
ences between risky and cautious subjects 
(interaction F = .004, df = 1/27, p> .90). 

In the two-way analysis of variance of the 
consensus-postconsensus differences, no effects 
were significant (lowest p > .60). Moreover, 
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TABLE 1 


MEAN PRETEST SCORES AND PRETEST-CONSENSUS SHIFTS IN Risk TAKING FOR 
EXPERIMENTAL AND REGRESSION-CONTROL CONDITIONS 
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Pretest- ( 


j 
Condition | n | Pretest* consensus 
| | shift 
Risk-oriented items | 
Experimen: roup: Consensus 
pene coed : | 7 11.14 7.14 71 
Regression control for risky consensus | 7 | 8.14 1.29 
Experimental group: Consensus | 
Eee n | 9 14.33 —6.11 22 
Regression control for cautious consensus | 9 17.44 .89 
Caution-oriented items 
Experimental group: Consensus 
more risky 5 | 3.60 | 6.80 —.60 
Regression control for risky consensus 4 | 3.75 | —1.75 
Experimental group: Consensus | 
more cautious | 10 14.40 — 6.66 —.80 
Regression control for cautious consensus 10 | 15.40 1.52 


* High scores indicate greater risk taking. Possible range: 0-27, 
» High scores indicate an increase in risk taking from pretest to first 
* Positive scores indicate a decrease in risk taking from consensus (first posttest) to postconsensus (second post 


scores indicate an increase to postconsensus. 


the test for the grand mean was also nonsig- 

nificant (p> .30). In short, subjects main- 
- tained their consensus decisions even after the 
~ group was dissolved, and this tendency was 
. . constant in all conditions. 

As previously indicated, the implementa- 
tion of the design required the selection of 
subjects with extreme pretest scores. Inspec- 
tion of Table 1 shows substantial pretest 
differences in the expected direction. Analysis 
of the simple effects at each level of the value 
orientation manipulation shows that among 
the subjects selected to discuss risk-oriented 
items, the pretest mean of those selected to 
face a risky consensus was marginally more 
cautious than the mean of those selected to 
face a cautious consensus (F = 3.83, df = 
1/27, p < .10). Among those selected to dis- 
cuss caution-oriented items, the difference be- 
tween those facing a risky versus cautious 
consensus is parallel, but more substantial (F 
= 37.17, df = 1/27, p< 001). Further, as 
expected, the subjects facing the two different 
directions of consensus did differ from the 
pretest means of 9.02 and 15.45 for all 50 pre- 
| measured subjects on the three cautious and 

risky items, respectively. Comparison of the 
pretest means of the four experimental groups 
to these values shows that regression could be 
a rival explanation for the obtained shifts in 
all but the risky orientation-cautious con- 
sensus cell. Therefore, the use of the regres- 
sion-control groups seemed warranted. 
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sttest, and negative scores indicate a decres 


Table 1 also presents the mean pretest and 
change scores for the regression-control 
groups. The amount of change between the 
pretest and the retest in each regression-con- 
trol group was compared with the pretest- 
consensus change in the corresponding ex- 
perimental groups by using a nested analysis 
of variance (i.e., experimental group versus. 
control group within each condition). For all 
four comparisons, the experimental groups 
exhibited significantly more change than the 
control groups (lowest F = 9.46, df = 1/53, 
p < .01). Clearly, regression accounts for very 
little, if any, of the change observed in the 
experimental groups. 

In sum, these data do not support the con- 
tention that individuals seek to deviate mod- 
erately from others in a culturally valued 
direction. If this contention were true, one. 
would expect those deviating in such a direc- 
tion to resist conformity pressure more than 
those deviating in a nonvalued direction. In 
the present instance, at least, such differential 
conformity clearly was absent. Moreover, | 
other data suggest that the present findings | 
may have a good deal of generality. A num- 
ber of studies investigating the performance: 
of gifted children indicate that after age 9 oF 
10 such children fail to perform up to their 
level of ability (DeHaan, 1961; Torrance, 
1962; Wilt, 1959; Zorbaugh, Boardman, & 
Sheldon, 1951). In addition, Lewis (1943) 
reported that not only do such children per- 
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] form below their potential, but they per- 
* form no better than relatively less gifted chil- 
dren. Lewis examined 930 children with IQs 
between 125 and 144 and 50 children with 
IQs of 145 or higher. Not only were there no 
differences between these groups on standard 
achievement tests, but only one-third of the 
145 and above IQ group were described by 
their teachers as being bright. Even when peo- 
ple have the ability to deviate in a culturally 
valued direction, they tend to conform to a 
less valued group norm. These data are quite 
consistent with the present findings. 

But why does this conformity occur? One 
explanation is the null hypothesis; that is, 
risk-taking behavior is not influenced by cul- 
tural values. The fact that the present data 
do not permit a rejection of the null hypothe- 

Sis is certainly disheartening for advocates 
| of the cultural value explanation, for such re- 
. jection would represent particularly strong 
| evidence in favor of the theory. On the other 
. hand, accepting the null hypothesis may be 
_ premature, Specifically, one can think of at 
. least two additional explanations for the ob- 
| tained data. 

' First, it is possible that in spite of our ef- 
forts to include precautions against this 
within our procedure, group consensus rede- 
fines what is valued. If a person who deviates 
in a valued direction observes that others 
make no attempt to *catch up" with him, he 
may reevaluate what is indeed valued in the 
given situation. Pruitt (1969) suggested that 
people in a decision-making situation first es- 
tablish what type of behavior is valued, and 
only then do they determine how much of 
that behavior they must emit to adhere to the 
value. Group discussion may affect both of 
these decisions. One would of course expect 
group norms to have a greater influence on 
defining what is valued in novel situations in 
which valued behavior is not clearly specified 
by the culture. If the group consensus did con- 
tribute to defining what was valued, the data 
Could be reinterpreted in terms of the cultural 
value explanation. Regardless of our manipu- 
lations, all subjects would have felt (after dis- 
cussion) that their premeasure decision devi- 
ated in a nonvalued direction. Consequently, 
the equal conformity in all cells may simply 
Teflect subjects’ attempts to reduce this non- 
Valued deviation. As already mentioned, how- 
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ever, the fact that the standard arguments 
used by the confederates were carefully con- 
structed to endorse the value presumably dom- 
inant on a given item, and thereby avoid 
undermining the salient value, seriously weak- 
ens this interpretation. It seems tenuous in- 
deed to assume that subjects redefined what 
was valued despite comments by the confed- 
erates directly supporting that value.'^ 

A second explanation for the observed con- 
formity effects is that conformity pressures 
were so excessive that they precluded the 
possibility of displaying any desire to endorse 
the value typically elicited by the decision 
problems. In terms of this explanation, appli- 
cation of conformity pressure places subjects 
in a conflict. On one hand, they may want to 
endorse the elicited value (and perhaps may 
wish to exceed their fellows in doing so), but 
on the other hand, they are reluctant to ignore 
others’ entreaties for conformity (Levinger & 
Schneider, 1969). Clearly, with excessive con- 
formity pressures, the resolution is likely to 
result in conformity rather than deviancy. 
Thus, the present results may stem from con- 
formity pressures so great that even when 
subjects were in the two conditions generating 
conflict, they acquiesced to group demands. 
Reconsideration of the procedure does sug- 
gest that conformity pressures were intense. 
First, the instructions suggest consensus deci- 
sions. Second, the confederate in the "slider" 
role moved away from the subject toward the 
group consensus during discussion. Thus, the 
subject was faced with being the only person 
preventing a consensus. However, direct in- 
spection of the data itself shows that con- 
formity pressures were not so excessive as to 
produce maximal or ceiling conformity. The 
confederates took a position that deviated 
two scale units from the subjects’ initial po- 
sitions. If subjects conformed completely to 
the position advocated by the confederates on 
all three critical items, their shift scores 
would be either +12 or —12 (depending on 
the direction of the consensus they faced). 
Inspection of Table 1 shows that in all four 
cells, the mean shift was approximately half 
of the amount of shift advocated by the con- 


10 Supporting this assertion, Pilkonis and Zanna 
(personal communications) reported that the level of 
risk taking that subjects most admire is unaffected 
by a discrepant group consensus. 
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TABLE 2 
Mean RATINGS or THE Group 


| How 


How How 
Condition intelli- demo- satis- 
| gent cratic factory 
Risk-oriented items 

Risky consensus 24.28 23.57 23.57 
Cautious consensus 21.50 19.62 22.62 

Caution-oriented items 
Risky consensus 20.20 18.80 21.40 
Cautious consensus 22.80 22.10 24.50 


Note.—On all ratings, high scores indicate more favorable 
ratings. The possible range on all items was 1—30. 


federates. Simply taking these data at face 
value argues that the conformity pressures 
were not so powerful that they completely 
overrode subjects' tendency to be responsive 
to the value implicit in the item contents. In 
other words, it cannot be claimed that faulty 
experimental implementation (inadvertently 
creating overwhelming conformity pressures) 
precluded a fair test of a derivation from 
Brown’s cultural value explanation of the 
risky shift. 


Ratings of the Group™ 


Following the postconsensus measure of 
risk taking, subjects used a 30-point scale 
with described end points to rate their group 
on the dimensions of happy, approving, in- 
telligent, rewarding, democratic, strong, good, 
and satisfactory. While analyses of variance 
on five of these eight items showed nonsig- 
nificant outcomes, marginally significant 
crossover interactions were obtained on the 
items intelligent (F = 3.57, df= 1/26, p 
< 07), democratic (F = 3.49, df= 1/26, p 
< .08, and satisfactory (F= 3.96, df = 
1/26, p < .06). 

The cell means on these three items, pre- 
sented in Table 2, show a similar pattern of 
outcomes on all three items, When subjects 
discussed risk-oriented items, they rated the 
group confronting them with a risky con- 
sensus as more intelligent, democratic, and 
satisfactory than a group that confronted 
them with conservative consensus; the oppo- 
site occurred when subjects discussed caution- 
oriented items. While the risk-taking data 


11No significant effects wer, 
ratings of the grou 
ceived influence, 


i e observed on the 
p discussion, the measures of per- 
or the cohesiveness measures, 
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supply no verification for the predictions from 
Brown’s theory, these postsession ratings do 
give indirect support to his formulation. The 
ratings conform precisely to what one would 
expect if the “conformity conflict” explana- 
tion (Levinger & Schneider, 1969) were valid, 
in that the most negative ratings are received 
by precisely those groups that deviate from 
the subject in a nonvalued direction. More 
specifically, if a subject felt that despite his 
behavior during the group discussion, risky 
behavior was indeed valued on risk-oriented 
items and cautious behavior was valued on 
caution-oriented items, it is sensible that he 
should judge groups that deviate from him in 
a culturally nonvalued direction (the cautious 
group and risky groups, respectively) as less 
intelligent than those deviating in a valued 
direction. From the subjects perspective, 
groups would deviate from him in a non- 
valued direction only if they were not intelli- 
gent enough to ascertain what behavior was 
valued. Similarly, after succumbing to pres- 
sure by a group to conform to a disvalued 
norm, one might well describe that group as 
less democratic than a group that pressures 
him to conform to a valued norm. Thus, the 
group ratings indirectly support the conten- 
tion that on risk-oriented items, when sub- 
jects conformed to a more conservative group 
norm they did so despite the fact that they 
judged riskier decisions more appropriate; 
conversely, when they conformed to a riskier 
norm while discussing caution-oriented items, 
they did so despite the fact that they judged 
a more cautious decision more appropriate. 
Moreover, these data are inconsistent with 
the “redefined value” explanation suggested 
above. 

The only finding inconsistent with the 
“conformity conflict” interpretation of these 
data is the absence of any recidivism on the 
postconsensus measure of risk taking. If the 
conformity in the risky orientation-cautious 
consensus cell and the cautious orientation- 
risky consensus cell primarily represented 
compliance to group pressure, one would ex- 
pect subjects to revert to their initial predis- 
cussion decisions once they were in private. 
Analysis of the consensus-postconsensus shift 
scores, however, showed that subjects in all 
cells maintained the risk level they adopted 
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during discussion. This lack of recidivism, 
however, may be due to subjects’ desire to 
maintain a favorable impression with the 
experimenter (or themselves). Indeed, it is 
not unreasonable to suppose that if one does 
comply to conformity pressure, one would 
want to mask the fact of one's capitulation. 
To turn around 10 minutes later and give 
decisions on the postconsensus measure that 
were quite similar to one's premeasure posi- 
tion makes it apparent to the world (and one- 
self) that one simply “went along with the 
group," *gave in," or "lied." Had the post- 
consensus measure been administered by a 
separate experimenter in a different setting, 
subjects might have displayed substantially 
more backsliding. In other words, we interpret 
the lack of recidivism as an attempt by sub- 
jects to maintain a favorable self-impression 
in the face of an extremely direct assessment 
of their compliance to group pressure. 


CONCLUSION 


The supplemental data from the postexperi- 
mental questionnaire indirectly show that a 
desire to deviate in a culturally valued direc- 
tion was present during discussion. This pro- 
vides at least indirect support for the cultural 
value explanation of the risky shift. On the 
other hand, it is noteworthy that the risk-tak- 
ing data themselves indicate that when con- 
lormity pressures are substantial, such pres- 
sures will be a more powerful determinant of 
tisk-taking (and other types of) behavior 
than the desire to deviate in a culturally 
valued direction. In one sense, however, the 
fact that the reluctance to shift in a nonval- 
ued direction was not reflected directly in 
tisk-taking behavior does not contradict the 
cultural value explanation of the risky shift, 
in that the theory assumes that in typical 
group discussions of risk-dilemma situations, 
Conformity pressures are exerted in the direc- 
tion of the value elicited by the risk-dilemma 
Item. It is clear, however, that there are many 
Other situations where groups do apply pres- 
Sure to discourage individuals from deviating 
in a culturally valued direction (see discus- 
Sion of Lewis, 1943, above). The data from 
lhis experiment indicate that in those situa- 
tions—instances where conforming to a cul- 
tural value and conforming to a group con- 
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sensus are mutually exclusive responses, the 
group will be more powerful. Moreover, when 
such conformity occurs, the behavioral re- 
sponses may be maintained even after the 
group pressure no longer exists. 
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INFORMATION INPUT AND THE GROUP SHIFT 
PHENOMENON IN RISK TAKING* 


COLIN P. SILVERTHORNE °? 


University of San Francisco 


This research was designed to investigate the group decision-making process 
in risk taking rather than merely consider the overall outcome of the group 
discussion. Both risky- and cautious-shift items were used in a two-stage experi- 
ment. Subjects were either assigned to the standard group risk-taking paradigm 
or one of three experimental conditions designed to affect the information input 
of the group discussions either toward greater risk, greater caution, or a 
balance of risk and caution. A content analysis validated that the information 
input was successfully manipulated, and the size and direction of the group 
shifts under different conditions indicated the effects of information on the 
shift. The results are discussed in the light of theories of the group shift, and 
the results are interpreted as support for a relevant information theory. 


'The major assumption of this research was 
that the group shift in risk taking is influ- 
enced by the information content of the group 
discussion. More specifically, it was assumed 
that shifts toward risk result from a prepon- 
derance of risky arguments in the discussion, 
and shifts toward caution, from a prepon- 
derance of cautious’ arguments. This assump- 
tion has been put forward by Brown (1965) 
and St. Jean (1970). 

The following hypotheses, derived from this 
assumption, were tested: (a) A larger number 
of risky than cautious arguments will be 
found in discussions of items that shift toward 
risk and a larger number of cautious than 
risky arguments in items that shift toward 
caution. (5) Discussions that contain an ex- 
perimentally induced equal proportion of 
risky and cautious arguments will exhibit no 
shifts. (c) Discussions that contain an ex- 
perimentally induced preponderance of risky 
arguments will produce a shift toward risk. 
(d) Discussions that contain an experiment- 
ally induced preponderance of cautious argu- 
ments will produce a shift toward caution. 


1This research was completed as part of the au- 
thor's doctoral dissertation while at the University 
of Cincinnati. The author would like to thank L. 
Oseas, D. Lundgren, and C. Shepherd for their help 
in this research. 

* Requests for reprints should be sent to the au- 
thor, Department of Psychology, University of 
San Francisco, San Francisco, California 94117, 


METHOD 
Subjects and Procedure 


One hundred and sixty students from introductory 
psychology classes at the University of Cincinnati 
were used. Subjects were randomly assigned to one 
of four experimental conditions: (a) tbe standard 
condition, (b) the balance condition, (c) the risk 
condition, and (d) the cautious condition. Subse- 
quently, subjects were formed into eight groups of 
five subjects in each of the four conditions. Booklets 
with instructions were given to each subject. Each 
subject responded to eight items. Four had been de- 
signed to produce a cautious shift, and four had 
been designed to produce a risky shift. All eight 
items had been previously shown (Stoner, 1968) to 
produce the expected shifts after group discussion. 

The experiment was administered in two sessions. 
Four items, two risky and two cautious, were com- 
pleted the first week, and the remaining four items 
were completed 1 week later. During one session, 
the standard procedure was utilized. During the 
other session, the procedure appropriate for the 
condition to which the subjects were assigned was 
utilized. Half of the groups received the experimental 
condition first, and the remaining half received the 
standard procedure first. In this way each group was 
used as its own control for any group-specific fac- 
tors. To randomize presentation of the items and 
to provide a systematic control for order effect, 
items were randomly assigned to cells in one of 
four randomly selected 4X4 (Risk X Caution) 
Greco-Latin squares. 

In all conditions, subjects were asked to report 
their individual decisions on the risk scale in the 
booklet containing four items, The completed book- 
lets were then collected. Subjects were then assigne 
to groups of five and each subject given an identical 
booklet, Groups in the standard procedure then dis- 
cussed each item in the booklet until they reached a 
unanimous decision on each item. 
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ith the three experimental procedures, groups were 
given an additional task prior to the group discus- 
sion. Subjects were told that they had completed 
the booklet individually in order that they might 
familiarize themselves with all the situations. This 
would provide them with an opportunity to form an 
opinion on each one. It was suggested that in reach- 
ing this opinion, they had taken into consideration 
several factors which would result in the recom- 
mendation of a specific course of action which had 
either a high or a low probability of succeeding. 
Subjects were told to consider carefully each item 
and list reasons for a particular course of action. In 
the balance procedure, subjects were asked to list 
two reasons favoring the risky alternative and two 
© reasons favoring the cautious alternative. In the risk 
- procedure, subjects were told to list four reasons in 
favor of the risky alternative. In the caution pro- 
cedure, subjects were told to list four reasons favor- 
ing the cautious alternative. This procedure was 
repeated for all the items, regardless of the nature 
| of the item. 
Groups were then told to discuss each item and 


come to a unanimous decision on each item. Further, 
the groups were told that it was the responsibility 
of each group member to include the reasons they 
had listed in the group discussion for the considera- 
tion of the whole group. 

Aiter each group had completed discussion of the 
first item, subjects were reminded by the experi- 
menter to include their listed reasons. The groups 
had no designated leader or chairman. The experi- 

. menter, though present during the experiment, did 
not take part in the discussion and answered any 
procedural questions that arose. 

The group discussion of all four items was re- 
corded, using a tape recorder. These tape recordings 
were subsequently subjected to a content analysis 
by three independent judges. These judges received 
a brief training session using tapes obtained from 
the pilot data. Training was completed when the 
judges showed a high level of agreement in their 
judgments. Correlation coefficients were obtained on 
the data from the standard procedure for pairs of 
judges (y= .93 and .94). Because of the high level 
of agreement between the judges, a single judge 
Was used to score the remainder of the data. The 
judges did not know the purpose of the experiment 
or the condition to which a group was assigned. 

The content analysis involved categorizing each 
Matement made in the group discussion either as 
Suggesting a risky approach, as suggesting a cautious 

- Approach, or as neutral. A statement referring to an 
individual- or group-recommended probability of 
Success was not considered for the purpose of the 
Content analysis. This content analysis was com- 
Pleted to validate the hypothesis concerning the 
Rroup discussion in the standard procedure and to 
Provide evidence of the effectiveness of the experi- 
menta] manipulations in the other three procedures. 
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RrsuLTS 
Standard Condition 


The basic assumption to be tested stated 
that in the standard condition, the recom- 
mended probability of success of the group 
decision would be different from the averaged 
individual decision. These differences were 
expected to be in line with previous research 
(cf. Stoner, 1968). The purposes of this as- 
sumption were to provide validation of the 
items used and confirmation of the existence 
of the group shift for the population used in 
this study. Of primary interest, therefore, are 
the results obtained from groups in the stan- 
dard condition. Groups in this condition un- 
derwent the standard risk-taking experimental 
paradigm. 

Table 1 shows the amount and direction of 
the group shift for each item. Positive num- 
bers reflect a shift toward risk, and negative 
numbers, a shift toward caution. 

A correlated ¢ test was completed for each 
item comparing the mean individual score and 
the group score. As can be seen from the 
table, all items showed a significant shift in 
the typical direction. The basic assumption 
concerning the existence and direction of the 
group shift was therefore confirmed. 


Group Discussion Content in the 
Experimental. Conditions 


The various experimental procedures were 
designed to affect the content of the group 
discussion. The question of prime interest at 
this point concerns the effectiveness of these 
procedures. In the risk, caution, and balance 
procedures, subjects were instructed to intro- 
duce into the discussion the reasons they had 
listed prior to the formation of the groups. A 
two-way analysis of variance was used to 
analyze the data from the content analysis. 
Table 2 shows the mean number and type of 
statements made in the group discussions for 
the balance, risk, and caution conditions. 

On the caution-oriented items, the balance 
procedure was successful in equating the 
type of statement made. Inspection of the 
data shows no difference in the number of 
risky and cautious statements. However, the 
balance procedure was not as successful on 
the risk-oriented items. In this case, the num- 
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TABLE 1 
Sumatary OF Mean Group Sutrts 
Balance condition Risk condition | Caution condition 
Standard |- = A 4]. 
Pos Logos: Balance Standard | — Risk Standard Caution Standard 
procedure | procedure | procedure | procedure | procedure | procedure 
E OTe sIm E l i 
Caution-oriented ^ " ES P 
A —123**| —.05 --1.25** 200.01] -—1.05* — .B0 - 70 7.10 
B | —1.58** 20 -1.25** 40 | —1.05* — 2.05* —175% 7.12 
F | —.82* ES —1.10 .80 -1.75** =, .90* —145** 6.22 
G | —1.00* —1415 —1.50 .80 —1.30 —1.15* | —1.30* 7.47 
Risk-oriented A 
C Cus | 1.5379 .85 .35 1.05* 55 | -—65 -| ,85* 447 
D | em 35 1.00** 1:55* 1.35 —.65 | .90** 4.17 
E 1.18* 10 ud 1.35* 95 —.80** | 125** 3.88 
H 1:25* 10 1.30* 49* .55 —.55 | 1.00** 4.31 


* p <.05, two-tailed test. 
** > € 01, two-tailed test. 


ber of risky statements was statistically 
greater than the number of cautious state- 
ments. 

In the risk procedure, subjects were in- 
structed to introduce reasons favoring a more 
risky approach into the group discussion. It 
was predicted that the number of statements 
favoring risk would be greater than those 
favoring caution on all items. Inspection of 
the results shows that these expectations were 
confirmed. There were significantly more risky 
than cautious statements on both the risk- 
oriented and the caution-oriented items. 

In the caution procedure the introduction 
of reasons in favor of the cautious alternative 
into the group discussion was expected to re- 
sult in more statements favoring caution than 
risk. The results indicate that the data for 
the caution procedure confirmed these expec- 
tations. There were significantly more cau- 


TABLE 2 


Summary OF MEAN NUMBER OF STATEMENTS 
Mape IN THE BaLANCE, Risk, AND 
Caution CONDITIONS 


Type of statement 


Procedure | Type of |= ll (SES 
Risk Neutral Caution 
Balance Caution | 10.25 4.00 12.25 
Risk 14.25 4.75 8.50 
Caution Caution 3.25 4.00 18.00 
1 Risk 10.25 4.25 15.00 
Risk Caution 14.25 4.25 8.00 
Risk 16.50 4.25 1,50 


tious than risky statements on both the risk- 
and the caution-oriented items. A 
The nature of the statements made during. 
the group discussion in the standard condition 
is also of interest. The first hypothesis stated 
that the group discussion in the standard con- 
dition would contain a larger number of risky 
than cautious statements on the risk-oriented - 
items and a larger number of cautious than - 
risky statements on the caution-oriented 
items. The four caution-oriented items were 
considered independently of the four risk- 
oriented items. In order that the number of 
risk and caution statements could be more 
clearly compared, neutral statements were not 
used in the analysis. i 
Inspection of Table 3 shows that a similar 
total number of statements was made on each 
kind of item. Further, groups made more 
statements concerning caution when discuss- | 
ing items which ultimately elicited a cautious 
shift. In discussing items which elicited 4 | 
risky shift, groups made more overt state- 
ments concerning risk. In the three exper- 
mental conditions, groups underwent both ang 
experimental procedure and the standard pro- 
cedure. The results for these groups with the 2 
standard procedure were consistent with those 
obtained with the standard condition. Inspec - 
tion of the data shows that the order of 
presentation had no effect on group behavior. 
In conclusion, the content analysis showed 
that in the standard condition the more fre- 
quent type of statement, either risky or cau- 
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tious, corresponded to the subsequent group 
shift. Further, the analysis verified the effec- 
tiveness of the experimental procedures in 
affecting the content of the group discussion. 
These results are important, since they sug- 
gest that the informational component is an 
independent variable in the group shift phe- 
nomenon. 


Initial Decisions 


It was hypothesized that the direction of 
the group shift with the standard procedure 
would correspond to the average initial ten- 
dency of the group members on their indi- 
vidual judgments. The numbers in the final 
column of Table 1 indicate the average prob- 
ability of success recommended by an indi- 
vidual. The most risky probability of success 
possible was O in 10 (ie. certain failure). 
The most cautious probability of success pos- 
sible was 10 in 10 (i.e. certain success). For 
the purpose of considering the initial deci- 
sions, the range of possible recommendations 
was considered as continuous, with the mid- 
point of 5 in 10. 

As can be seen from these data, the average 
recommended probability of success on the 
risk-oriented items tends to be less than 5 in 
10, while on the caution-oriented items it 
tends to be greater than 5 in 10. The sub- 
jects’ average initial position reflected the 
typical direction of the group shift on an 
item. 


Group Shift in the Balance, Caution, and 
Risk Conditions 


Having established the existence of the 
group shift, it was necessary to confirm that 
the procedures used within the three experi- 
mental conditions were successful in affecting 
the direction and degree of shift. The second 
hypothesis stated that the balance procedure 
would eliminate the typical group shifts. 

Inspection of Table 1 shows that the size 
of the group shift with the balance procedure 
tended to be considerably smaller than that 
found with the standard risk-taking pro- 
cedure. The balance procedure was particu- 
larly successful on items where a cautious 
Shift was expected. On two of these items, B 
and F, the shift was in fact reversed toward 
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risk. Item G showed a different trend, with 
the mean shift remaining high and toward 
caution. This item concerning abortion con- 
sistently showed less flexibility in group dis- 
cussion. As can be seen from Table 1, the 
group shift was not statistically significant 
for any of the items. A comparison with the 
data obtained from the same groups with the 
standard procedure shows that the group shift 
was consistent with other findings obtained 
with the standard procedure. These data, 
therefore, support the hypothesis that the 
balance procedure would elicit no significant 
group shift. The reduction of the shift by 
equating the type of information input shows 
the dependence of the group-shift phenome- 
non on the content of the group discussions. 

The third hypothesis about the nature of 
the group shift stated that the risk procedure 
would lead to more risk-taking behavior on 
all items after group discussion. As can be 
seen from Table 1, on items where a shift to 
risk was expected, the size of the group shift 
in the risk procedure was enhanced. More 
importantly, items on which a shift to caution 
after group discussion is usual showed a re- 
versal of direction, that is, a shift toward risk. 
While only one of these shifts was statisti- 
cally significant, it is apparent that the ex- 
perimental procedure was successful in re- 
versing the normal trend of the shift to cau- 
tion. All the items on which a shift to risk 
was expected showed statistically significant 
shifts to greater risk. A comparison with the 
data obtained with the same groups with the 
standard procedure shows that the group 


TABLE 3 


SUMMARY OF MEAN NUMBER OF STATEMENTS 
MADE IN THE STANDARD PROCEDURE, 
ALL CONDITIONS 


Type of statement 
Condition | Type of 

Risk Neutral Caution 

Standard | Caution 3.75 4.25 22.50 
Risk 25.00 4.50 2.00 

Balance Caution 2.25 4.00 12.00 
Risk 13.00 3.75 1.25 

Caution Caution 3.25 4.50 12.50 
Risk 12.25 4.00 2.25 

Risk Caution 3.25 4.25 11.25 
Risk 10.25 4.75 2.25 
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shifts were similar to those obtained with 
other groups under this procedure. Accord- 
ingly, these data support the hypothesis that 
the risk procedure would lead to a shift 
toward risk on all items. 

The fourth hypothesis stated that the cau- 
tion procedure would lead to a shift toward 
caution on all items. Once again, Table 1 
shows the size of the group shift with the 
caution and risk procedures. Inspection of 
this table shows that items where a shift to 
greater risk is usual showed an opposite shift, 
that is, toward caution. While only one of 
these shifts was statistically significant, the 
experimental procedure was again successful 
in reversing the normal trend to a shift toward 
risk. All of the items on which a shift toward 
caution was expected showed statistically sig- 
nificant shifts toward greater caution. A com- 
parison of the same groups data with the 
standard procedure shows that the group 
shift resembled that found in other groups 
with the same procedure. Hence, these data 
support the hypothesis that the caution pro- 
cedure would result in a shift toward caution 
regardless of the nature of the items. 


Discussion 


The results of the present study are gen- 
erally consistent with the hypotheses pro- 
posed. These results show, first, that in the 
standard procedure, a bias in the discussion 
corresponded to the direction of the subse- 
quent shift. Second, the results show that 
the experimental procedures were successful 
in manipulating the information input of the 
group discussion. Third, the results show that 
shifts occur as the result of inducing content 
biases in the group discussions. Finally, and 
most importantly, the results supported the 
hypotheses concerning the effects of the nature 
and amount of information on group deci- 
sions. 

The content of the group discussions cor- 
responds to the direction of the group shift. 
Thus, groups made more comments concern- 
ing caution when discussing items on which a 
cautious shift occurred and more comments 
concerning risk in discussing items eliciting a 
risky shift. This finding is important in that 
it offers evidence that the type of information 
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generated within the group discussion affects 
the outcome of that discussion. While this 
might appear merely a demonstration of the 
obvious, this is the first empirical demon- 
stration with both cautious- and risk-oriented 
items. Further, it shows that the assumptions 
used in developing the experimental proce- 
dures for varying information input are 
equally tenable. The content analysis per- 
mitted consideration of the informational com- 
ponent while retaining the group situation, 
Specifically, the large item-type differences 
shown in Table 3 indicate the importance of 
the discussion content on the group decision. 

The results of the present research show 
that the information generated in the group 
discussion is one of the major causes of the 
group shift. These results favor a theory 
with a strong informational component. This 
relevant-information theory states that the 
information brought out in the group discus- 
sions causes the group shift on the Stoner 
problems. A similar approach has been con- 
sidered in past research. Kogan and Wallach 
(1967) dismissed such theory as untenable. 
They suggested that a discussion favoring a 
specific alternative may be more a function 
of the subjects’ needs than a cause of the in- 
clination of the group toward greater risk or 
greater caution. Kogan and Wallach argued 
that this is in fact the case and the relevant- 
information notion is therefore not a tenable 
hypothesis. The present research offers evl- 
dence that seriously questions the Kogan and 
Wallach conclusion. When the direction of the 
shift and the type of information are opposite, 
the failure of expected shift points out that 
the relevant-information interpretation 15 
feasible. The group shift is shown to be a 
consequence rather than a cause of informa- 
tion, : 

The design of the present experiment makes 
it possible that the listing process could en- 
hance the salience of widely held values as- 
sociated with the alternatives posed by the 
problem (Marquis, 1962). The making of 
notes on the items could lead the subject to 
see more clearly the greater consistency be- 
tween one course of action and widely held 
values. In developing a case for one course of 
action in preference to the other, the course 
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of action which is most consistent with the 
values held by the subject, and by the hypo- 
thetical person or persons he is implicitly try- 
ing to persuade, should be more compelling 
than a course of action which runs counter 
to these values. This is in fact verified by the 
present research. Information in favor of a 
course of action which was in line with the 
usual shift on an item was much more suc- 
cessful than when the information on a course 
of action was counter to the usual shift. While 
the value elicited by an item is an important 
aspect of the group shift, the current research 
indicates that the information component 
plays a more important role than the value 
theory suggests. 

The present study supports a relevant-in- 
formation theory. It showed that the informa- 
tion generated in the group discussion is one 
of the major causes of the group shift. An 
additional question to be answered is why a 
particular item elicits the type of information 
it does. It has been shown that the type of 
information elicited and the subsequent shift 
corresponds to the average initial position on 
an item. However, it is still unclear why one 
item elicits a cautious approach while another 
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elicits a risky approach. Here, perhaps, is the 
relevance of the value theory (Brown, 1965). 
It seems acceptable that the American culture 
values risk in some situations and caution in 
others. The value engaged will influence the 
initial response to an item. This in turn will 
affect the flow of information so that more 
relevant information will be elicited support- 
ing the value than opposing it. 
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RISKY SHIFT IS EMINENTLY RATIONAL ' 
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Findings from two experiments demonstrate that both the individual and group 
decisions concerning minimum acceptable probability (risk level) are the 
result of a rational decision rule based on subjective expected utility theory 
In the first experiment, the relative utilities of the outcomes were used to 
predict the subject’s initial risk level, assuming he was rational. These esti- 
mated values were quite close to the actual risk levels chosen, both before and 
after group discussion. In the second experiment, the group decision concerned 
the relative utilities of the outcomes instead of the usual issue of agreeing on 
minimum acceptable probabilities. The shifts in relative utilities compared 
favorably with the shifts in risk obtained in the first experiment, assuming 
the group decision was also rationally determined. Shifts in risk, therefore, 
may be considered an epiphenomenon, the direct result of changes in utilities 


produced during discussion. 


Over the past several years, researchers 
have repeatedly demonstrated that on certain 
decisions groups are consistently riskier, and 
on other decisions they are consistently more 
cautious than the average individual. The 
most frequently used task in studies of group 
decision involving risk are the “life-choice” 
dilemmas developed by Wallach and Kogan 
(1959). These are 12 hypothetical situations 
in each of which the hero must choose be- 
tween two courses of action. One alternative 
is riskier, that is, it has greater likelihood of 
failing, but also has greater value if success- 
ful, when compared to the other, more cau- 
tious, alternative. The subject's task is to 
decide what the minimum probability of suc- 
cess for the riskier alternative would have to 
be before he would recommend the hero pur- 
sue it. 

At first, the shift toward risk received the 
most attention. Because many vital decisions 
are made collectively, there was concern over 
the real-world implications of this effect; that 
is, under conditions of some uncertainty, so- 
cial processes induce individuals to be fool- 


1 This research was supported by a grant from the 
National Institute of Mental Health (MH-16950-01). 
? Requests for reprints should be sent to Eugene 
Burnstein, Research Center for Group Dynamics, In- 


stitute for Social Research, Ann Arbor, Michigan 
48106. ; 


hardy, irresponsible, etc. Explanations of the 
risky shift based on the diffusion of responsi- 
bility in groups or the social desirability of 
appearing risky certainly suggest as much 
(see reviews of these theories by Brown, 1965; 
Burnstein, 1969; Clark, 1971; Kogan & Wal- 
lach, 1967; Vinokur, 1971). Implicit in these 
theories is the belief that the shift toward 
greater risk taking is irrational, in that the 
choice is influenced by factors which are 
logically irrelevant to the outcomes explicitly 
given in the decision task. This naturally 
raises the question: Do group decisions in- 
volving risk follow some model of optimal or 
rational decision making? The model which 
we consider worth investigating is the sub- 
jective expected utility model (or SEU model) 
(see Becker & McClintock, 1967; Edwards, 
Lindman, & Phillips, 1965). 

SEU theory deals with choices among al- 
ternatives in terms of the subjective values 
(utilities) of the outcomes associated with 
each alternative, and the subjective proba- 
bilities that each alternative will lead to cer- 
tain outcomes. The rational strategy, accord- 
ing to this model, is to choose the alternative 
with the greatest subjective expected utility 
for the individual. The SEU of an alternative 
is computed by multiplying the subjective 
probability of an outcome of the alternative 
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by its utility and summing over all possible 
outcomes of the alternative. 
Kogan and Wallach (1967) have argued 


that there is evidence to show that the risky 


shift is not adequately explained by a rational 
mode! such as SEU theory. For instance, in 
a few studies all choices were assigned iden- 


tical expected value, and still a risky shift 
was obtained (Clausen, 1965; Pruitt & Teger, 
1969; Wallach, Kogan, & Bem, 1964). Ac- 
cording to Kogan and Wallach, a rational 
mode! would predict no shift. It is quite pos- 
sible, however, that the alternatives did not 
have the same SEU. In fact one of these re- 
searchers (Clausen, 1965) had her subjects 
rate how good they thought their choice was. 
She found that riskier individuals tended to 
feel their chosen alternative was the best 
possible one. This suggests that at least for 
some individuals there may be a difference in 
SEU between alternatives with identical ex- 
pected value. Finally, it should be noted that 
studies which contrive alternatives with equal 
expected value necessarily use a decision task 
different from the standard dilemmas. Thus, 
in general, it would seem that Kogan and 
Wallach provide no real ground for rejecting 
a rational model. 

The SEU hypothesis about the risky shift 
has been explicitly developed by Miller 
(1970). A different version of this hypothesis 
was presented somewhat later by Vinokur 
(1971). In general, it assumes that each sub- 
ject is rational and chooses the risk level in 
accordance with SEU theory. Further, the 
group decision is rationally determined in the 
same manner. During discussion, information 
is exchanged which leads some members to 
change the utilities they had assigned to cer- 
tain outcomes, and consequently to change 
their choice of risk level. Thus, the risky or 
conservative shift is the direct result of a 
rational group decision. 

Each dilemma describes two alternative 
courses of action, a risky alternative and a 
cautious alternative. The risky alternative has 
two possible outcomes, a successful outcome 
and an unsuccessful outcome. Let V denote 
the utility of the successful outcome and let 
K denote the utility of the unsuccessful out- 
come. If S is the subjective probability that 


Risky Suirvt is EMINENTLY RATIONAL 


DC IS = Fal PS 


the risky alternative will lead to success, then 
the SEU of the risky alternative is 


SEU; = SV + (1 — S)K. 


Let U denote the SEU of the cautious alter- 
native (which may have several possible out- 
comes). /f the subject's task were to ration- 
ally choose one of the two alternatives, he 
would choose the alternative with the higher 
SEU. Therefore, he would choose the risky 
alternative if and only if 


SEU, 2 U. 


Equivalently, the risky alternative is pre- 
ferred if and only if 


SV +(1-S)K 2U. 
Solving for S, 
S2 (U -K)/(V — K). 


Therefore, the minimum probability for which 
the subject would prefer the risky alternative, 
denoted pr, is 


pr = (U — K)/(V — K). [1] 


Among other things, this demonstrates that 
the SEU hypothesis predicts the minimum 
acceptable probability of success from knowl- 
edge of the relationship between the utilities. 
Thus, as has been noted by Vinokur (1971), 
pr need not be determined from the absolute 
utilities (U,V,K), but instead from the ratio 
of the relative utilities. The relation between 
pr and the utilities also shows what changes in 
utilities (U,V, K), but instead from the ratio 
level. For example, pr will decrease (a risky 
shift) if either the SEU of the cautious al- 
ternative (U) decreases relative to the utility 
of the successful outcome (K) or if the util- 
ity of the successful outcome (V) increases 
relative to the utility of the unsuccessful out- 
come. 

This line of reasoning suggests that if the 
ratio of the utilities were known, then we 
should be able to predict an individual's risk 
level; and if the change in this ratio were 
known, then we should be able to predict 
shifts in risk levels. 

The present study contains two experi- 
ments that test the SEU hypothesis as an ex- 
planation of the risky shift. On the basis of 
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Equation 1, if a measure of the ratio of the 
relative utilities is obtained, there should be 
a high positive correlation between pr and the 
individual's risk level, denoted r. It is well 
established that subjects change their indi- 
vidual choices following group discussion in 
the direction of the group decision. Thus, a 
high positive correlation between pr and r 
following a shift means that a shift in utilities 
must have occurred. This constituted the first 
experiment. 

The SEU hypothesis contends that the ratio 
of the relative utilities and the minimum ac- 
ceptable probability are one and the same. 
This then suggests that an alternative pro- 
cedure to asking for a group decision about 
minimum probabilities is to require a group 
decision about the ratio of the relative utili- 
ties. We can then compare the group decision 
of the first experiment, which concerned risk 
level, denoted rG, with the group decision of 
the second experiment, which concerned rela- 
tive utilities, denoted prG. 


EXPERIMENT I 
Method 


Subjects. The 113 subjects (97 males and 16 fe- 
males) participated to fulfill a requirement in their 
introductory psychology courses, Ten five-member 
groups, 12 four-member groups, and 5 three-member 
groups were run. Group size depended on the number 
of subjects who could be scheduled and actually 
showed up during a particular hour, 

Materials, The questionnaires administered con- 
tained the following six life-choice dilemmas, very 
slightly modified from Wallach and Kogan (1959): 


1. Mr. A, an electrical engineer, is married and 
has one child; he graduated from college 5 years 
ago. He has worked since graduation with a 
large electronics corporation, where he is assured 
of a lifetime job with a modest though adequate 
salary, and liberal pension benefits upon retire- 
ment; however, he has little chance of a significant 
salary increase before retirement. While attending 
a convention, Mr. A is offered a job with a small, 
newly founded company which has an uncertain 
future. Mr. A would prefer to take the new job, 
if the company would survive the competition of 
the larger firms, since it would pay more money to 
start and would offer the possibility of a share in 
the ownership. However, he would rather keep his 
present job than join the new company and have 
it fail. 

2. Mr. B is the captain of College X's football 
ieam. College X is playing its traditional rival, 
College Y, in the final game of the season. The 


game is in its final seconds. College X has time 
to run one more play. Mr. B, the captain, could 
settle for a tie score with a play which would be 
almost certain to work. On the other hand, he 
could try a more complicated and risky play which 
he would prefer if it succeeded, since it would 
bring victory. However, he would prefer the cer- 
tain play to failure at the risky play, since failure 
at the risky play would bring defeat. 

3. Mr. C is president of a light metals corpora- 
tion in the United States. The corporation has 
already decided to expand by building an addi- 
tional plant in the United States, where there 
would be a moderate return on the initial invest- 
ment, or building a plant in a foreign country. 
There is a history of political instability and revo- 
lution in the foreign country under consideration. 
In fact, the leader of a small minority party is 
committed to nationalizing—that is, taking over— 
all foreign investments. Mr. C would prefer to 
build the plant in the foreign country, if it re- 
mained politically stable, since lower labor costs 
and easy access to raw materials in that country 
would mean a much higher return on the initial 
investment. However, he would rather build in 
the United States than have a foreign plant na- 
tionalized. 

4. Mr. D is currently a college senior who is 
very eager to pursue graduate study in chemistry 
leading to the Doctor of Philosophy degree. He 
has been accepted by both University X and Uni- 
versity Y. The standards of University X are rig- 
orous, so that only a fraction of the degree candi- 
dates actually receive the degree. Mr. D would 
prefer to receive a degree from University X, since 
it has a worldwide reputation for excellence in 
chemistry, and the degree would signify outstand- 
ing training in this field. At University Y, vir- 
tually everyone admitted is awarded the Doctor 
of Philosophy degree, though the university has 
less of a reputation in chemistry, and its degree 
has less prestige than at University X. However, 
Mr. D would rather go to University Y than fail 
at University X. : 

5. Mr. E, a married 30-year-old research physi- 
cist, has been given a 5-year appointment by a 
major university laboratory. He could work on a 
single, long-term, but difficult, problem which 
would resolve basic scientific issues in the field 
and bring high scientific honors. If no solution 
were found, however, Mr. E would have little to 
show for his 5 years in the laboratory, and this 
would make it hard for him to get a good job 
afterwards. On the other hand, he could work on 
a series of short-term problems where solutions 
were easier to find, but where the problems are of 
lesser scientific importance. Mr. E would rather 
succeed at the short-term problems than fail at 
the long-term one. 

6. Mr. F is contemplating marriage to Miss T, 
a girl whom he has known for a little more than 4 
year, Recently, however, a number of arguments 
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have occurred between them, suggesting some 
sharp differences of opinion in the way each views 
certain matters. Although they would definitely 
get married if they could be assured that their 
marriage would turn out happy and successful, they 
realize that a happy marriage, while possible, 
would not be assured. They would prefer not 
getting married over having an unsuccessful mar- 
riage. 


On the first questionnaire (titled “Opinion 
Questionnaire”), immediately beneath each di- 
lemma were nine probability values. Subjects were 
to indicate the minimum probability of success 
they would accept and still choose the risky al- 
ternative. For instance, under the first dilemma 
there was the statement: 


Try to place yourself in the position of Mr. A. 
Check the lowest probability that you would con- 
sider acceptable to make it worth while for Mr, A 
to take the new job. 


Mr. A should take the new job if the chances are 
at least 
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——9 in 10 that the company will prove financially 
sound. 

in 10 

-——4 in 10 

in 10 

— 5 in 10 

in 10 

— 3 in 10 

means 30 

in 10 that the company will prove financially 

sound. 


LI 


This has been the standard procedure in experiments 
which use the dilemmas, The first, second, fourth, 
and fifth items have consistently produced a risky 
shift in past research; the third and sixth items have 
consistently produced a conservative shift. 

The second questionnaire (“Value Estimation 
Questionnaire”) provides a measure of the ratio of 
the utilities, as given by Equation 1, for each of the 
six dilemmas. For example, immediately beneath the 
first dilemma on the second questionnaire was the 
following: 


LOW HIGH 
VALUE VALUE 
TAKE NEW JOB; TAKE NEW JOB; 
COMPANY FAILS COMPANY PROVES 
FINANCIALLY 
SUCCESSFUL 


The subject was told that the left end of the scale 
represented his utility for the unsuccessful outcome, 
that is, K, and the right end, his utility for the 
Successful outcome, V. His task was to “try to put 
yourself in the place of Mr. A and put a checkmark 
on the scale to indicate the relative value of con- 
tinuing his present job,” that is, U. Assuming a linear 
utility scale for the subject, the distance from the 
left end of the scale to the checkmark (U-K) di- 
vided by the total length of the scale (V—K) is ex- 
actly the ratio given by Equation 1.9? Two examples 
of how to make this rating were given at the be- 
ginning of the Value Estimation Questionnaire, one 
involving monetary outcomes and the other, non- 
Monetary outcomes. The Opinion Questionnaire also 
had an example at the beginning which was very 
Similar to the items that followed. 

Procedure. The subjects were seated at a table 
While the experimenter went over the written in- 
Structions and examples in the first booklet, which 
Was always the Value Estimation Questionnaire. 
This provided a prediscussion measure of the ratio 

*More recently, Vinokur (1970) has carried out 
"Xperiments in which each of these utilities, K, V, 
ind U, were assessed independently by a variety of 
methods, His results are similar to those in the pres- 
nt study, although here U is measured relative to 
V and K, 


of the relative utilities, pr. At this point, a randomly 
chosen 60 of the 113 subjects (14 groups) were given 
the Opinion Questionnaire. Again the experimenter 
first reviewed the instructions and the example be- 
fore allowing them to fill out the questionnaire. This 
provided a prediscussion measure of the risk level, r. 
Next, clean copies of the Opinion Questionnaire were 
distributed, and the group was asked to discuss and 
reach a concensus regarding the minimum acceptable 
probability for each item, rG. After the group had 
reached concensus on all six items, clean copies of the 
Value Estimation Questionnaire and the Opinion 
Questionnaire were filled out individually, providing 
postdiscussion measures of pr and r, respectively. 
Having obtained pre- and postmeasures of the ratio 
of the relative utilities, r1, 52, as well as pre- and 
postmeasures of the risk level, rl and r2, a shift 
may thus be observed not only by comparing r1 with 
72, as is usually done, but also by comparing rl 
with pr2. 

As a control, 53 subjects (13 groups) answered 
only the Value Estimation Questionnaire before and 
after group discussion, but, as with the experimental 
group, the group decision concerned only the mini- 
mum acceptable probabilities. In sum, the experi- 
mental group made individual decisions about utili- 
ties and risk levels, while the contro] subjects made 
individual decisions only about utilities. Both, how- 
ever, made group decisions about risk levels. 
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TABLE 1 


FREQUENCY DISTRIBUTION OF r BETWEEN ¢ AND pr WITHIN SUBJECTS 
OVER THE SIX Lrre-Cuoice DILEMMAS 


Range of r 
Item = Y 286 
1.00—.81 ,80—.61 .60—.41 ,40—.21 | -20-.00 r<—.30 

Prediscussion 

Frequency of r(r1,pr1) 37 8 6 3 2 | 4 
Postdiscussion | k 
Frequency of r (r2, pr2) 45 7 3 2 0 3 

Note.—N = 60. 
RESULTS the relation which obtains between pr and r 


Predicted minimum acceptable probabilities 
were computed before (r1) and after (pr2) 
the group decision. Correlation coefficients 
were computed over items, for each subject in 
the experimental condition, between these val- 
ues and the actual probabilities chosen before 
(r1) and after (72) the group decision. The 
distribution of these coefficients is shown in 
Table 1. For the overwhelming majority of 
subjects, the actual and predicted choices are 
very closely related, before as well as after 
the group decision. In over 60% of the cases 
r(rlprl) was between 1.00 and .81; and in 
75% of the cases it was between 1.00 and .61. 
This tendency improved slightly following the 
group decision. In 75% of the cases r(r1,prl) 
fell between 1.00 and .81; and in about 87% 
of the cases, between 1.00 and .61. Correla- 
tions between pr and r across subjects for 
each dilemma were also computed and are 
shown in Table 2. 

Since the SEU hypothesis specifies that pr 
and r are one and the same, that is, pr = r, it 
is appropriate to give the regression line of 
br on r to provide an indication of how well 


TABLE 2 


CORRELATION BETWEEN r AND pr FOR EACH LIFE- 
Cnoice DILEMMA OVER SUBJECTS BEFORE AND 
AFTER Group Discussion 


Dilemma number 
Item inia 


Prediscussion r(r1,pr1) | .43 | .71 


x 58 | .56 | .49 
Postdiscussion 
r(r2,pr2) «72 | .76 | 45 | .57 | .56 | .50 
Note.—N = 60, y 


fits the function pr =r. The linear regression 
equation for pr on r, pre- and postdiscussion, 
is 

rl =.726 pri + .155 

r2 = .829 pr2 + 097. 


The fit appears to be reasonably good, both 
before and after group discussion. The positive 
bias is probably primarily due to the fact that 
pr was measured on a 20-interval scale, while 
r was measured on a 9-interval scale. For ex- 
ample, suppose pr was 5.5. If the subject were 
rational, he would have to choose 6 for 7, 
since it is the lowest acceptable probability 
value available to him on the scale, and not 
5, which would be too low. 

Comparisons * were made between the shift 
in individual levels of risk, r2 — r1, and the 
mean shift in individual ratios of utilities, 
pr2 — pri, for each of the six dilemmas. 
These are shown in Table 3, and, in general, 
there appears to be considerable agreement, 
as would be expected from the high correla- 
tion found between pr and r. The control 
group yielded similar shifts in the ratios of 
the utilities when compared to the exper- 
mental group (see prl — pr2 in Table 3). As 
expected, all of the risky dilemmas yielded sig- 
nificant risky shifts which were also main- 
tained after group discussion (see 7G — 7l 
and 72 — r1 in Table 3). In all but one case 
the conservative dilemmas yielded significant 
cautious shifts which were also maintaine 
after group discussion. Finally, comparisons 
were made between the mean individual ratios 


* All statistical tests in both experiments are ir 
tailed with the » based on the number of group? 
not on the number of subjects. 
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TABLE 3 
MEAN PRrepiscusston RATIONAL AND AcrUAL Risk LEVELS (pri AND rl), THE ACTUAL 
Group Decision (rG), AND MEAN PostpiscussION RATIONAL AND ACTUAL RISK 
LEVELS (pr2 AND r2) WITH Comparisons BETWEEN CERTAIN PAIRS OF MEANS 
| Experimental condition Control condition 
Dilemma ^ 
pri| ri | rG | pr2| r2 | rl —rG| pr — rG | ri —r2 | pri — pr2| pri | rG | $2 | pri —rG | pri — 2 
Risky pr 
1 47 | .49 | .43 | .37 | .42| .06 -04 07 10 43 | .37 | 35 .06 .08 
| | (.10) (ns) (.02) (.05) | (ns) (.05) 
2 | 45 | 42 | 30 | .34 | 33 .12 BI .09 JM | <42 | .32 38 | 10 .05 
(005)| (005) | (02) | (005) (.05) (10) 
3 40|.47|.35|.30|.35 | .12 .05 AOE fc 10 46 | .34 | .39 12 07 
: (40) | (us) (005) (10) (.02) (10) 
Conserva- 
tive | 
4 41 | 41) 35 | 30 | 31 .06 .06 .10 12 -38 | .32 | .36 .06 .02 
(.10) (.05) (.01) (.01) (ns) (ns) 
5 62 | .69 |.74 | .71 | .70 | —.05 —.12 —.01 210 62] .70|.65| —.08 =102 
(.05) (.05) (ns) (ns ) (.05) (ns) 
6 .58 | .63 | .70 | .69 | .72 | —.07 —42 —.03 —.10 58 | 75| .71 | —.17 =13 
(.10) (.05) (.10) (.05) (.02) (02) 
Note, — of significance is indicated in parentheses. 


of utilities and the group decision concerning 
risk levels, (see prl—rG in Table 3) for 
both the experimental and control groups, 
with results highly similar to the standard 
comparison, r1 — rG. The within-subject cor- 
relations between changes in pr (pri — pr2) 
and changes in r (rl —72) following group 
discussion are shown in Table 4. For the vast 
majority of subjects, the relationship was 
fairly robust and quite reliable. 


EXPERIMENT II 
Method 


Subjects. The 103 subjects, 68 females and 35 
males, participated to fulfill a requirement in their 
introductory psychology courses. They were run in 
15 five-member groups, depending on the number who 
could be scheduled and actually showed up during a 
particular hour. 

Procedure, Yn one condition 50 subjects (11 groups) 
filled out only the Value Estimation Questionnaire, 
Which asks for a rating of U relative to V and K. 
In a second condition, 53 subjects (12 groups) both 
before and after discussion filled out only the Opinion 
Questionnaire, which asks for the minimum accepta- 
ble probabilities, Four of the dilemmas taken from 
Wallach and Kogan (1959) are almost identical to 
those used in the first experiment. They are Numbers 
l 2, 3, and 6, shown below, and correspond to 
Numbers 1, 6, 2, and 4 of the first experiment. The 
first, third, and sixth items have been found to 


produce a risky shift; the second, a conservative 
shift. Dilemmas Number 4 and Number 5 are taken 
Írom a study by Vidmar and Burdeny (1969) in 
which these items produced a conservative shift. 
Thus, each questionnaire contained three risky and 
three cautious items 


1. Mr. A, an electrical engineer, who is married 
and has one child, has been working for a large 
electronics corporation since graduating from col- 
lege 5 years ago. He is assured of a lifetime job 
with a modest, though adequate, salary and liberal 
pension benefits upon retirement. On the other 
hand, it is very unlikely that his salary will in- 
crease much before he retires. While attending a 
convention, Mr. A is offered a job with a small, 


TABLE 4 


FREQUENCY DISTRIBUTION OF r BETWEEN PRE- TO 
PosrpiscussiON CHANGES IN RATIONAL Risk 
Levers (rl — pr2) AND PrE- TO Post- 
DISCUSSION CHANGES IN ACTUAL RISK 
LEVELS (ri — rs) WITHIN SUBJECTS 
OVER THE Six LrrE-CHOICE 


DILEMMAS 
Range of r 
Item TU 
1.00-| .80- | .60- | .40- | .20- | r = 
81 | .61 | .41 | .21 | .00 | .00 
Frequency 16 | 13 | 10 8 5 8 


Note.—N = 60. 


i 
f 


d 


468 


newly founded company which has a highly uncer- 
tain future. The new job would pay more to start 
and would offer the possibility of a share in the 
ownership if the company survived the competition 
of the larger firms. 

2. Mr. M is contemplating marriage to Miss T, 
a girl whom he has known for a little more than 
a year. Recently, however, a number of arguments 
have occurred between them, suggesting some sharp 
differences of opinion in the way each views cer- 
tain matters. Indeed, they decide to seek profes- 
sional advice from a marriage counselor as to 
whether it would be wise for them to marry. On 
the basis of these meetings with a marriage 
counselor, they realize that a happy marriage, 
while possible, would not be assured. 

3. Mr. D is the captain of College X's football 
team. College X is playing its traditional rival, 
College Y, in the final game of the season. The 
game is in its final seconds, and Mr. D's team, 
College X, is behind in the score. College X has 
time to run one more play. Mr. D, the captain, 
must decide whether it would be best to settle for 
tie score with a play which would bring victory 
if it succeeded, but defeat if not. 

4. Mr. E is about to board a plane at the air- 
port at the beginning of his overseas vacation. 
Although he has been looking forward to this trip 
for some time, he is troubled because he awoke in 
the morning with a severe abdominal pain. Be- 
cause he has never flown before, he thinks that 
the pain may simply be an upset stomach brought 
on by anticipation of his flight. Although he is not 
far from a hospital where he knows he will obtain 
quick attention, he realizes that a visit to the 
hospital will cause him to miss his flight which 
in turn will seriously disrupt his vacation plans. 
The pain has gotten more severe in the last few 
minutes. 

5. Mr. and Mrs. F have been attempting to have 
a child for a number of years. Mrs. F is now 
pregnant, but a complication has arisen. The doctor 
has advised Mr. and Mrs. F that if the pregnancy 
is allowed to progress, Mrs. F’s life could be in 
danger. If further complications do develop, the 
doctor will not be able to save the child and may 
not be able to save Mrs. F. If the pregnancy is 
terminated at this time, it will be impossible for 
Mrs. F to become pregnant again. 

6. Mr. F is currently a college senior who is 
very eager to pursue graduate study in chemistry 
leading to the Doctor of Philosophy degree. He 
has been accepted by both University X and Uni- 
versity Y. University X has a worldwide reputa- 
tion for excellence in chemistry. While a degree 
from University X would signify outstanding 
training in this field, the standards are so very 
rigorous that only a íraction of the degree candi- 
dates actually receive the degree, University V, on 
the other hand, has much less of a reputation in 
Chemistry, but almost everyone admitted is 
awarded the Doctor of Philosophy degree, though 
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the degree had much less prestige than the cor- 

responding degree from University X 

In both conditions, after the first questionnaire 
was completed, all subjects received a clean copy of 
the Value Estimation Questionnaire and were told 
to discuss each problem and arrive at à consensus on 
the rating of U relative to V and K. Thus, the group 
decision involved reaching a consensus on the ratio 
of the utilities rather than on the minimum accepta- 
ble probability. The method for choosi »robabili- 
ties in the Opinion Questionnaire and ratings in the 
Value Estimation Questionnaire were identical to 
that of the first experiment. However, to facilitate 
discussion during group decision making, each in- 
terval of the rating scales in the Value Estimation 
Questionnaire was labeled with a number. That is, 
the first interval at the low value end had Number 
1 under it, the second interval, Number 2, and so 
on, until the last interval at the high end, which had 
20 under it. 
Results 


Comparisons were made between the mean 
individual shifts in the risk levels, 72 — rl, 
and the mean individual shifts in the ratio of 
the utilities, pr2 — pri. These are presented 
in Table 5. The findings are fairly clear, with 
almost every comparison giving a difference 
in the appropriate direction and most of them 
being statistically reliable. Comparisons were 
also made between the mean individual deci- 
sions concerning risk levels or ratio of utilities 
and the group decision which always con- 
cerned the ratio of utilities (see r1 — prG and 
pri — prG in Table 5). Most important, very 
similar shifts are found when the individual 
and group decisions concern the ratio of the 
utilities as were found in the first experiment 
when the individual and group decisions con- 
cern the minimum acceptable probability 
(compare prli — prG with r1 — rG and prl 
— pr2 with r1 — r2 in Table 5.) 


Discussion 


Findings from both experiments demon- 
strate that the individual and group decisions 
are based on a rational decision rule deter- 
mined by SEU theory. When minimum accep- 
table probabilities were predicted from the 
ratio of the utilities of the outcomes, pr, the 
estimated values were quite close to the actua 
risk levels chosen, r. In fact, the function or 
= r described the obtained relationship fairly 
well, before as well as after a group decision. 
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TABLE 5$ 


Mean PREDISCUSSION. RATIONAL AND ACTUAL IpiVIDUAL Risk Levets (pri AND r1), 
THE RATIONAL Group Decision (prG), aNp MEAN PosTDISCUSSION RATIONAL AND 
ACTUAL Risk LEVELS (pr2 AND r2) WITH COMPARISONS BETWEEN CERTAIN 


Pairs or MEANS 
— sea 
| Value Estimation Questionnaire condition | partial datà | Opinion Questionnaire condition 
Dilemma | "s - EA | E P . ae 
| pri | prG | pr2 | er! — r6 pri = pr2|ri—=rG|ri -r| ri pG 7: |n-pG|ri-n 
^ | [77 TOM 
Risky 
E 38 | 325 | 27 Bk A1 06 07 | 51 | 38 | 44 13 07 
| (005) | (005) | (10) | (02) | | (005) | (01) 
3 a |26 | 28| 16 13 42 | 09 | 44 | 27 | 36) a7 | 308 
| (02 | (01) (005) | (.02) | (02) | (01) 
| | 
6 | 40 | 532 1 32 .08 .08 06 40 551 | 31 | 46 .20 .05 
EC | | (.005) (.01) (.10) (.01) (005) | (02) 
"Conservative | | 
2 | .60 | .68 | .70 —.08 —.10 —.07 —.08 | .67 | .66 | .70 .01 —.03 
(02) (01) | (10) | C10) | (ns) | (ns) 
4 | 64 | .74 | 71 | —.10 —.08 68 | .70 | .72 | —.02 |—04 
| (.05) (.05) (ns) (10) 
P5 | 71 | 79 | gt | —.08 —.10 74 | 80 | 81 | —.06 | —.06 
y | (10) (01) | | (07) | (08) 


ae Significance levels are in parentheses. 


Similar shifts when both the individual and 
- group decisions concern risk levels or relative 
Utilities lends further support to their being 
equivalent. 

This implies that shifts in risk levels may 
be considered an epiphenomenon, a natural 
and inevitable result of changes in utilities. 

"That is to say, if the person actually believes 
he is taking a greater or lesser risk than 
previously seemed wise, certain symptoms 
Should appear. For instance, he might feel 
apprehensive, tense, relieved, elated, etc. Yet 
Considering the vast literature in this area, it 
is surprising that there is no evidence of such 
feelings following a change in risk levels, even 
though individuals are quite aware of the 

nge itself (Wallach, Kogan, & Burt, 1965). 
ence, in labeling the risky shift as epiphe- 
menal we are suggesting that the person 
es not feel more risky or more cautious; 
enomenally, the risky shift does not exist. 
ther, the person is aware that he has 

ged his mind about the minimum proba- 
ty of success which would be acceptable to 
him because it is the rational thing to do— 


Note.—The Value Estimation Questionnaire condition involved subjects filling out only that questionnaire before and after 
ssion. Similarly, subjects in the Opinion Questionnaire condition only filled out that questionnaire before and after group 


once he had revised his opinion about the rela- 
tive value of the outcomes. 

The critical question is, why does discussion 
of the dilemmas lead to apparent changes in 
utility of the outcomes? Two conjectures are 
currently being explored by the authors, One 
is that many decision situations such as the 
dilemmas elicit standard persuasive arguments 
for or against a particular alternative. 

A persuasive argument is thought to have 
an effect similar to that of a correct solution 
in group problem solving. In other words, 
once such a solution of argument is emitted, 
it is likely to be accepted either because the 
argument itself is intrinsically cogent, the 
person who possesses such an argument is as 
a result highly confident of his position, or 
both. This general point has been raised else- 
where in greater detail (Burnstein, 1969; 
Burnstein & Katz, 1971; Vinokur, 1971). It 
has several implications for group decision 
making, one of which is relevant here: The 
extent of the shift in a risky or cautious direc- 
tion depends on the likelihood that an average 
member will have available all or most of the 
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persuasive arguments, It has been argued 
(Vinokur, 1971) that when the likelihood is 
extremely high or extremely low, large shifts 
will not occur. Suppose the likelihood is very 
great. Individuals then will have made their 
initial choice on the basis of all or most of 
the persuasive arguments favoring risk or 
caution; thus, not only are they extremely 
risky or cautious even before the discussion 
begins, but they know beforehand the most 
cogent arguments for their position and/or 
are highly confident about their choice. Sup- 
pose the likelihood is very small. Then hardly 
any individual will think of many persuasive 
arguments; during the discussion they will not 
possess cogent arguments for their position 
nor will they be confident about their choice. 
Therefore, the largest shifts should obtain in 
decision situations in which the standard 
persuasive arguments mostly favor either risk 
or caution, and these arguments have a mod- 
erate likelihood of being available to the 
average member. 

This line of reasoning suggests (a) that 
dilemmas which produce a shift are associated 
with a set of persuasive arguments that have 
a moderate likelihood of being available to 
the average member; (5) that dilemmas which 
shift in a risky direction elicit arguments 
which predominantly favor risk, and those 
which shift in a cautious direction elicit ar- 
guments which predominantly favor caution; 
and (c) that given the present findings on the 
revision of utilities, the persuasive arguments 
for the most part address themselves to the 
value of an outcome instead of its probability. 

While there is no good evidence available to 
evaluate the above analysis, a few findings 
from some informal research may be worth 
citing. In one study (Burnstein & Haidle, 
1969), individuals were asked to “think out 
loud” and verbalize the arguments they con- 
sidered while responding to four of the di- 
lemmas. The subjects were not to indicate 
the minimum acceptable probability for the 
uncertain alternative, but merely to decide 
whether to pursue the certain or uncertain 
alternative. The statements were taped and 
later coded according to whether an argument 
was concerned with the value of an outcome or 
its probability of occurrence, The vast ma- 
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jority of respondents produced arguments in- 
volving the value of an outcome, not its prob- 


ability; a few produced both kinds; but 
hardly any argued solely in terms of proba- 
bilities. Three of the dilemmas used in the 


study give a risky shift, and one gives a con- 
servative shift. On the former, nearly 70% of 
the arguments favored the risky alternative; 
on the latter, about 55% of the arguments 
favored the cautious alternative. Since the 
conservative shift is consistently weaker than 
the shift toward risk, it is interesting to note 
that in the one conservative dilemma. there is 
only a slight preponderance of arguments for 
caution, while in the three risky dilemmas 
there is a sizable preponderance of risky ar- 
guments. On the average there were about a 
half-dozen different kinds of persuasive argu- 
ments per dilemma, and the median number 
of arguments per person was between three 
and four. In addition, during Experiment I, 
tapes were made of the group discussion. Only 
the discussions on the two dilemmas (Num- 
bers 2 and 5) which produced the largest 
shifts were examined. A count was made of 
the frequency with which a member argued 
on the basis of the value of an outcome and/ 
or its likelihood of occurrence. The former 
argument was made almost three times as 
often as the latter. Both this finding and 
those in Burnstein and Haidle (1969) at 
least are consistent with the idea that utilities 
change because the standard persuasive argu- 
ments center on the gains and losses which 
will result from a choice; and initially each 
member has access to only part of the total 
set of arguments which are available to the 
group as a whole. Finally, the standard per- 
suasive arguments are assumed to be effec- 
tive because they are intrinsically cogent, and/ 
or those who possess them are instilled with 
confidence about their initial choice. There 
are two experiments on the risky shift which 
examine the relationship between an indi- 
vidual’s confidence in his choice and the 
amount of influence he exerts in the group 
discussion (Burnstein & Katz, 1971; Clausen, 
1965). Among other things, both find that 
confidence and influence are positively related. 
These studies, however, did not use the di- 
lemmas nor did they analyze the different 
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arguments made by individuals who were high 
or low in confidence. 
Our second conjecture should be distin- 
guished from the preceding one in that a 
X change in utilities as a result of discussion is 
not postulated. Instead it demonstrates that 
a simple averaging of individual utilities can 
imply a shift in the preferred probability 
level. This point of view is supported by 
direct empirical evidence which appeared in 
Miller (1970). He argued that the purpose 
of group discussion is to afford each member 
the opportunity to exchange his utilities with 
the other members. The group first reaches 
consensus on what the utilities for the out- 
= comes are to the group, after which the deci- 
| sion concerning minimum acceptable proba- 
bilities is determined by the rational decision 
rule. If we let U(G), V(G), and K(G) de- 
note the group decisions concerning the utili- 
ties, then the group decision, prG, is given by 


prG = [U(G) — K(G)]/ 
| [V(G) —K(G)]. [2] 


The assumption is made that the group deci- 
sion concerning a utility will be the average of 
the individual utilities, essentially conforming 
| to the group norm. The result is that even 
though each group utility is the average of 
= the respective individual utilities, the group 
risk level, prG, will not be equal to the aver- 
age of the individual risk levels, simply be- 
cause the utilities and the risk level are not 
. linearly related (see Equation 2). In sum- 
mary, the shift in risk levels is the result of 
the group members conforming to the group 
norms for the utilities of the three outcomes. 
The individual shifts reflect the fact that they 
persist in conforming even after the group 
decision. 
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COGNITIVE AND AFFECTIVE PROCESSES INFLUENCING 
RISK TAKING IN GROUPS: 


AN EXPECTED UTILITY APPROACH? 


AMIRAM VINOKUR* 


University of Michigan 


In order to test specific hypotheses concerning group shifts in risk taking, a theory 
l of individual choice behavior was applied to the choice-dilemmas task. Three 
| experiments demonstrated that individuals’ decisions regarding risk could be 
accounted for by a rational decision rule, which is maximization of subjective 
, expected utility (SEU). Two hypotheses that accounted for the shifts in terms of , : 
cognitive processes were tested. The first hypothesis, informational influence, 
suggests that new information raised during the group discussion influences the 
person’s assessment of the utilities of the potential outcomes and this in turn 
influences his decision regarding risk. This hypothesis was supported in four 
experiments. The second cognitive hypothesis, the rationality hypothesis, suggests 
that the shifts in risk are the result of increasing rationality due to the group 
discussion, where individuals adhere more closely to the SEU maximization rule. 
In contrast, a third hypothesis suggests that the shifts in risk are the result of an 
affective process where new utilities (rewards and punishments) that are associated 
with the circumstances under which the decision is made influence subjects’ 
decisions. The bulk of the evidence was found to be against these hypotheses. 


was referred to as the informational influence 
hypothesis. 
In contrast, another explanation suggests -' 


Recent reviews of the literature of group 
decisions involving risk have pointed out that 
consistent risky and conservative shifts follow- 


ing group discussion are dependent on informa- 
tion raised during the discussion that is 
relevant to the dilemma under consideration 
(Clark, 1971; Vinokur, 1971). Vinokur (1971) 


suggested that the shifts in risk-level choices 


could be accounted for by partially shared 


.. information and group problem-solving models. 
— More specifically, it was suggested that new 


information in the form of persuasive argu- 
ments is raised during group discussion and 
influences subjects’ view of the situation, 
that is, their assessment of the utilities of 
the potential outcomes, and this in turn 
influences their decisions: This explanation 


l'This research was supported by a predoctoral 
Ford Foundation Fellowship and in part by a grant 
from the National Institute of Mental Health (MH- 
16950-01). This article is based on the author's doctoral 
dissertation at the University of Michigan. The 
author is indebted to his doctoral committee: Eugene 
Burnstein (chairman), Melvin Manis, Lawrence Mohr, 
Cameron Peterson, and Robert Zajonc, for their valu- 
able help throughout this research. 

2 Requests for reprints should be sent to the author, 
Institute for Social Research, Research Center for 
Group Dynamics, University of Michigan, Ann Arbor, 
Michigan 48106. 


that the shifts toward risk and conservatism 


are due not to the influence of new information, ^ 


but to a process of rationality in which better 
use is made of the same information in order to 
maximize average gain, that is, subjective 
expected utility (SEU) as a decision rule. 
In other words, group discussion increases 
rationality and results in closer adherence to 
the initial SEU decision rule» Consequently, 
subjects are influenced to change (shift) 
their risk-level choices. It was pointed out that 
this modified version of the rationality'hypoth- 
esis, which was originally investigated by 
Wallach, Kogan, and Bem (1964) and Bem 
Wallach, and Kogan (1965), is still a yiable 
explanation for the shifts in risk th should 
be subjected to an empirical test inokur, 
1971). 

In contrast to these two cognitive hypoth- 
eses, a third class of hypotheses suggests that , 


process, Implicit in these hypotheses is the 
assumption that some potential rewards and 
punishments, inherent in the circumstances 
under which the decision is made (e.g m 
a group), influence the person’s decision 


472 / 


| 
; 
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garding risk. From this class of hypotheses, 
‘best known are familiarization, diffusion 
responsibility, and Brown’s value hypothesis 
ee reviews of these and earlier hypotheses 
bey Clark, 1971; Kogan & Wallach, 1967; and 
ur, 1971). From these hypotheses, the 
of the evidence favors Brown’s value 
pothesis (Clark, 1971). 
So far, all these hypotheses have attempted 
explain the group process that is responsible 
Dr the shifts in risk-level choices; but none of 
tl has explained how these risk-level 
Choices are generated in the first place. 
sequently, only indirect tests of these 
otheses are found. It is believed that more 
direct tests of the relevant hypotheses could 
ecome apparent if a theory of how the initial 
-level choices are generated by the individ- 
could be constructed and validated. In 
Zajonc (1965) has suggested that one of 
the requirements for any standard group task 
sa good theory to explain individual responses 
on the task. 
Thus, the purpose of the research reported 
Lin this paper was twofold: to construct and 
alidate a theory of the initial risk-level 
choices, and to contrast and subject the 
hypotheses mentioned above to a direct 
mpirical test. 
It is suggested that the SEU model of 
behavioral decision theory can account for 
‘individual decisions involving risk (cf. Becker 
!& McClintock, 1967; Edwards, Lindman, & 
Phillips, 1965). According to this model, 
dividuals choose from among several alterna- 
‘tives the one with the greatest SEU, that is, 
maximizing average gain. The SEU of an 
alternative is computed by multiplying the 
subjective probability of each possible outcome 
by its utility and summing over all the 
products. 
- Consistent group shifts toward greater risk 
and conservatism following discussion were 
mostly demonstrated with the Choice Dilem- 
s Questionnaire that was developed by 
lach and Kogan (1959). It consists of 12 
sothetical choice dilemmas. On each item, 
subject is instructed to indicate his 
imum odds (i.e., probability, p) for success 
t he considers acceptable in order to prefer 
in uncertain, that is, risky, alternative to a 
ertain alternative. If successful, the outcome 


of the risky alternative is most desirable 
(utility, U,), and, if unsuccessful, its outcome 
is the most undesirable (utility, U;). On the 
other hand, the certain alternative guarantees 
a safe outcome (utility, Ue) which is neither 
the most desirable not the most undesirable 
outcome. 

An SEU model for predicting risk-level 
choice (minimum odds) for a typical choice- 
dilemmas item can be constructed as follows: 

Let p, q, designate, respectively, the subjec- 
tive probabilities that the risky alternative 
wil be successful or unsuccessful. Assume 
(a) U, > U, 2 Un (D p+ 9 =1 Ge, g=1 
— p) for every subject. Then: subjective 
expected utility for the risky alternative: 
SEU = pU. + (1 — p)U;; and subjective 
expected utility of the certain alternative: 
SEU c) = 1- U.. According to the SEU model 
the choice of the risky alternative should 
follow only when its SEU is equal to or greater 
than the SEU of the certain cautious alterna- 
tive, that is, when SEU(a) 2 SEU¢e). Or, | 
equivalently, the risky alternative should be 
chosen if and only if pl. + (1— p)U; 
> 1-U.. Solving for p, 

U, — U; 
U,— Ur 
In other words, according to the SEU model, | 
the lowest probability of success that a subject 
is predicted to consider acceptable (ie. his | 
risk-level preference) in order to attempt the — 
risky alternative is 


2 ph —- Us 
PTS iy 


[1] 


p2 


p] 


(f: the ratio of the utilities of the outcomes, as 
judged by the subject, which also serves as à 
predictor of the subjects risk-level preference, 


p). 

Two important consequences of the SEU — 
model are indicated by Equation 1. First - 
note that the p which maximizes SEU is not 
determined by the absolute utility of any 
single outcome alone, but by the relative 
differences between all the utilities; or, to 
put it differently, it is determined by the 
ratio of the utilities (i.e. utilities’ ratio, f). 
Tf the ratio of the utilities was known, then 
we should be able to predict an individual's 
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TABLE 1 


SEU AxaLvsis or THE EFFECTS or GROUP Processes ON INDIVIDUAL AND 
Group DECISIONS INVOLVING Risk 


AMIRAM VINOKUR 


Analysis and results 


Choice-dilemma | Change in f, i 
type | risk preference | Change in $, 
| | utilities’ ratio 
IRuky: p:—pn«0 
Conservative; | p2— $120 | 
Risky : | h—h«0| h-À 
Conservative: | ps— 5h 20 | bibi 
Risky : | p2— 1«0 
Conservative: | Px— pr >0 
Risky: | pz- pr <0 ps = h | pr 
Conservative: | $3— 5120 | i pr 


Fit between p and 5 


br— p <0 | |pr— pil = |b2— P: | Informational influence hypothesis 
| pr-— f>0 | |p — pil = |p: — ?: 
| |pu— bil > [ds 2: 
| I — Pil > |p2— Pal 
p:— pi <0 | [pi — fil > [pr — ba |*| Informational influence and 
b:— pi >O | |pi— pil > |p2— be |" 


= fal < |»— pa| 
[pr — pil < [fs — B: 


Evidence in support of 


Rationality (SEU) hypothesis 


rationality (SEU) hypotheses 


Affective hypotheses 


the American Psychological Association, 1971. 


* Note that the increase in goodness of fit can result from either changes in p or f or both. According to the rationality hy poth- 
esis, which purports to account for the shifts in p, the goodness of fit should increase as a result of the changes in p rather than in » 
Consequently, the results that satisfy the above-mentioned equations constitute evidence for the rationality hypothesis only if. 
the mean change in $ Is greater than the mean change in  (ie., |p: — i| > |2» — £10). 


risk level; and if the change in this ratio 
was known, then we should be able to predict 
changes (shifts) in risk level. 

Second, in order to reach a group decision 
concerning the lowest p, it is not necessary for 
the subjects to reconcile their absolute utilities 
directly, as is often assumed, but merely to 
agree on their relative magnitudes, that is, on 
the ratio among those utilities. 

In order to validate the SEU model, positive 
correlations between p and f (Le, f as in 
Equation 2) should be obtained across subjects 
for each choice dilemma as well as within 
subjects across several choice dilemmas. 
Given data regarding risk-level choices before 
(pı) and after (2) group discussion, and 
subjects’ assessments of the utilities? ratio, also 
before (fı) and after (f) group discussion, it 
is possible to test important explanations for 
consistent shifts in tisk taking. 

According to the informational influence 
hypothesis, there has to be a change in p 
(i.e., risk preference) in the direction of the 
change in f (i.e., utilities’ ratio) as a result of 
the additional information that is raised during 
the discussion regarding the utilities on which 
Ê is based. However, there need not be any 
increase in the goodness of fit between p and Ê 
following group discussion (i.e., the absolute 

difference between p and f need not decrease), 
since it is not assumed that increased rational- 
ity results from group discussion, 


Note,—Reprinted from an article by Amiram Vinokur published in the October 1971 Psychological Bulletin. Copyrighted by 


On the other hand, the rationality hypoth- 
esis implies that there should zot be a change 
in the subsequent to the group discussion, 
since no new information on the utilities has 
been added. The shifts in p are in the direction 
of the initial f, because by changing their 
risk preference to fit their utilities’ ratio, 
subjects are trying to maximize SEU. Con- 
sequently, the goodness of fit between p and P 
should increase considerably after group 
discussion (i.e., the absolute difference between 
p and f should decrease). 

If both informational influence and rational- 
ization processes are operative, there will be 
both a change in f in the direction of the change 
in p and an increase in the goodness of fit 
between the two. 

"There remains of course the possibility that 
neither of these two cognitive hypotheses is 
responsible for the risky and conservative 
shifts. In this case there should not be a change 
in f after group discussion; instead, the | 
goodness of fit between p and f should decrease. ^ 
Evidence of this kind would suggest that an, 
affective process is responsible for the shifts— 
that new utilities (e.g., utility for risk, or for 
appearing risky, as Brown’s risk-as-a-value 
hypothesis suggests) that are not represented 
in the outcomes of the choice dilemma (i.e., 
Equation 1) are influencing the risk-level 
choices. A summary of this analysis is presented 
in Table 1. 
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set of four experiments was conducted 
j test the SEU model and the three major 
potheses. The first experiment provides the 
ts outlined above and is also concerned with 

he extent to which group discussion rather 
than other factors (e.g., repeated experience 
with the task) is responsible for the shifts in 
“tisk and for the possible concomitant shifts in 
utilities’ ratio that are predicted by the 
informational influence hypothesis. Therefore, 
the basic design of this experiment consists of 
n experimental condition with group discus- 
on and a control condition without discussion. 


EXPERIMENT I 


troductory courses in psychology at the University 
Michigan participated as subjects in this experiment. 
Students participated during the winter semester 
1970 as part of their course requirements. 
Materials. (a) Risk-taking measure: The risk-taking 
ment for measuring P was a questionnaire 
“Opinion Questionnaire”) containing instruc- 
and eight choice-dilemma items. 

The eight items were chosen to include three risky 
items (ie. items that have been found to yield risky 
shifts), three conservative items (i.e., items that have 
found to yield conservative shifts), and two 
eutral items (i.e., items that have been found to yield 
i consistent significant shifts). The three risky items 
taken from the original Choice Dilemma Question- 
fully reproduced in Kogan and Wallach (1964, 
endix E). These items (Numbers 4, 6, and 11 in 


lesirability of ihe play which would result in a tie. 
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the Choice Dilemma Questionnaire) begin with Mr. A, 
Mr. B, and Mr. C, respectively, in our Opinion Ques- 
tionnaire. From the three conservative items only one 
was taken from the Choice Dilemma Questionnaire 
(Number 12), and it begins with Mr. D in our question- 
naire. The other two conservative items, Mr. E and 
Mr. F, were taken from a questionnaire used by 
Stoner (1968; Numbers 2 and 8, respectively, in his 
questionnaire). The two neutral items were also taken 
from the Choice Dilemma Questionnaire (Numbers 2 
and 5). They begin with Mr. H and Mr. G in our 
questionnaire. The risky, conservative, and neutral 
items were dispersed in the questionnaire given to the 
subjects so that no two items of the same type appeared 
adjacently. 

Each choice dilemma was accompanied by the stan- 
dard instructions to choose—between odds of 1, 2, 3, 4, 
5, 6, 7, 8, or 9 chances in 10—the lowest odds of success 
acceptable in order to recommend trying the uncertain 
(i.e., risky) alternative; or, instead, to indicate that 
this alternative should not be attempted, no matter 
what the odds. In the latter case, the response was 
scored as 10. Risk was measured in the standard manner 
as the odds that were chosen by the subject. 

(b) Direct assessment of utilities’ ratio ($): The 
instrument for assessing utilities’ ratio was a question- 
naire (titled “Desirability Estimation Questionnaire") 
containing instructions followed by the same eight 
choice dilemmas as they appeared in the Opinion 
Questionnaire, but without the risk-taking scale. 
Instead, beneath each choice dilemma a 15-point scale 
was provided in order to obtain an assessment of the 
utilities’ ratio. For example, one choice dilemma was 
concerned with a football captain: he was to decide 
between choosing a play that was certain to result in a 
tie, or a risky play that if successful, would result in 
victory, but if unsuccessful would result in defeat. 
The following scale and instructions were provided 
below this item: 


__ Try to place yourself in the position of Mr. A. Place an “X” on the desirability scale to indicate the relative 


0 examples of how to make this rating were given 
beginning of the questionnaire. 

e distance from the left end of the scale to the 
mark divided by the total length of the scale is 
the ratio of utilities given by Equation 2. 
"rocedure. Subjects were randomly assigned to two 
tions. In the first condition, the experimental 
tion, 90 subjects were run in eighteen 5-man 
ups. In the second control condition, 62 subjects + 
te run in groups ranging in size from 3 to 7. 


scale is closely related to one used earlier by 
tein, Miller, Vinokur, Katz, and Crowley (1971). 
though the control condition includes fewer 
than the experimental condition, it allows for 


Low High 
desirability desirability 
Risky play c rr temi us iis. DE lc Diy 

fails; 1 2 3 4 5 6 7 8 910 11 12 13 14 15 succeeds; 

defeat victory 


Briefly, the procedure involved the administration 
of five questionnaires, measuring pı, fi pe (ie, 
risk level arrived at by group consensus), ps, fz, in 
succession in the experimental condition, and four 
questionnaires measuring pı, f; P» f» in the control 
condition. Except for the questionnaire measuring pa 
in the experimental condition, the other four were filled 


statistical tests of the same or even better power than 
those done with the experimental condition. This is 
because the tests using the data of the experimental 
condition were based on groups (18; i.e., means of 
the group members’ scores), whereas those using the 
control condition data were based on subjects' individ- 
ual decisions (62). 


^ 


476 


AMIRAM. VINOKUR 


TABLE 2 
Means or Urrurrigs! Ratio (f) AnD Risk PREFERENCES (f), Propuct-MoMENT CORRELATIONS BETWEEN 


UmnurriEs! Ratio AND Risk PREFERENCE, AND 
AND IN UTILITIES’ RATIO 


MEAN SHIFTS IN Risk PREFERENCE 


Experimental condition (N = 90) Control condition (N = 62 
| $" "Lag I| 
[mea | 
Choice- Uuli- A XE E Utili- ES | Stins ta 
ar tier Risk | Correla- Shifts in risk ules ties’ | Risk | fion |r 
h La r pa — pi pot »à-h ES pi EE -: $ k P 
"rud 7 e **|—r189* | 423 |3.85 | .57** | —.16 | —.62* 
PF uM eimi ss 
C 421 |409| 38 | —.76 | —.62* | —.66** | 419 |4.10| .55** | 30 |—.19 
M 434 4.33 44% | —1,03*** —.96*** | —.92*** | 442 |4.28 | .57** 14 |—.26 
Now 5.66 | 7.01 | .44** 22 27 .66* 6.16 | 7.06 | .33* 33 .42* 
E 6.86 | 7.72 | .50** .86** 5 .87*** | 6.60 | 7.61 | .54** .07 .22 
F 6.57 |8.26| 37 | 61% i4 (609 | 6.67 | 7.60 | .25* 45° | ae 
M 636 |7.66| 44 | 56%" Sà* |. gamn | 647 |T42| 37 | 06 | 35 
Neutral y 
De 6.80 | 7.81 | .53** -08 03 AS 6.61 | 7.71 | .49** —.32 2 
H 6.04 | 7.40 | 40** Th 04 12 6.02 | 6.92 | .59** 06 | 22 
M 642 | 7.60 | .47** 09 04 3 6.32 | 7.81 | .54** —.13 AT 
Note.—All tests concerning shifts are two-tailed correlated ¢ tests, and are based on group means in the experimental con- 
dition (N = 18) and on scores of individual subjects in the control condition (N = 62). 
05, : 
MUR 
***  « .001 


out by the subjects privately with no discussion among 
subjects. Thus the administration of the third question- 
naire measuring po constitutes the experimental 
manipulation of group discussion to consensus. 

"The procedure for the two conditions was identical 
for the first two questionnaires. Subjects arriving to the 
laboratory were seated at small tables. Each subject 
was given the first Opinion Questionnaire. After all 
subjects read the instructions, questions were answered 
by the experimenter. Then, subjects were told to fill out 
the questionnaire at their own rate. Upon completion, 
the questionnaires were collected and the ‘Desirability 
Estimation Questionnaire’ was then given to each 
subject. The same procedure was repeated with this 
questionnaire as well as with the last two questionnaires, 
Opinion Questionnaire (final personal decisions) and 
Desirability Estimation Questionnaire (final personal 
estimation). In each case subjects worked on these 
questionnaires privately. 

After the third questionnaire, Opinion Questionnaire 
(group discussion and group decision), was distributed, 
subjects were instructed to discuss each item and to 
reach a group consensus regarding the minimum 
acceptable probability. In those cases (less than 15%) 
where no consensus was reached within 7 minutes of 
discussion, the experimenter asked the subjects to 
mark their persona] opinion at the moment each on 

his own questionnaire and to start the discussion on 
the following item. The mean of these personal opinions 
was taken in the analysis in place of a unanimous group 


decision. After the procedure of discussion to consensus 
was completed for all items, the questionnaires were 
collected. Then, the last two questionnaires were 
administered as mentioned above. 


Results 


SEU model for individual choice behavior. - 
The first question that should be answered 
before we can turn to the analysis of the group 
process is whether the SEU model proves to 
be a valid model for individual choice behavior. 
In other words, do subjects’ initial decisions 
(i.e., risk-level choices) reflect the relationship 
among utilities as required by SEU theory? 
It was already mentioned that in order to . 
validate the SEU model positive correlations 
between p and f (ie. Ê as in Equation 2) 
should be obtained for each choice dilemma 
across subjects, as well as within subjects 
across several choice dilemmas. Some of the 
results pertinent to this question are presented 
in Table 2. . 

The values of pı are given in odds in 10 
(i.e., the probability multiplied by 10). The | 
values of f were calculated according 19 
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Equation 2 and multiplied by 10 to make 
easier the comparison to the values of p. 

The results indicate moderate correlations 
between f£; and pı for the choice dilemmas 
across subjects. All correlations are significant 
beyond the .05 level. 

The results of the correlations across subjects 
for the choice dilemmas support the SEU 
model. However, since the SEU is a model for 

‘individual. choice behavior, its crucial test 
‘should be based on correlations between p 
and ô across items within subjects. For this 
test a product-moment correlation between 
pand P across the eight choice dilemmas was 
computed for each individual subject. Each 
p - was then transformed to Z score 
(ic, Z = 1/2 logt ) These scores were 
summed and averaged. The transformation 
of the mean Z scores to r yielded r = .74 and 
r = .72 for experimental condition and control 
condition, respectively. Each correlation was 
‘significant beyond the .001 level. Thus, crucial 
support for the SEU model was obtained. 

How the utilities! ratio changes: Analysis of 
the informational influence hypothesis. The 
next analysis is concerned with the question 
of whether informational influence is the 
process responsible for the shifts in risk 
preferences. The results pertinent for this 
analysis are presented in Table 2. 

First, it should be noted that in the experi- 
mental condition, risky shifts were produced 
on the risky items, A, B, and C; conservative 
shifts on the conservative items, D, E, and 
F; no shifts were found on the neutral items, 
G and H (column pg—pi). These shifts in 
risk preference hold up for individual decisions 
following the group discussion as shown in 
column f»— pı. Moreover, no such consistent 
and significant shifts appear in the control 
condition. 

Second, for the experimental condition the 
changes in f are perfectly consistent with the 
shifts in p. There is a decrease in Ê for the risky 
items, A, B, C (p< 01), an increase in f 
for the conservative items, D, E, F (p < .05), 
ind only negligible changes in f on the neural 
items, G and H. The product-moment correla- 
tion between mean shifts in risk preference 
(p-p) and mean shifts in utilities’ ratio 
\Pe—fı) across the eight items is r= .98 


We 
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(p < .001). Though there are some shifts in 
f in the control condition, they are small and 
only two of them are statistically significant. 
More importantly, these small shifts in f 
did not appear consistently in each dilemma 
type nor did they correspond to the changes in 
p. The correlation between f»— pı and f»—fi 
is negligible, .08, and not significant. 

A more stringent test of the informational 
influence hypothesis should be based on the 
correlation between changes in p and changes 
in f within subjects across the eight items, 
For each subject a product-moment correlation 
between shift in risk preference (i.e. p»— py 
and shift in utilities’ ratio, (i.e., 2—p1) was 
computed across the eight items. These corre- 
lations were transformed to Z scores and 
averaged. The mean Z scores of the experi- 
mental and control conditions were trans- 
formed to r again, yielding r = .51 (p < .001) 
and r = .14 (p < .05), respectively. It then 
seems clear that for the average subject in 
the experimental condition a shift in risk 
preference is associated with a change in the 
way he assesses the relative utilities of the 
outcomes (i.e., utilities’ ratio).^ Consequently, 
the information influence hypothesis gains 
additional support. 

Shifts in risk: Analysis of the rationality and 
the affective hypotheses. The next analysis has 
to do with whether the shifts can in part be 
accounted for by increased rationality (i.e., 
better maximization of SEU) or by an 
affective process due to the group discussion. 

According to the SEU model for individual 
choice behavior, a subject determines his risk 
preference on the basis of his assessment of 
the utilities’ ratio. If choices are made on this 
basis, then p should be equal to f. Furthermore, 
if choices on that basis are considered rational, 
then any difference between p and P indicates 
deviation from rationality. If subjects change 
their risk preference (p) so as to make it more 
closely correspond to their evaluation of the 
outcomes (ie. to the utilities’ ratio 2), the 
difference between p and f will decrease; 


5'The correspondence in item means between pı 
and f, and between pz—pi and f»—f (see Table 2) 
replicates earlier findings by Burnstein et. al. (1971). 
So also does the high within-subject correlation coeffi- 
cients between p; and fı and between po—pr and 


fi~ ĝi 


TABLE 3 


MEAN INckEASE IN Goopness or Fit on RISKY 
JONSERVATIVE, AND NEUTRAL Items 


Experimental Control 
condition | condition 
Choice-dilemmas ae. 
type a ^ 
P — Pr in — Pal 
-| -f — 191 — 91 
Risky | 
A — 05 30 
B 0i | —.30 
c 44 —.31 
M 00 —.15 
Conservative | 
D 39 EL 
E 09 AS 
E 14 —.04 
M mi .28 
Neutral | 
G .12 | ES 
H .08 | 16 
M 10 30 


that is, the goodness of fit between p and f 
will increase as follows: | p1—/1|—|p2—f2| > 0 
(see also Table 1). If the increased goodness 
of fit is due to changes in p but not in f, 
enhanced rationality has occurred. If goodness 
of fit decreases due to changes in p, this would 
be evidence for an affective process. Suppose 
the increase in goodness of fit occurs only in 
the experimental condition and not in the 
control condition. Tt clearly must be produced 
by group discussion. 

Suppose the initial p for the risky items was 
above the initial f and the initial p for the 
conservative items was below the initial 
b. That is to say, the person initially 
violates rationality in the former case by being 
too cautious and in the latter case by being 
too risky. Then, if as a result of the group 
discussion on each item the p moved closer 
to the initial f (i.e., decrease in p for risky 
items and increase in p for the conservative 
items), it could be concluded unequivocally 
that the shifts occur because the individual 
has become more rational, and he has become 
more rational because of the discussion. 
Close examination of the data discloses that 
this is not the case. Consider the results that 
were presented in Table 2. It is clear that for 
the three risky items, the risk preferences? P 
before discussion are virtually identical to 
those of the Ê (p = 4.17, 4.73, and 4.00; 
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p = 4.12, 4.71, and 4.21, respectively). So, 
the risky shifts produced on these items cannot 
be accounted for by increased rationality, 
since there were no deviations from rationality 
on these items to begin with. 

For the three conservative items the initial 
p was always higher (rather than lower) than 
the corresponding (p = 7.01, 7.72, and 8.26; 
p = 5.66, 6.86, and 6.57, respectively). So, 
if any process of rationality occurs on these 
items, it should lead to risky shifts! 

When more rigorous analysis of the data is 
performed to test the rationality hypothesis, 
the rationality hypothesis becomes untenable. 
According to the model presented in Table 1, 
in order to support the rationality hypothesis 
the analysis requires that |pi—f1| > | po—fo| 
for total means of both risky and conservative 
items, provided that the shifts in p are greater 
than those in f- (Le, |p2—pi| > |f»—£1l)- 
The results of this analysis are presented in 
Table 3. 

These data make it evident that the risky 
items do not demonstrate any increase in 
rationality that could account for the risky 
shifts. The conservative items demonstrate 
a negligible increase in goodness of fit. How- 
ever, as will be shown below, this increase in 
goodness of fit in the experimental condition 
was not due to changes in p, but rather to 
changes in f, and therefore, cannot be inter- 
preted as a result of a rationality process. 

If we reexamine Table 2, it is clear that the 
small increase in goodness of fit that was 
found in the earlier analysis cannot account for 
amy part of the shifts in risk preference due to 
group discussion. For the risky items, the 
changes in f were virtually identical to those 
in p (—.92 compared to —.96, respectively). 
For the conservative items the changes in $ 
were even greater than those in p (.74 com- 
pared to .53). Consequently, increased good- 
ness of fit due to changes in p alone (shifts in 
risk preference) is nowhere in evidence. ] 

Finally, similar increase in goodness of fit 
also appeared in the control condition where 
no consistent shifts toward conservatism were 
found. Thus, this increase cannot be attributed 
to the conservative shift and vice versa. 

Though the shifts toward greater risk and 
conservatism can in no way be explained by 
an increase in rationality, they cannot be 
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ned by an affective process cither. This 
ause only increases in the goodness of fit 
p and f were observed, rather than 
es required by the affective hypothesis. 
ore, these results provide even stronger 
lence against the affective hypothesis than 
gainst the rationality hypothesis. 


lusions and Discussion 


experiment produced results which 
ide direct support to the SEU model of 
idual choice behavior and also to the 
formational influence hypothesis. It also 
iroduced direct and strong evidence against 
le affective hypothesis and the rationality 
pothesis purported to account for the shifts 
risk preference. 

"However, some potential weakness of this 
vidence must be noted and dealt with. When 
both measurements of p and f are obtained 
efore and after group discussion, it is possible 


ind the concomitant changes (shifts) in f 
ind f occurred not because of the causal 
relationship between utilities’ ratio and risk 
preference as SEU theory suggests, but 
because the subjects were adapting one 
Measure to the other for reasons unrelated to 
EU theory. We shall term this possible 
"effect the "consistency effect.” 

The following two experiments provide 
additional tests for the hypotheses while 
attempting to minimize the possibility of a 
consistency effect. This possibility can be 
minimized by employing measures of p and f 
‘that make it difficult for subjects to see the 
elationship between the measures. Under these 
Conditions, consistent responses are unlikely 
to be the result of an artifact. This method 
vas used in Experiment I and more strin- 
ently in Experiment III. Generally speaking, 
in these two experiments the utilities’ ratio p 
Was not measured directly as in Experiment I, 
but instead, subjects made separate assess- 
nts of the utility of each outcome of a 
hoice dilemma. (for each choice dilemma) in 
riment II, and also made independent 
essments (i.e., using different set of scales 
the utility of each outcome) in Experiment 
. The utilities’ ratio for each choice dilemma 
then computed on the basis of these three 
flerent assessments according to Formula 2 
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rather than on the basis of one assessment as 
in Experiment I. These methods of assessing 
utilities also provide more detailed information 
on changes in utilities of specific outcomes, 
and consequently, they may make it possible 
to determine which of the outcomes are 
reassessed before there is a change in subjects" 
risk preference. 


Experiments Il AND HI 
Method 


Subjects. A total of 164 male students from the 
introductory courses in psychology at the University 
of Michigan participated as subjects in the two experi- 
ments. The students participated during the winter 
semester of 1970 as part of their course requirements. 
Eighty-five subjects participated in Experiment Il; 
all were run in seventeen 5-man groups. Seventy-nine 
subjects participated in Experiment III. They were run 
in fifteen 5-man groups and one 4-man group, composing 
a total of 16 groups. Each experimental session with a 
group lasted approximately 75 minutes. 

Materials. (a) Risk-taking measure: The same choice- 
dilemmas task (i.e, Opinion Questionnare) that was 
used in Experiment I for the measurement of fi, fo, 
and p: was used in these two experiments. Due to a 
time limitation on the availability of the subjects for 
these two experiments, Item C was omitted from the 
questionnaires of Experiment II, and Items C and D 
were omitted from the questionnaires of Experiment 
ILL. The scoring of the risk preference (p) was identical 
to the standard method used in Experiment I. 

(b) Assessment of utilities’ ratio: Two different 
methods of assessing utilities were attempted with 
questionnaires titled General Desirability Estimation 
Questionnaire. 

In Experiment II, the utilities of the outcomes, 
Uy, Us, and Ue, were each assessed separately. Beneath 
each of the seven choice dilemmas there was a desir- 
ability scale that consisted of a line (140 millimeters) 
with the left and right end points representing the 
least and most desirable event or outcome that the 
subject could conceive. For example, beneath the item 
concerning the football captain and his dilemma the 
following instructions and scale were provided: 


Try to place yourself in the position of Mr. A. 


Indicate with lines across the scale, and letters 
above the lines, the desirabilities of the following 
alternatives and their outcomes: 

F:Risky play attempted, resulting in defeat. 

S:Risky play attempted, resulting in victory. 

C:The certain play attempted, resulting in a tie. 


Lowest Highest 
desirability desirability 


Subjects were instructed to make their assessments 
in the same order presented by the letters F, S, and CG 
Two elaborate examples of how to make these ratings 
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on a choice-dilemmas item were presented at the begin- 
ning of the questionnaire. 

The utility of each outcome was scored according to 
the distance in millimeters of the intersection of the 
line representing this utility from the left-hand end point 
of the desirability scale. Thus low values indicate low 
utilities. The utilities’ ratio was calculated according 
to Formula 2 and multiplied by 10 to make the scores 
comparable to the risk-taking score as follows: 

% Us — Ur 
digi 7o ^ 

In Experiment ILL, as in Experiment II, the assess- 
ment of the utilities’ ratio (f) again involved the 
separate assessment of each of its components, that is 
Us, Us, and U.. However, unlike in Experiment Il, 
these assessments are done independently of one another 
on separate sets of semantic differential scales provided 
beneath each item. For example, beneath the item 
concerning the football captain, the following instruc- 

tions and scales were provided: 


Try to place yourself in the position of Mr. A. 
Risky play attempted, resulting in victory: 
undesirable —:__:__:__: ea desirable 


good —:— bad 
harmful —:— beneficial 
rewarding punishing 


The four scales above were also provided for the other 
two outcomes: “Risky play attempted, resulting in 
defeat" and “Certain play attempted, resulting in a 
tie.” One elaborate example of how to make these 
ratings was given at the beginning of the questionnaire. 

The particular four bipolar adjectives chosen are 
known to have high loading on the evaluative factor of 
the semantic differential (see Osgood, Suci, & Tannen- 
baum, 1957). The positions of the positive and negative 
adjectives were reversed from left to right all along the 
Sets of scales so as to make it difficult to respond in an 
ostensibly consistent fashion. 

A score of 1 was assigned to responses at the negative 
end point, increasing up to 9 for responses at the 
positive end point. For each set of four scales, the four 
scores were then summed. This sum was taken as the 
score representing the utility of the particular outcome 
assessed by the four scales. Thus, low scores represent 
low utility outcomes, and high scores represent. high 
utility outcomes. The utilities’ ratio ($) was computed 
from the outcomes utilities’ scores in the same manner 
as in Experiment II. 

Experimental design and procedure. The experimental 
design and procedure of Experiments II and III was 
identical to that of the experimental condition in 
Experiment I. The difference among them lics only 
with the methods employed for assessing the utilities’ 
ratio. 


Resulls 

SEU model for individual choice behavior. 
The following analysis is identical with that 
performed on the results of the previous 
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experiment, Again we are concerned with the 
validity of the SEU as a model for individual 
choice behavior. The results relevant to this 
analysis are the product-moment correlations 
between utilities! ratio (f) and risk preference 
(p) across subjects for each item. But even 
more relevant are the correlations within 
subjects across the choice-dilemma items. 
Some of these results are presented in Table 4. 

The results of Experiment II demonstrate 
moderate correlations between utilities’ ratio 
and risk preference for the choice dilemmas 
across subjects, All the correlations are 
significant beyond the .01 level. The results of 
Experiment III are not fully consistent with 
those of Experiment II, as only three out of 
the six correlations are of moderate value 
and significant. 

Thus, the correlations across subjects for 
the choice dilemmas support the SEU model, 
with the exception that in Experiment MI 
there were three items that did not demonstrate 
significant correlations. Close examination of 
the data revealed that on these items, there 
were a few subjects (less than 2%) that 
deviated from the basic assumption of the 
model and assessed the utilities of the outcome 
of the certain alternative U, either lower than 
Uy; or higher than U,. These assessments 
strongly affected the utilities’ ratio and created 
strong deviations that led to the small corre- 
lation of these particular items. 

However, these deviations from the model 
that were caused by a few subjects could not 
affect the crucial test of SEU model: the 
correlations within subjects across the choice 
dilemmas. Again, product-moment correlations 
within subjects were computed and trans- 
formed to Z scores. The mean Z scores were 
.65 in Experiment II and .38 in Experiment 
III, corresponding to r — .57 and y = .36, 
respectively. Although these correlations be- 
tween p and f are somewhat weaker than 
those obtained in Experiment I, they are 
nevertheless statistically significant (p < .001 
for both). 

How the utilities’ ratio changes: Analysis of 
the informational influence hypothesis. The 
results pertinent for this analysis are presented 
in Table 4. 

The pattern of results of Experiments I 
and III is virtually identical to that produced 
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TABLE 4. 
Propuct-MomENtT CORRELATIONS BETWEEN UriLrrikes' Ratio (f) AND Risk PREFERENCE ($) AND 
MEAN SHIFTS IN Risk PREFERENCE AND IN Utitrtres’ RATIO 
Experiment II (N = 85) Experiment MI (N = 79) 
ice-diler s Shifts i S i 
Choice-dilemmas | Correlation Shift in risk utilities Correlation Shift in risk Siilitie 
ratio ratio 
= e 
É r p—h | »-hnh|Ph-h r pa-p | -h | $:-h 
Risky 
A spr -—12099 foa S Ua |) = gare .40** —1.15*** | —1.15*** | —1.88 
3 Eid —113*** | —.79** | —1.07 18 —102**| —.93*** | —2.02 
M A0** -147**|--—.9]7**|.—.95** Datus —1.08*** | —1,04*** | —1.95* 
Conservative 
D nM .26 38 93 
E 4 .74* .68* 37 .28* 1.08** ees .69 
I .41** 33 130 .88** .32** .96*** 95*** 1.37** 
M agg .44** A6** .60** 30 1.02*** 95er 1.03** 
Neutral 
G 29** —.14 7i —.52* .12 .02 .08 —.14 
H .49** .26 Age 1.01* .06 07 10 93 
M 3918 .06 46 24 .09 .05 09 39 


Note. —ÀAll tests concerning shifts are two-tailed correlated / tests and are based on group means in both Experiment II 


(N = 17) and in Experiment III (N = 16). 
tp <.05. 
**p «.01. 
*** p < 001. 


in the experimental condition of Experiment I. 
Risky shifts were produced on the risky items 
(p < .001), conservative shifts on the con- 
servative items (significant beyond .01 for 
every item in Experiment III and for the 
pooled data in Experiment II), and no shifts 
were found for the neutral items (see columns 
pa—pi). These shifts remain for individual 
decisions following the group discussion as 
shown in column f»— pı. Moreover, the shifts 
in utilities’ ratio (f) are almost perfectly 
consistent with the shifts in risk preference 
(p). There are decreases in f on the risky items ° 


‘These decreases (Le., negative utilities’ ratio 
Shiíts) are substantial. Nevertheless, the mean shift in 
ltem B in Experiment II and in Items A and B in 
Experiment III is statistically not significant. The 
lack of statistical significance is the result of the 
- responses of three subjects (one in each item) whose 
- utilities" assessments deviated from the basic assump- 
tion concerning the applicability of the SEU model for 
à choice dilemma, that is, the assumption stating that 
U,« U, « U,. These subjects assessed the utility 
of the certain alternative (U,) either much lower than 
the utility of failure (Uy) in the uncertain alternative, 
Or much higher than the utility of success (U,) in the 
Uncertain alternative. These few, but strong, “deviant” 
Tesponses strongly affected the magnitude of the 


and increases in f on the conservative items. 
These shifts based on pooled data obtained 
from items of the same type are statistically 
significant beyond the .05 level. Although 
there are some shifts in f on the neutral items, 
they are not consistent. The correlation 
between mean shifts in risk (p»—5:) and in 
utilities’ ratio (f»—f1) across items is 7 
(p< .05) for Experiment Il and r 
(p « .01) for Experiment III. 

Again, the more stringent test of the informa- 
tional influence hypothesis regarding the 
correlations between shifts in p and f within 
subjects was performed. The correlation 
between pə—pı and f4—f: across items was 
computed for each subject and transformed 
to Z scores. The mean Z scores were trans- 
formed again, resulting in correlations of 
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utilities’ ratio and the variance, resulting in shifts 
which are statistically not significant. To demonstrate 
this we eliminated the data of the groups where the 
deviant responses occurred (only one group in each 
item) and computed the mean utilities’ ratio shift 
again. The computation produced substantial negative 
shifts of —.57 for Item B in Experiment II, and 1.05 
and —.92 for Items A and B, respectively, in Experi- 
ment III, all significant beyond the .05 level. 


= 
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TABLE 5 
MEAN SHIFTS IN Outcomes’ UTILITY (Uy, Uc, Us) Fottowine Grour Discussion 
Experiment II (N = 85) Experiment III (N = 79) 
Choice-dilemmas type — 
Ur: — Un Uis —Ua Uis — Un Uy. — Un Uc — Ue Un —Us 
rif 2.99 =3.79 A3 1.74** — 1.03 .63* 
B 6.91** —1.43 .96 183** =.67 67* 
M 4.95** —2.61 ESI 1.78** =:85* .65** 
Conservative 
3,68 6.15* —1.82 
E — 2.58 1.72 2.66 —.06 ES 44 
E .02 8.54** — 1,08 —44** 3.14*** —.02 
M 37 5.47** .08 25 Lye 23 
Neutral | z 
—.26 —235 2.13 —.26 —.08 | Ai 
H ES 10.29 — 350°" 64 Beg | v — 40 
M .09 3.97 71 19 1.09 18 
Note.—All tests are two-tailed correlated ¢ tests. j à X 
*p < .05. 
»*5 «.01. 
p < 001. 


r = AQ and r = .63 for Experiments I and 
III, respectively. Both correlations were 
statistically significant beyond the .001 level. 

The pattern of results demonstrated by 
Experiments II and III is identical to that 
produced by Experiment I. Consequently, 
substantial support has been added to the 
informational influence hypothesis. 

Shifis in utility of individual outcomes: 
The utilities’ ratio decomposed. The utilities’ 
ratio was obtained in Experiments IT and III 
through direct assessments of its components 
(i.e. the utilities of the three outcomes, Uy, 
U,, and U,, involved in each choice dilemma). 
Thus it may be possible to find out whether 
the shifts in risk preference occurred as a 
result of a change in the utility of a particular 
outcome or, alternatively, whether the shifts 
occurred as a result of changes in the assess- 
ment of all outcomes or different ones depend- 
ing on the particular item, For this analysis, 
the assessment of each outcome after the group 
discussion was subtracted from the initial 
assessment before the group discussion, The 
mean changes (shifts) in the assessments of 
outcomes’ utility for each item are presented 
in Table 5. Keep in mind that the values 
tabulated in Experiment II represent milli- 
meters and those in Experiment III represent 
sum of scores of four semantic differential 


scales, The scores therefore cannot be compared 
across the experiments. 

Overall, the results in Table 5 show à 
complex pattern. The extent to which changes 
in utilities of specific outcomes contributed to 
the change in the ratio varies from item to 
item. For example, in Item A, the increase in 
Uy, in combination with a decrease in Us 
contributed most to the decrease of the 
utilities’ ratio for that item. For Item B, the 
change in U; alone contributed more than any 
other utility change to the decrease in the 
utilities’ ratio. Similarly, for Item F, the main 
determinant of change in the utilities’ ratio 
is the change in U.. The results of the above- 
mentioned items are identical in both experi- 
ments, and those results concerning the remain- 
ing items are also comparatively close. 

It therefore seems that the locus of the 
informational influence varies from item to 
item; on some items, the major informational 
influence concentrates on the assessment of 
the utility of one particular outcome oF 
alternative, and on other items, it concentrates 
on the utility of another outcome, or combina- 
tions thereof. There was no completely 
consistent pattern of influence that cut across 
the items. Nevertheless, it seems that the 
informational influence on the risky items 15 
mainly concentrated on the consequences of 
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jlure at the uncertain alternative, whereas 
informational influence on the conservative 
s is mainly concentrated on the conse- 
ences of the certain alternative. 
Shifts in risk: Analysis of the rationality 
pnd the a fj ective hypotheses. Recall that support 
r the rationality hypothesis requires that 
f—P:| — | P2—b2| > O for the total means 
both risky and conservative items, provided 
this increase in goodness of fit occurred as a 
result of shifts in p but not in p. The results of 
is analysis are presented in Table 6. 
It is clear that no real increase in the good- 
ess of fit following discussion occurs in either 
eriment. However, the conservative and 
the neutral items show some small increases in 
the goodness of fit, but, as in Experiment 1, 
these cannot be interpreted as enhanced 
rationality. Closer examination of the data 
presented in Table 4 discloses that the increase 
in goodness of fit for the conservative items was 
‘the result of the big shifts in f rather than in p, 
contrary to what the rationality hypothesis 
predicts. For these items, the mean shift in 
p was 46, and the mean shift in B was .60 in 
Experiment II. These mean shifts were .95 
and 1.06, respectively, in Experiment IT. 

It is also clear that no real consistent 
decreases im the goodness of fit occurred in 
Experiments II and IH either. This and the 
above-mentioned results are therefore strong 
evidence against the affective and the rational- 
ity hypotheses. 


Conclusions 


Although different methods of assessment 
have been used to obtain the data in Experi- 
ments II and III, the results of these experi- 
ments are in perfect agreement and also 
parallel the results of Experiment I. The 
J a consistency effect has been 
harply reduced by the indirect assessment of 


for the conclusions that have already been 
drawn on the basis of the results of Experi- 
Ment I. 

Overall, the results of Experiments IL and 
I lend additional support to the SEU model 
individual choice behavior and the informa- 
ional influence hypothesis as an explanation 
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TABLE 6 


Mean INcREASE IN Goopness OF Fir ON RiskY, 
CONSERVATIVE, AND NEUTRAL ITEMS 


Experiment IT Experiment III 
| A 
Choice-ditemmas | goodness of ft Seea p 
I 
|i Px} Iii — Pal 
— |p: — Pal — 2: — Pal 
Risky 
A .30 —.73 
B —.18 44 
M .06 —45 
Conservative 
17 
E 31 29 
F .56 42 
M 14 33. 
Neutral 
G —13 —.22 
H 57 .83 
M 22 .30 


for the shifts in risk preference. It seems that 
the informational influence on the risky items 
is mainly concentrated on the consequences of 
failure at the uncertain alternative, whereas 
the informational influence on the conservative 
items is mainly concentrated on the conse- 
quences of the certain alternative. 

Finally, the results of these two experiments, 
together with the results of Experiment I, 
demonstrated fairly conclusive evidence against 
the enhancement of rationality and the 
affective hypotheses as explanations of the 
shifts in risk preference. 


EXPERIMENT IV 


In this experiment the potential occurrence 
of a consistency effect is entirely eliminated by 
omitting all measures of risk preference from 
the design. Thus only the assessment of the 
utilities’ ratio is obtained before and after 
group discussion. Furthermore, unlike all other 
experiments, the group discussion in this ex- 
periment is on the utility of the certain alter- 
native, U,, relative to the utilities of the out- 
comes of the uncertain alternative, Us and Us 
In other words, the group discussion to con- 
sensus is on the utilities’ ratio, Ê, rather than 
on risk, p. According to the SEU theory and 
the informational influence hypothesis, we ex- 
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TABLE 7 
Mean Suirers is UriLiTIES Rano (p) IN EXPERIMENTS I AND IV, Mean SHIFTS IN Risk PREFERENCE (5) 
IN EXPERIMENTS I, IT, AND III COMBINED l 
2 n Mean shifts in utilities’ ratio Mean shifts in risk 
e] Experiment IV (N = 16 groups) E, | gil 
| fom 5 — bo fh h-R | s-a 
iby | 
dr pape —.05 Sat p fpe c 
B —1L55* 29 = 1.26" — 92s 
C —.42 —.04 —.46 —.66** = 
M —]1.12*** 10 21:03 9x TORRT | = 
| 
Conservative | T 
pode 18 —.14 04 .66* .32* 
E 1.38** —.12 1.26*** Bt SUTEN 
F 1 69*** —.06 1.63*** .60** .60*** 
M 1.08" E ge a | 57% 
Neutral I 
G —30 .06 —.24 AS 00 
H 35 ESTI 24 12 | 19 
M .02 -5,02 .00 BE | i0 
Note. —All tests are two-tailed correlated ¢ tests based on group means, 
*p «.05. 
»*p«.01 
3t p < 001 


pect to find the same shifts in the utilities’ ratio 
in this experiment as in Experiment I. These 
shifts should correspond in direction to the 
shifts in risk found in Experiments I, II, and 
HI on the same items. 


Method 


Subjects. Seventy-six subjects participated in this 
experiment. The subjects were male students from the 
introductory courses in psychology at the University of 
Michigan. They participated in the experiment during 
the winter semester of 1970 as part of their course 
requirements. The subjects were run in 16 groups; 13 
five-man groups, 2 four-man groups, and 1 three-man 
group. 

Experimental design and procedure. Briefly, the 
experimental design and procedure consisted of the 
administration of three questionnaires in succession. 
These questionnaires were identical to those used in 
Experiment I. The first questionnaire was administered 
to obtain subjects’ private assessment of the utilities’ 
ratio on each choice dilemma before the group discussion 
(P). The second questionnaire was administered to 
obtain unanimous group decision (assessment) regard- 
ing the utilities’ ratio (Êa). The third questionnaire 
was administered to obtain private assessment of the 
utilities’ ratio after the group discussion (53. The 
experimental procedure is identical to that used in 
Experiments I, II, and III, the critical difference being 
that the group must decide about U, relative to Uy 


and U, (i.e., the utilities’ ratio), instead of 
(ie. risk preference) as in past work wl group 
decisions are always about f. The instruc ons were 
identical to those preceding the Opinion Questionnaire 
that was used in Experiments I, II, and III to obtain 
group consensus on risk level, that is, pq. 


about $ 


Results and Discussion 


The results are presented in Table 7. For 
comparison purposes, data from our earlier 
experiments are included. 

These results demonstrate a clear-cut pat- 
tern. Negative utilities’ ratio shifts were 
produced in the risky items (significant 
beyond the .001 level for the pooled data). 
Positive utilities’ ratio shifts were produced in 
the conservative items (significant beyond the 
001 level for the pooled date). The utilities 
ratio shifts that were produced in the neutral 
items were small, inconsistent, and statistically 
not significant. Moreover, the negative and 
positive significant shifts in utilities’ ratio also 
appear in comparing before and after group 
decisions of the utilities’ ratio (see column 
po—p,). This indicates private acceptance of 
the group decisions regarding the assessment 
of the utilities’ ratio with only negligible 


differences between the group decisions and 
the second private assessments (see column 
f:—/c). Furthermore, these shifts are very 
close to those found in Experiment I, 

The product-moment correlation between 
the mean shifts in utilities’ ratio (column 
f:— pı) of the present experiment and the 
mean shifts in risk preference in Experiments 
I, II, and III (column ?»—51) is r= .93 
(p < .01). Thus, these results regarding shifts 
in utilities’ ratio closely parallel the pattern of 
results regarding shifts in risk preference which 
were obtained on the same choice dilemmas 
in Experiments I, II, and III. This rules out 
the interpretation that the results of these 
experiments were due to a consistency effect, 
that is, subjects adapting their assess- 
ment of utilities’ ratio to changes (shifts) in 
risk preference in order to appear consistent. 

According to the SEU theory and the 
informational influence hypothesis, the shifts 
in risk are merely an epiphenomenon of the 
changes in the assessment of the outcomes 
(ic. utilities’ ratio shifts). Therefore, it is 
reasonable to expect that the latter shifts 
would be of equal or greater magnitude than 
the former ones. For this analysis the means 
of the utilities" ratio shifts in Experiment IV 
were compared to the means shifts in risk 
preference in Experiments I, IL, and III. 
The mean shift in utilities’ ratio for the risky 
items was 1.02, and the corresponding risky 
shift was .90. The mean shifts in utilities' 
ratio for conservative items was .97, and the 
corresponding conservative shift was .57. Thus 
on both risky and conservative items the shifts 
in utilities’ ratio were greater than the shifts in 
risk preference, as predicted by the informa- 
tional influence hypothesis. 

Thus, all the results of this experiment 
provided strong additional evidence in support 
of the informational influence hypothesis. 


CONCLUSIONS AND DISCUSSION 


The findings of these experiments demon- 
strated that individuals’ decisions regarding 
tisk could be accounted for by a rational 
decision rule which is maximization of subjec- 
tive expected utility (SEU). It was found 
that individuals’ risk-level choices were corre- 
lated with their assessments of the utilities of 
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the potential outcomes of a hypothetical 
choice dilemma. It is commonly accepted in 
psychology that an individual's choice behavior 
is determined by his assessment of the potential 
rewards and punishments (i.e, utilities) in 
the situation. If so, the correlations obtained 
between risk-level choices and the ratio of the 
relative utilities of the outcomes indicate a 
causal relationship. That is to say, the individ- 
ual's assessment of the potential utilities 
determines his risk-level preference. 

Four experiments demonstrated that shifts 
toward greater risk and conservatism following 
group discussion were strongly correlated with 
equivalent shifts in the assessment of the ratio 
of the relative utilities. Consequently, it 
seems reasonable to conclude that the shifts 
in risk are due to a cognitive process of 
informational influence regarding the assess- 
ment of the utilities of the outcomes in the 
choice-dilemma situation. Furthermore, no 
evidence was found in support of either of two 
other explanations for these shifts in risk 
following group discussion. First, there was no 
evidence supporting a cognitive process of 
increased rationality where individuals come to 
adhere more closely to the SEU maximization 
rule as a result of group discussion. Second, 
there was no evidence in support of an affective 
process where new utilities that are associated 
with the circumstances under which the 
decision is made (e.g. utility for risk, for 
appearing risky) enter into consideration and 
influence subjects’ decisions. In fact, most of 
the evidence is against this hypothesis. 

Of course there remains the question of how 
the informational influence operates during the 
group discussion. Vinokur (1969, 1971) has 
suggested that the availability of persuasive 
arguments regarding outcomes, and the likeli- 
hood that these arguments will be raised during 
group discussion, determine the informational 
influence exerted and the amount and direction 
of the shift in risk. 'The following explanation 
is offered : 

For any choice-dilemmas problem there exists 
a pool of persuasive arguments for each alterna- 
tive course of action. A persuasive argument 
can be equated with a correct solution in à 
group problem-solving situation. Once it has 
been brought up it is likely to be accepted by 
the subjects as valid. It is assumed that a 
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subject reading a choice-dilemma item attends 
to the information contained in the item, 
makes some inferences, and organizes the 
information into arguments for and against 
the risky and the cautious alternative. Then, he 
reaches a decision concerning his preferred 
risk level on the basis of these arguments. 
If he comes up with many persuasive argu- 
ments favoring a particular alternative (i.e., 
the risky or the cautious one), he is likely to 
take an extreme position (ie, either very 
high or very low risk level, depending on which 
alternative the arguments favor), and to be 
very confident and committed to his decision. 

If for a given choice dilemma there is a 
preponderance of persuasive arguments for one 
alternative, then individual and group shifts in 
risk level are predicted to occur in the direction 
of the alternative favored by the arguments. 
The strength of the shift depends first on the 
probability that an average subject will 
possess all or most of the persuasive arguments. 
If this probability is either very high or very 
low, strong shifts will not occur. If the prob- 
ability is high, the individuals have already 
made their initial decision on the basis of the 
persuasive arguments. Thus, prior to discussion 
the group members are already extreme. If the 
probability is very low, it is unlikely that 
individuals will come up with those persuasive 
arguments, even after considerable thought, 
Therefore, it is also unlikely that a group will 
contain individuals who possess these persua- 
sive arguments and use them in discussion. 
For these reasons, the strongest shifts are 
predicted on choice items with a preponderance 
of persuasive arguments for one alternative 
but with a moderate probability that an average 
individual will possess them. If group shift 
toward greater risk (or conservatism) following 
persuasive arguments is substituted for group 
movement toward better problem solving, 
then the analogy between the risky shift and 
group problem solving becomes apparent. Tt is 
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for future research to determine whether the 
psychological mechanisms underlying these 3 
phenomena are merely analogous or in fact 
identical. 
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The potential applicability of two social choice models to the group decision- 
making process within the risky-shift paradigm was examined to determine if 
the risky-shift phenomenon was, in part, due to the use of an inappropriate 
model to estimate the most likely group decision. Both models were modifica- 
tions of the simple majority rule model. The predictive efficiency of these 
models was compared with that of a least-squares estimate of the most likely 
group decision in 10 three-person and 10 four-person groups. The results 
indicated that (a) there was no difference in the efficiency of the social choice 
models, and (6) both models provided a significantly more accurate estimate 
of the group decision than the least-squares estimate. An examination of the 
group decision not predicted by the social choice models strongly suggested 
that the risky-shift phenomenon may be due to the use of a procedure con- 
founding the processes of attitude formation and group decision making, as 


The apparent paradox in the group choice 
shift effect typically observed in risky-shift 
studies centers around two interrelated find- 
ings: (a) Individuals’ personal preferences 
are, on the average, more risky following a 
group discussion than prior to such a discus- 
sion; and (b) group decisions are riskier than 
the average of the group members' prediscus- 
sion preferences. These two findings are gen- 
erally treated as synonymous, with the former 
being seen as the cause of the latter. A cen- 
tral, unstated, tenet of the latter aspect of 
such typical findings is that the mean of the 
. individual prediscussion preferences is the 
most reasonable estimate of the group decision 
in such a situation. 

The use of the mean as a reasonable esti- 
mate of the expected group decision in such a 
situation may be questioned on a variety of 
counts. Statistically, the mean, as the first 
moment of a distribution, is that point about 
which the sum of squared deviations are 
minimized for the distribution and also that 
point about which the algebraic sum of the 
deviations is zero. These particular properties 
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well as the use of an inappropriate model for group decision making. 


of the mean led Sir Francis Galton (1907) to 
conclude that the mean was a particularly in- 
appropriate estimator of a group position in 
a situation where the variable under consid- 
eration is measurable, Galton noted that 


the average . . . would give a voting power to 
cranks in proportion to their crankiness. One ab- 
surdly large or small estimate would leave a greater 
impress on the result than one of a reasonable 
amount, and the more an estimate diverges from the 
bulk of the rest, the more influence would it exert 
[p. 414]. 


Galton felt that the estimate about which the 
least objection could be raised was the middle- 
most estimate—the median. The median has 
two properties which make it particularly 
attractive in such a situation. The number of 
preferences above it is exactly balanced by 
the number below it. Further, the median is 
that point in a distribution about which the 
absolute sum of the deviations is minimized.* 

Another objection to the use of the mean as 
an estimate of a group decision stems from 
the task characteristics found in most risky- 


? For a mathematical proof of this statistical char- 
acteristic of the median, the interested reader is 
referred to Sokolnikoff (1939, pp. 121-122). The au- 
thors are indebted to Ben J. Winer and Peter 
Schénemann for their assistance in locating this 
proof. 
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shift studies—particularly those using Wal- 
lach and Kogan’s (1959) Choice Dilemmas 
Questionnaire. The structure of the Choice 
Dilemmas Questionnaire can be seen as pre- 
cluding averaging as a reasonable group de- 
cision-making scheme in most instances. Sub- 
jects are typically required to choose one of 
six distinct alternatives. The group decision 
must also be a preference for one of these 
six distinct alternatives. Thus, in a variety of 
instances, the group might not have an alter- 
native available for selection which would 
correspond to the average of the group mem- 
bers’ initial preferences. Black (1958)* and 
Taylor (1969) have shown that in such in- 
stances (choices made from a set of discrete 
alternatives) some form of a majority-rule 
decision-making scheme would provide a 
mathematically more representative and/or 
satisfactory group decision than any type of 
averaging model. In this instance it is assumed 
that the degree of group satisfaction is deter- 
mined by the number of individuals satisfied 
with the decision. 

These considerations suggest that a portion 
of the risky shift, as it relates to a compari- 
son of individual prediscussion and group 
preferences, may be attributable to the use of 
an inappropriate estimate of the expected 
group decision. At the very least, such consid- 
erations suggest that the extent to which 
groups are riskier than individuals should be 
assessed against a more appropriate social 
choice model than used previously. 

The effectiveness of a social choice model, 
such as majority rule, in predicting group de- 
cisions on the Choice Dilemmas Questionnaire 
would not discount the role played by soci- 
etal values in the initial selection of alterna- 
tives by subjects. The apparent shift, how- 
ever, would be seen as arising from the process 
used by the group to reconcile differences in 
preferred courses of action. The applicability 


* The proof of the representativeness of a majority- 
rule decision scheme presented by Black (1958) is of 
particular interest in the present context, As a por- 
tion of his proof, he shows that the majority-rule 
solution will be the median of the individual pref- 
erences (a) when individual preferences are ex- 
pressed along some measurable dimension and (b) 
when there is no indifference among alternatives on 
the part of the group members, 
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of a social choice model would not rule out 
the possibility that individuals would change 
their preferences as a result of a group dis- 
cussion or decision. Rather, it would be hy- 
pothesized that the group decision would be 
based primarily upon the initial preferences 
of the members. (This, of course, assumes 
that the initial preferences are reliable and 
transitive and that all group members are 
not totally indifferent to the alternatives.) 
Any changes in individual preferences would 
thus be expected to occur affer the group po- 
sition (ie, a normative frame of reference) 
was established. 

The present study examined the extent to 
which two such social choice models, derived 
from the simple majority-rule model, could 
accurately predict the apparent group shift in 
three- and four-person groups. Also examined 
was the extent to which such social choice 
models actually provided a more accurate 
estimate of the group decision than a least- 
squares (LS) estimate. (Such an LS estimate 
is the same type as that provided by the 
mean of the individual choices, except that it 
is for one of the specific alternatives available 
to the group.) The two social choice models 
which were examined were the modified ma- 
jority-rule model described by Luce and 
Raiffa (1957) and a level-of-aspiration model 
based on Siegel's (1957) work and described 
in detail by Harnett (1967). Both models 
avoid the problem of predicted intransitivities 
in group preferences in those instances where 
the group members have highly dissimilar 
preference profiles, which characterize the 
simple majority-rule model (Luce & Raiffa, 
1957). 

The modified majority-rule (MR) model 
uses the individuals’ rankings of available al- 
ternatives as the basis for predicting the 
group decision. The expected group ranking 
of alternatives is determined by the magni- 
tude of an index calculated as follows. An 


index, V(x), is calculated for each Alterna- - 


tive x, which denotes the number of distinct 
paired comparisons over all individuals in 4 
group which involves x and which, from the 
rankings, would be expected to be resolved in 
x's favor, less the number which would be 
expected to be resolved against x. 


When information concerning indifference 
among alternatives is considered, the expected 
majority-rule group decision may still be de- 
rived in the same fashion as previously de- 
scribed (MR/in model). The index V(x) cal- 
culated for each alternative is dependent upon 
the number of paired comparisons involving x 
which is resolved in x’s favor, and that num- 
ber resolved against x. Those comparisons in 
which the individuals are indifferent between 
x and some other alternative(s) would not be 
considered as either for or against x in deriv- 
ing the index. 

A major shortcoming of the MR model as 
a descriptive model, with or without a con- 
sideration of indifference, is that no distinc- 
tion is made between choices which are satis- 
factory and unsatisfactory to an individual. 
Consider a group of three, where one indi- 
vidual strongly prefers Alternative y to x, and 
two individuals rate them nearly equal, but 
would rank x slightly ahead of y. In this in- 
stance, an MR model would predict that x 
would be the group decision, even though all 
three individuals might agree that y is a bet- 
ter group solution. The aspiration level (AL) 
model, in contrast, takes strength of individual 
preferences into consideration and would name 
y as the group solution. 

Specifically, under the assumptions oí the 
AL model, the group ranking of alternatives 
would be expected to maximize satisfaction 
with the ranks assigned to alternatives. This, 
of course, assumes that group satisfaction may 
be equated with the number of individuals 
satisfied. In determining group satisfaction, 
each member is weighted equally according 
to whether or not an alternative meets his 
aspiration level. (Aspiration level is here de- 
fined as the upper bound of the longest inter- 
val, as determined on a higher ordered metric 
scale [Siegel, 1957] separating adjacent 
ranked alternatives.) Let U;(x;) be the utility 
of the jth alternative to the ith individual (i 
=1 24 nj — 1,2, 5) and define a 
satisfaction index S,(a;) as follows: 


è 1if U:(x;) 2 Li 
Sila) ~ (oif U;(x;) = L; 


for all i and j. If L; is the aspiration level of 
the ith person, then it is assumed that group 
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satisfaction will be largest for the alternative 
which maximizes M, where 


M; = Y Sz). 
- 


By ordering the available alternatives ac- 
cording to their value of M;, a complete rank- 
ing of alternatives can be obtained. Although 
the maximum value of M; is not necessarily 
unique (two or more alternatives in a set may 
have the same value), the additional informa- 
tion may be sufficient to eliminate possible 
intransitivities under simple majority rule 
(Harnett, 1967). Thus, in those instances 
where two alternatives had the same value 
My, the expected group decision would be 
that alternative which had the higher value 
of V(x) from the calculations for the MR/in 
model. 

The remainder of this article is devoted to 
an empirical examination of the relative effi- 
ciency of the four foregoing models of group 
decision making (LS, MR, MR/in, and AL) 
for estimating group decisions on Choice-Di- 
lemmas-Questionnaire-type problems, 


METHOD 
Subjects 


The subjects used in this study were 73 volun- 
teers (46 males, 27 females) drawn from the sub- 
ject pool in introductory psychology at Purdue Uni- 
versity. All subjects received experimental credit for 
their participation. Subjects were divided into 11 
three-person and 10 four-person mixed-sex groups on 
the basis of when they could be scheduled to partici- 
pate in the experiment. (It was necessary to eliminate 
one of the three-person groups because some group 
members misunderstood the task.) 


Procedure 


The experiment was conducted in a group of indi- 
vidual cubicles having removable partitions for group 
discussions, TV monitors, and headsets for two-way 
communication between subjects and experimenter. 
All task instructions were presented via the head- 
phones and the monitors. 

Upon their arrival, subjects were placed in a 
cubicle and instructed in the use of the headset to 
communicate with the experimenter. The subjects 
were then given a booklet entitled “Opinion Ques- 
tionnaire IV” which contained the two situation 
descriptions, the alternatives to be ranked for each, 
and the choice questions for scaling their preferences. 

Subjects were instructed to read each situation 
and rank the alternatives presented in terms of how 
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TABLE 1 


ANALYSIS OF PROBLEMS 1 AND 2 FoR THE 
Sraxparp Risky Suter 


r= ; | 
| Mean group) Mean group. 
pre- post- Mean 


Problem discussion discusion Ld Li 
Item 1 653 | 600 |.ss |2157 
Item 2 445 | 401 | 44 |1829 
3person® groups | 1115 | 1020 | 95 | 1.831" 
4-person” groups | 10.80 9.82 | .98 | 2.134* 


L 

Note.—Smaller values reflect greater risk. Thus, positive 
differences reflect a shift toward risk. 

a f test for related samples, One-tailed test used. 

* The analysis of shifts in three- and four-person groups was 
donc on the total on both problems. 

*p «.05. 


they saw their overall attractiveness for the central 
character in each situation. After doing this, they 
were to respond to the 15 choice items for each 
situation by indicating which of two available 
courses of action they would recommend the cen- 
tral character taking. Subjects were given 5 minutes 
to read each situation and rank the alternatives and 
10 minutes to respond to the 15 choice items (15 
minutes per problem). 

After 30 minutes (or when all subjects indicated 
they had completed the task, if sooner) the initial 
individual responses were collected, the partitions 
between the cubicles were removed, and subjects 
were given a new booklet containing only the two 
situation descriptions and the five available alterna- 
tives for each. Subjects were then told that the 
purpose of the initial tasks had been to insure 
that they were thoroughly familiar with the situa- 
tions and the alternatives. Their task was now to 
consider, as a group, each situation and come to a 
consensus about the ranking of alternatives, The 
groups were given 5 minutes to discuss each item 
and arrive at consensus about the ranking of alter- 


natives. (All groups reached consensus within the 
allotted time.) 


Task 


Risk preferences were assessed for the two choice- 
dilemma-like situations described below. The two 
situations are based on two Choice Dilemmas Ques- 
tionnaire items which were modified to make the 
alternatives confronting the central character com- 
patible with Siegel’s (1956a) technique for obtaining 
strength of preference on higher ordered metric 
scales, The situation, the alternatives, and a sample 
choice question from Item 1 are presented below. 
A complete list of the 15 choice items necessary to 
determine a subject’s higher ordered metric scale of 
preference may be found in Siegel (1956a). 


Item 1: Mr. A, an electrical engineer, who is 
married and has one child, has been working for 
a large electronics corporation since graduation 
from college 5 years ago. He is assured a lifetime 
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job with a modest, though adequate, i 
liberal pension benefits upon retirement. Om 
other hand, it is very unlikely that his salary 
increase much before he retires. Because of | 
he has decided to look at the possibility of a 
other job. While attending a professional conves 
tion, Mr. A hears of five jobs with other 6 
panies for which he would be qualified. The fi 
cial future of these five companics varies coms 
erably, Each of these companies would pay 

somewhat more than his present salary to st 
However, Mr. A has noted that the opportu 

for raises and promotions vary with the size a 
potential financial stability of the company. (T 
is, the smaller, potentially less financially stable 
companies offered the greatest possibility for fue 
ture raises and promotions—if they survive am 
prove financially sound.) 

Imagine you are advising Mr. A. Listed bel 
are the five jobs he heard of and the probab 
of each of the companies surviving the compel 
and proving financially sound. Rank each of t 
five positions in terms of how you see their o 
attractiveness for Mr. A. 


——The chances are 1 in 10 that company 
will survive and prove financially sound. - 
— The chances are 3 in 10 that company N. 
financially sound. 
——The chances are 5 in 10 that company O . © 
financially sound. 
— The chances are 7 in 10 that company P .« 
financially sound. 
— The chances are 9 in 10 that company Q . - 
financially sound. 


The following questions depict the kind of choic 
which Mr. A might reasonably be expected 
have to make among offers from companies 
job interviews due to time limitations at the con 
vention. In this instance you may assume that if 
he chooses the alternative of interviewing with 
two companies, dividing his time equally bet 
them, he is virtually assured of receiving a 
offer from one of the two companies, although hi 
cannot be sure which it will be (a 50-50 chance of | 
a job offer from either company). Alternative 
if he chooses to accept the alternative of inte 
ing intensively with one company, he is vii 
assured (100% sure) of receiving a job offer f 
that company. 

Which course of action would you recomm 
that Mr. A follow in each of the following in- 
stances? k 


a— Interview with both his first- and 
choice companies, or 

b. Interview intensively with only his se 
choice company. 


The second item dealt with the problem 
fronting a student who had to choose among fi 
graduate schools. The choice items for the seco 
question required the subject to indicate wheth 
for example, he would recommend that the cen 
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character accept a position on the waiting list of his 
first- and third-choice schools, with a 50-50 chance of 
being accepted by either, or if he should accept a 


ite offer from his second-choice school 


RESULTS 


I'he group decisions were first analyzed for 
the standard risky shift, and to determine if 
similar shifts were obtained in three- and four- 
person groups. The first choices of the indi- 
viduals and the groups were used to determine 
the average of the individual group members’ 
prediscussion preferences on the items and 
their group decision for this analysis, As indi- 
cated in Table 1, a significant risky shift was 
obtained on both items and in three- and 
four-person groups. As can be seen in Table 
1, similar shifts were obtained in three- and 
four-person groups. 

For the second portion of the analysis, four 
expected group decisions were calculated for 
each group on each problem. The first of 
these was a least-squares estimate, which 
was used to provide a base line comparable 
to the mean for assessing the relative pre- 
dictive efficiency of the MR, MR/in, and 
AL social choice models. The second set of 
expected group decisions was based on the 
MR model using only the information in the 
individual ranking of alternatives on each 
problem (MR). The third set of expected 
group decisions (MR/in) was also based on 
the MR model. In addition to the information 
contained in the individual ranking of alter- 
natives, it also considered the extent to which 
an individual was indifferent among two or 
more alternatives. Indifference among alter- 
natives was determined by considering any 
apparent intransitive choices in the 15 items 
used to derive each subject's preference scale 
as indicative of equal-appearing intervals 
(Siegel, 1956a). For example, if a subject’s 
choices indicated that the distances between 
his first and second choices and first and 
third choices were apparently equal, it was 
assumed that he was indifferent between his 
second and third alternatives. 

The fourth set of expected group decisions 
(AL) was derived by considering the addi- 
tional information from the AL model in ad- 
dition to that provided by the MR model 
with indifference considered. 
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TABLE 2 


Cut-Sevake Goopxrss-or-Frr Tests roa me 
Decrston Mopets iw Tuxxk- AND 
Fove-Presox banal 


| Discrepancy between the ac- 
| tual and predicted rank of 


the alternati com 
Group | Model the group's 

: - 
ofr [ 1 a a} 4 

3-person | | 
LS |H 9/0 0 0 
MR 10| 9| 1 0 0 
MR/in| 14 5|0 1 0 
AL "M 5|0 1 0 

4-person | | 
|LS 8|2|o0]|0]|0 
| MR 12| 7 | 1|0/|0 
MR/in| 18| 1] 1 0] 0 
L 1 0] 0 


Note.—The peen obtained for Problems 1 and 2 were 
combined for this portion of the analysis. 

Predicted group indifference between alternatives, when one 

of them was the actual group choice, was treated as a difference 


of one, 
è With df = 3, xt = 11.4, p «.05. 


The effectiveness of each model for predict- 
ing the actual group decisions in three- and 
four-person groups was examined with a 
series of chi-square tests (Maxwell, 1961). 
The results of these tests are indicated in 
Table 2. As can be seen from Table 2, all of 
the models predicted the group decisions sig- 
nificantly better than chance. 

None of the models exhibited any signifi- 
cant differential validity between three- and 
four-person groups (x"rs = .90, df = 1, ms; 
xum = 40, dí = i ns; X uin = 2.50, df= 
1, ns; x*an = 2.50, df = 1, ns). 

There was no difference in the overall pre- 
dictive accuracy of the MR/in model and the 
AL model. The predictions of these two mod- 
els differed only in the case of four group 
decisions. Two of these decisions were pre- 
dicted by the MR/in model and not by the 
AL model, while the other two were more 
accurately predicted by the AL model. 

When individual indifference was not taken 
into account with the MR model, the overall 
predictions were no better than with the LS 
estimate of the group decision (x? = .96, df 
ELAS). 

The overall accuracy of the MR/in and 
AL models was signiñcantly greater than the 
LS model (x? = 5.28, df = 1, p < .025). 
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TABLE 3 


NUMBER OF OBSERVED GROUP Decisions INCORRECTLY 
PREDICTED WHICH ARE RISKIER OR MORE 
Cautious THAN PREDICTED 


Group decisions Group decisions 


Model hich ki which were more " 
[Uh preted | cantious dan 

A 14 7 .095 

M 16 2 ‘001 

MR/in 8 0 “004 

Ak 8 0 “004 


* The significance of the direction of AE INR of prediction 


was examined with a sign test (Siegel, 1956b). 


The incorrectly predicted decisions under 
each model were examined with a sign test 
(Siegel, 1956b) to determine if there was 
any difference in the proportion of times the 
obtained decision was more risky or more 
cautious. These results are presented in Table 
3. As indicated in Table 3, the errors for all 
of the models were chiefly in the direction of 
predicting more cautious group decisions than 
were obtained in these instances. 

All of the decisions not predicted by the 
MR/in model were instances of a relatively 
unconcerned majority confronting a concerned 
minority (i.e, a majority of group members 
were largely indifferent among the alterna- 
tives, while the minority-group member had 
strong consistent preferences with few satis- 
factory alternatives). Two of these situations 
were correctly predicted by the AL model. In 
the other six cases, the first choice of the 
group appeared to be a compromise between 
the position advocated by the concerned mi- 
nority member and the position advocated by 
the relatively unconcerned majority. 

Of the 32 group decisions correctly pre- 
dicted by the AL and the MR/in models, 19 
were situations in which there was a risky 
shift, 8 were cautious shifts, and 5 were in- 
stances in which there was no shift. The re- 
sults of an analysis of the extent of the risky 
shift evidenced in these 32 correctly predicted 
group decisions, for both AL and MR/in mod- 
els, are indicated in Table 4, 

As can be seen in Table 4, there was a sig- 
nificant risky shift evidenced in the group de- 
cisions correctly predicted by the AL model 
on both Items 1 and 2. There was a signifi- 
cant risky shift in the group decisions cor- 
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rectly predicted by the MR/in model on 
Item 1, and a shift on Item 2, of marginal sig- 
nificance in the expected direction. 


DISCUSSION 


Taken as a whole, the results of the present 
study suggest that a major portion of the 
apparent risky shift or group choice shift may. 
be accounted for by the use of a more ap- 
propriate model to estimate the expected 
group decision. The majority of the group 
decisions (32 of 40) did not represent any 
type of shift from what would have been ex- 
pected if a more appropriate model were used. 
(A more appropriate model in this situation 
would be one which takes task characteristics 
and situation requirements into account— 
something which the averaging model does not 
do.) 

The eight group decisions not predicted cor- 
rectly were all risker than expected, as indi- 
cated in Table 3, Thus, the models considered 
do not seem to fully account for the risky 
shift. However, it should be noted that in each 
instance, the group decision predicted by the 
social choice model (MR/in and AL) was 
riskier than the average of the prediscussion 
individual positions. It was simply not as 
risky as the final decision reached by the 
group. These failures in prediction might be 
attributable to a variety of causes. One possi- 
bility is that these groups employed social 
choice processes for resolving their differences 
to which the models considered were not sen- 
sitive. The characteristics of the preference 
data and the impressions of the observers in 
these instances lend support to this conjec- 
ture. In all of these cases not predicted by 
the AL and MR/in models? a majority of 
the group members were largely indifferent 
among the alternatives. The minority mem- 
ber, toward whose preferences the group deci- 
sion shifted, had a strong consistent (no in- 
difference) preference for risk. It would not 
be unreasonable to assume that the minority 
member was able to persuade the other group 


5 There were a total of 10 group decisions which 
were not predicted by both AL and MR/in models. 
Two of the decisions not predicted by the AL model 
were predicted by the MR/in model, while 2 of 
those not predicted by the MR/in model were pre- 
dicted by the AL model, 


members to his position in such a situation 
before the group decision was reached. Indeed, 
given the degree of indifference among the 
majority-group members, it is also possible 
that the minority member was able virtually 
to dictate the solution in these cases. This 
does not necessarily imply that the riskier in- 
dividual is always less indifferent among the 
available options (i.e., more committed) than 
- a less risky individual. Rather, in those in- 
stances when a concerned minority is con- 
fronting a relatively unconcerned majority 
within a group, the concerned minority will 
have a disproportionate influence on the out- 
come. As such, this provides partial support 
- for the confidence version of the leadership 
hypothesis advanced by Burnstein (1969). 
However, the overall results also suggest that 
in and of itself, the confidence-leadership hy- 
pothesis cannot sufficiently account for all of 
the obtained group choice shifts. 

The success of the MR/in social choice 
model in predicting the group decisions would 
appear, in part, due to the fact that it consid- 
ers more information about the subjects’ 
preferences than does an averaging or LS 
type of model. Also, it would appear that the 
considerations of the MR/in model are in 
closer accord with the subjects’ task than 
those of an averaging model. From a norma- 
tive standpoint, the MR/in model attempts 
to define a maximally “fair” or representa- 
tive group decision given ordinal information 
about subjects’ preferences, In particular, an 
MR/in model may be shown to minimize the 
possibility that an individual will either be 
supporting a position which the majority of 
the group rejects, or opposing a position 
which the majority of the group advocates 
(Taylor, 1969). Such considerations would 
appear closely analogous to the task of the 
group members in the group decision condi- 
tion of the risky-shift paradigm who must 
amalgamate their individual preferences to ar- 
| rive at consensus. 

The main reason that the group decisions 
predicted by the MR/in and AL models are 
riskier than the average of the group mem- 
bers’ first preferences becomes readily ap- 
parent upon an inspection of the preference 
data. In the majority of cases, the second 
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TABLE 4 
ANALYSIS OF THE 32 Decisions CORRECTLY 
PREDICTED BY THE AL AND MR/IN 
Mopets ror Risky Surrr 
Mean group position Mi 
Model group Ls 
Prediscussion | Postdiseussion shift 
AL 
Item 1 6.55 6.13 42 |1.809* 
Item 2 4.70 4.29 41 |1.784* 
MR/in 
Item 1 6.59 6.13 46 |1.921* 
4.46 4.18 28 |1.408* 


Item2 | 


,Note.—Smaller values reflect greater risk. Thus, positive 
differences reflect a shift toward risk. 
, *itest for related samples. One-tailed test used with df = 15 
in each instance. 

*p <.05. 


choices of individuals were riskier than their 
first choices. Thus, except in those instances 
when there was a clear majority-rule solution 
evident in the first choices, the group would 
be expected to move in the direction of con- 
sidering riskier alternatives. This tendency 
for the subjects’ second choices to be riskier 
than their first choices might be attributable 
to the properties of the scale to which they 
were responding. However, the findings of 
Levinger and Schneider (1969) and Pruitt 
(1969) suggest an alternative. Specifically, it 
might be assumed that this reflected a reluc- 
tance of a majority of the subjects to strongly 
endorse a riskier (but actually more admired) 
alternative. In those cases when the group 
discussion indicated that they could not 
achieve a consensus from a consideration of 
first choices, the group members could then 
move, with group support, to a consideration 
of these riskier (and more admired) alterna- 
tives. Such group support would not neces- 
sarily entail an overt decision to move to- 
ward accepting greater risk by the group. 
Rather, such support could simply grow out 
of the need to explore alternatives to 
achieve consensus. 

The lack of differential validity between 
the MR/in model and the AL model may be 
viewed as somewhat surprising in view of 
Harnett’s (1967) results. The limited num- 
ber of cases (four) in the present study 
where the two models made different predic- 
tions is an insufficient basis for any conclu- 
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sions. However, it should be noted that in 
these four instances, the differences were in 
the expected direction. The lack of differenti- 
ation between these two models may reflect a 
natural situation (ie, the situations where 
they lead to different predictions may occur 
only infrequently in nature). Alternatively, 
these results in the present study may be the 
result of the extent to which most subjects 
were indifferent among two or more alterna- 
tives on at least one problem. The amount of 
indifference, particularly where it occurred 
among second, third, and fourth choices, could 
have easily masked the operation of an as- 
piration level phenomenon, or rendered its 
considerations inapplicable. 

The degree of indifference noted may have 
stemmed from two sources operating either 
independently or in conjunction. (a) Many 
subjects complained about the lack of rele- 
vant information on which to base a judg- 
ment. Hence, their preferences might reason- 
ably be expected to be only minimally stable. 
Indeed, in this case the scaling task may have 
acted to stabilize their preferences to a greater 
degree than would normally have been the 
case. This might then have acted to increase 
the applicability of the social choice model. 
In addition, this also could have contributed 
to the smaller than usual group shifts (Pruitt 
& Teger, 1967) noted in the first portion of 
the analysis. (5) The alternatives presented 
with the revised questions were all equal in 
expected value over the long run. This could 
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have added to individual indifference among 
alternatives. 
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TOWARD AN UNDERSTANDING OF CHOICE SHIFTS 
IN GROUP DISCUSSION 


DEAN G. PRUITT * 
State University of New York at Bufalo 


The articles in this special issue contribute to an understanding of the nature 
of the choice shift and the mechanisms underlying it. In particular, evidence 
is presented which suggests that shifts on non-risk-involving items are due to 
the same causes as shifts in risk taking. There is also more reason than ever 
to believe that values underlie these shifts, Cross-cultural differences exist in 
the direction and extent of shifts in risk taking, but these appear to be due 
to differences in values rather than the way in which values induce these 
shifts. The nature of the values underlying shifts toward risk in Western 
society is not altogether clear, but there is some reason to believe that they 
are ego ideals relating to personal capability, Of the five value theories, 
relevant-arguments theory is best supported by the articles in this special issue. 
However, this theory may very well explain only part of the choice shift. 
If such a two-process approach is adopted, release theory may possibly account 
for the other part of the shift. Changes in the utility of outcomes associated 
with the risky alternative appear to accompany shifts in risk taking, suggesting 
that these may mediate the impact of relevant arguments or behavior. Further 


An attempt is made here to integrate the 18 
articles in this special journal issue on the 
choice shift in group decision. The first topic 
considered is the contribution made by these 
articles to an understanding of the basic shift 
phenomenon and the direction in which it can 
be generalized. After that, attention is turned 
to the causes of the shift, and the evidence in 
these articles is brought to bear on the various 
theories of this phenomenon. Finally, a brief 
statement is made about the kinds of research 
that are needed for the future. The reader of 
this article will be assumed to have already 
read the Introductory Review which appears 
at the beginning of this special issue. 


Tue Basic SHIFT PHENOMENON AND 
Its GENERALITY 


Use of a Between-Subjects Research Design 


Past research on this phenomenon has al- 
ways employed a within-subjects design, in 
which both individual and group (or post- 
discussion individual) decisions are made by 
the same subjects. This is the kind of design 


1 Requests for reprints should be sent to the au- 
thor, Department of Psychology, State University of 
New York at Buffalo, 4230 Ridge Lea Road, Am- 
herst, New York 14226. 


evidence for the existence of a “Walter Mitty” effect is also presented. 


that permits use of the term “shift.” In con- 
trast, two of the articles in this special issue 
(Carlson & Davis; McCauley, Teger, & Ko- 
gan) employed a between-subjects design, in 
which different subjects were used for the in- 
dividual and group decisions. The decisions 
concerned the Kogan and Wallach choice di- 
lemmas. Statistically significant differences 
were not found, though groups tended to take 
more risk than individuals, and the differences 
between individuals and groups were roughly 
the same size as those found in within-sub- 
jects designs. These results are possible be- 
cause between-subjects designs are typically 
less sensitive than within-subjects designs. 
A cautious interpretation of these results 
would hold that it is improper, on the basis of 
existing evidence, to conclude that groups 
take more risk than individuals on the Ko- 
gan-Wallach items. We can only say with con- 
fidence that individuals shift toward risk on 
these items when they participate in group 
decisions, A bolder interpretation might hold 
that sizable nonsignificant trends in two 
studies are enough basis for concluding that 
groups take more risk than individuals. Yet 
even if this latter conclusion is adopted, we 
must recognize that the unreliability of these 
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trends makes prediction to specific individual- 
group differences in the real world quite 
hazardous. 


Type of Decision Problem Employed 


The decision problems employed in the 
studies reported in this special issue were 
mostly of the choice-dilemmas variety. Most 
of the studies used only risk-oriented items 
and obtained the usual shifts toward risk. But 
four of them (Ferguson & Vidmar; Silver- 
thorne; Stokes; Vinokur) employed caution- 
oriented choice dilemmas drawn from the work 
of earlier authors and obtained cautious 
shifts. 

In one of these studies (Stokes), shifts 
were examined under five experimental condi- 
tions. The pattern of risky shifts across these 
conditions for risk-oriented items was very 
similar to the pattern of cautious shifts for 
caution-oriented items. Hence, Stokes con- 
cludes that the same process must account for 
shifts in both directions. If true, this would 
permit the economy of theory which one 
hopes to find in scientific endeavors. How- 
ever, Stokes’ conclusions are based on such a 
small number of experimental conditions (a 
correlation coefficient of —.97 across five con- 
ditions is only significant at the .10 level) 
that his results need to be replicated before 
firm conclusions are drawn. 

Risk-involving decision problems other than 
the choice dilemmas were used in two of the 
studies, Following the lead of Wallach, Ko- 
gan, and Bem (1964), Carlson and Davis 
found a risky shift in choices among ability- 
test items with differing probabilities of suc- 
cess. Johnson and Andrews employed an origi- 
nal and unusual set of items in which a hypo- 
thetical consumer must Choose between buy- 
ing an available but unknown (and hence 
risky) brand or going to the inconvenience of 
seeking out a known brand. 

The latter authors were actually able to 
manipulate the direction of shift on th risky 
dimension by varying the perceived probabil- 
ity and cost of failure of the unknown brand, 


?A "risk-oriented item" is defined as one that 
usually shifts toward risk, and a "caution-oriented 
item” as one that usually shifts toward caution. 
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Risky shifts were found for products that 

seem relatively unlikely to fail and whose 

failure entails small cost; cautious shifts, for 

products that seem relatively likely to fail and 

whose failure entails large cost. This finding 

is reminiscent of Stoner’s (1968) success in 

predicting the direction of shift on the basis 

of a comparison between the perceived value 

of success and the perceived cost of failure on 

the risky alternative. It also echoes (a) 

Clark and Willems’ (1969) finding that larger 

risky shifts occur on items in the Kogan- 

Wallach set that have less serious perceived 

consequences and (5) Marquis and Reitzs 

(1969) finding of a risky shift on bets with 

positive expected value and a cautious shiít 

on bets with negative expected value. All of 
these studies, taken together, suggest that ex- 
pectancy-value considerations of the kind that 
govern individual decision making also de- 
termine the direction (and presumably the 
extent) of group shift on the risky dimension. | 
Such an assumption is central to the value 
theories of this phenomenon. 

One experiment (Myers & Bishop) em- 
ployed decisions on a series of dilemmas that 
did not involve risk. Group-induced shifts 
were found on these items, suggesting that 
the phenomenon we are studying is broader 
than shifts on the dimension of risk. Guesses 
about the decision that would be made “by 
the majority of your fellow students” were 
mostly on the opposite side of the initial de- 
cision from the direction of group shift. This 
is a typical finding for problems that exhibit 
shifts in risk taking (see, e.g., Ferguson & 
Vidmar in this special issue) and appears to 
be related to the dynamics underlying such 
shifts. Hence, there is reason to believe that 
shifts in the Myers and Bishop study were 
due to the same causes as shifts in risk taking. 

All of the shifts in the Myers and Bishop 
study were away from neutrality on a Likert 
scale, that is, involved polarization. Similar 
results have been obtained by Moscovici and 


3 The term "alternative," in this article, refers to 
the two major possible decisions available to the 
hypothetical decision maker in the choice dilemmas 
items, Thus, in one of these items, an engineer 1$ 
faced with two alternatives: take a job with a new 
company (the risky alternative) and keep the pres- 
ent job (the cautious alternative), 


Risky-Suirt CONCLUSIONS 


Zavalloni (1969) in other non-risk-involving 
material. 


Studies Outside the United States 


Two studies were performed in Western 
countries other than the United States: Fer- 
guson and Vidmar in Canada, and Lamm, 
Schaude, and Trommsdorff in West Germany. 
Both yielded shifts that were in the same di- 
rection on any given item as found in the 
United States. This suggests that values are 
quite similar in the three countries. 

In a third transnational study (Carlson & 
Davis), the behavior of Ugandan students 
was found to be quite different on a number 
of Kogan-Wallach items from that of West- 
erners. For one thing, there was no overall 
shift toward risk, such as is always found in 
the West on these items. Furthermore, in con- 
trast to American controls, the Ugandans 
made cautious initial decisions and did not, 
on the average, view their peers as more cau- 
tious than themselves. These findings, and 

E ond results on the ability-test items, 
suggest that risk taking (or the outcomes as- 
sociated with risk taking) is not as valued in 
Nigeria as it is in the West. In further sup- 
port of this conclusion, Carlson and Davis re- 
port that Ugandans have the reputation of 
being low risk takers. 
While this study suggests that there may 
) be value differences between Uganda and the 
West, it would be improper to conclude that 
group-induced shifts are not found in Uganda. 
Two of the 11 Kogan-Wallach items shifted 
toward caution, and trends on other items 
suggest that more might have shifted if a 
more sensitive, within-subjects design had 
been used. Furthermore, examination of their 
Table 2 reveals that most (4 out of 5) 
of the items on which others were viewed as 
more risky than oneself shifted toward cau- 
tion, and most (5 out of 6) of the items on 
| which others were viewed as more cautious 
than oneself shifted toward risk. As men- 
| tioned earlier, this finding is characteristic of 
studies in Western countries and seems to be 
related to the dynamics underlying choice 
shifts, Hence, it can be argued that these two 
parts of the world differ in values but not in 
the basic dynamics of group shift. 


Jj 


497 


Nature of Shifts within the Group 


Changes in the decisions of individual 
group members as a result of group discussion 
were investigated by Ferguson and Vidmar. 
They classified subjects from “most risky” to 
“most cautious” on the basis of their rank in 
the group on initial risk. The picture that 
emerged was one of convergence, with the 
riskier group members typically shifting to- 
ward caution and the more cautious members 
toward risk. Groups converged on a point 
above the initial group average on risk-ori- 
ented items by virtue of the fact that the 
more cautious group members shifted further 
toward risk than the riskier members did 
toward caution. The opposite was true for 
caution-oriented items, 


DIFFUSION-OF-RESPONSIBILITY THEORY 


Diffusion-of-responsibility theory attributes 
the risky shift to reduced concern about fail- 
ure resulting from sharing the blame with 
others. Hence, Vinokur’s finding that group 
discussion reduced the perceived cost of fail- 
ure connected with the risky alternative might 
be construed as supporting this theory. How- 
ever, other evidence in this special issue casts 
doubt on this already shaky thepry. The the- 
ory cannot account for cautious shifts, which 
were found in a number of the studies; yet 
Stokes’ evidence that the same mechanism 
underlies risky as cautious shifts suggests 
that a valid theory can be expected to account 
for both kinds of shift. Furthermore, Dion, 
Miller, and Magnan obtained negative results 
in a test of a clear prediction from Wallach 
and Kogan’s (1965) version of this theory, 
that more cohesive groups will exhibit a 
larger risky shift. Hence, there is further rea- 
son for doubt about the validity of this 
theory. 


FAMILIARIZATION THEORY 


Another study (Stokes) can be added to 
the long list of those that have failed to rep- 
licate Bateson’s familiarization effect. Indeed, 
Stokes found that familiarization produced a 
slight shift in the direction opposite to the 
orientation of the item. Hence, further doubt 
is cast on familiarization theory. 
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LEADERSHIP THEORY 


With the exception of a small part of one 
article, very little attention was paid to lead- 
ership theory in this special issue, This is 
regretable in the light of the evidence favor- 
ing Burnstein's (1969) leader-confidence ver- 
sion of this theory, which attributes the risky 
shift to greater confidence and assertiveness 
on the part of high risk takers. The one ex- 
ception is found in the article by Castore, 
Peterson, and Goodrich, where it was re- 
ported that highly confident risk takers were 
members of groups that shifted more radically 
toward risk than predicted by a group-decision 
model. The authors of this article argue in 
favor of a two-process theory involving both 
a group-decision model and Burnstein's leader- 
confidence paradigm. 

Some of Stokes’ results bear on a point 
made about leadership theory in the /ntro- 
ductory Review. Stokes once again confirmed 
Teger and Pruitt's (1967) finding that ex- 
change of information about initial decisions, 
in the absence of discussion, produces a par- 
tial risky shift.* Furthermore, precisely com- 
parable results in a cautious direction were 
obtained on caution-oriented items. Stokes' 
study is important for leadership theory be- 
cause information was exchanged at a dis- 

tance rather than, as with earlier studies, in a 
face-to-face setting. Group members were un- 
able to observe each other and hence unable 
to communicate nonverbally. These findings 
argue against the notion, mentioned in the 
Introductory Review, that nonverbal leader- 
ship occurs in the information-exchange con- 
dition. Since partial shifts occur in this condi- 
tion, we are forced to rule out as a sole expla- 
nation of risky and cautious shifts all versions 
of leadership theory that attribute the shift 
to stylistic differences between high and low 
risk takers, for example, leader-confidence 
theory. However, theories of this kind can 
still be construed as explaining that part of 
the shift apparently added by group discus- 
sion, over and above that obtained from sim- 


4The findings described in this sentence were also 
obtained by Clark, Crockett, and Archer. One study 
in the special issue failed to obtain an information- 
exchange effect (Myers & Bishop). Perhaps signifi- 
cantly, this latter study employed nonrisk items. 
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ple information exchange. Hence, the Castore 
et al. two-process theory mentioned above is 
viable in the face of this evidence, as is the 
two-process theory mentioned in the /ntro- 
ductory Review which combines one of the 
first three versions of value theory (social- 


comparison, pluralistic-ignorance, or release - 


theory) with leader-confidence theory or rele- 
vant-arguments theory. 


RoLE or VALUES 


The assumption that values are important 
in generating the choice shift has already 
been made several times in this article. The 
special issue contains a good deal of evidence 
concerning this assumption, which will now 
be reviewed. After that, attention will turn to 
the five value theories that were described in 
the Introductory Review. 


Attraction Toward the Risky or Cautious 
Alternative 


As was mentioned in the /ntroductory Re- 
view, value theories have in common the as-, 
sumption that groups shift in a direction’ 
toward which most members are already at- 
tracted as individuals. Considerable evidence 
favoring this assumption has already accumu- 
lated. The Johnson-Andrews and Silverthorne 
articles provide further support in that their 
subjects initially took a good deal more risk 
on risk-oriented than on caution-oriented 
items. Support outside the realm of risk tak- 
ing can be seen in Myers and Bishop’s find- 
ing that shifts on Likert-scaled, non-risk-in- 
volving items consisted of increased endorse- 
ment of alternatives already moderately fa- 
vored in the initial decision. 


Values as a Source of this Attraction 


A number of authors, starting with Brown 
(1965), have argued that values underlie 
attraction toward the dominant alternative. 
One source of evidence for this assumption is 
the common finding that other people are 
initially assumed to favor greater caution 
than oneself on risk-oriented items and greater 


risk on caution-oriented items. This self-other - 
discrepancy is interpreted as evidence for the ' 


operation of values, because it is assumed 
that people try to view themselves as adher- 
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ing more closely to their values than do their 
fellow men. (This interpretation has been 
challenged by Levinger & Schneider, 1969, 
who regarded the self-other discrepancy as a 
measure of the extent to which the individual 
dares to deviate from an assumed group 
standard.) 

The self-other discrepancy on risk- and cau- 
tion-oriented items was replicated in several 
studies of Western subjects reported in this 
issue (Baron, Dion, Baron, & Miller; Carl- 
son & Davis; Ferguson & Vidmar; Lamm, 
Schaude, & Trommsdorfí; McCauley, Kogan, 
& Teger). Furthermore, as previously men- 
tioned, while the definition of risk- and cau- 
tion-oriented items must be changed to ac- 
commodate Ugandan values, the same rela- 
tionship between the direction in which an 
item shifts and the self-other discrepancy can 
be seen in the Carlson and Davis data on 
Ugandan students. 

Additional evidence of a similar kind can 
be seen in a finding by Ferguson and Vid- 
mar. These authors report that after the 
group discussion, individuals felt it desirable 
to be among the most risky people in the 
risk-oriented items and among the most cau- 
tious people in the caution-oriented items. 

Levinger and Schneider’s (1969) finding 
that on risk-oriented items, people admire a 
position more risky than they themselves have 
chosen was replicated by Lamm et al. This 
finding provides support of a different kind 
for the notion that attraction toward the risky 
alternative on risk-oriented items is a func- 
tion of values. It suggests, in addition, that 
the value or values involved may be ego 
ideals, living up to which would cause a per- 
son to admire himself. 

Tf we view the values that underlie risk 
taking on risk-oriented items as ego ideals, we 
are regarding them as internalized, personal 
values. An alternative position would regard 
them as superficial cultural values that are 
adhered to out of conformity to assumed 
group norms. The latter position would in- 
terpret as self-presentation, rather than as an 
expression of true belief, the fact that sub- 
jects generally portray others as more cautious 
than themselves and state that they admire 
highly risky choices. Lamm et al. endorse the 
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former position that we are dealing with ) 
personal values rather than social conformity, 
citing as evidence two of their results: (a) 
they found no difference in initial risk or 
risky shift between people with high, as op- 
posed to low, need for approval; and (b) 
they found a nearly significant tendency for 
subjects who had shifted more toward risk to 
indicate greater self-esteem at the end of the 
experiment. 

Additional evidence concerning the nature 
of the value(s) underlying attraction toward 
the risky alternative can be seen in the Jelli- 
son and Riskind article in this special issue, 
which represents an extension of these au- 
thors' earlier work (Jellison & Riskind, 1970). 
They show here that subjects regard people 
with higher ability as likely to take more risk, — 
This suggests, as does their earlier research, 
that the values underlying risk taking entail 
admiration for ability. If we interpret these 
values as ego ideals, then we will view the 
high risk taker as trying to show /tipiself how 
much ability he has. If we interpret them as 
the product of social conformity, then we will 
view him as trying to show how capable he is 
to other people. 

A good deal of evidence has just been pre- 
sented about the nature of the values associ- 
ated with risk-oriented items. Unfortunately, 
there is no evidence in this special issue con- 
cerning the nature of the values associated 
with caution-oriented items. 


Walter Mitty Effect 


Further evidence can be seen in two studies 
(Higbee; McCauley, Kogan & Teger) for the 
existence of a “Walter Mitty” effect. By this 
is meant the tendency which people seem to 
have to describe their behavior as more risky 
than it actually is. The relevance of this ef- 
fect to the role of values in risk taking will 
be discussed after the evidence has been re- - 
viewed. 

It will be recalled that in the original study | 
of this phenomenon (Pruitt, 1969) two kinds 
of conditions were employed: the R-D condi- 
tion, in which subjects first rated as “risky” 
or “cautious” the options® in a Kogan-Wal- 


5 The term “option” will be used to refer to the 
6 (or sometimes 11) choices available to the subject 


es a 


lach item and then decided among these op- 
tions, and the D-R condition, in which they 
first made decisions and then ratings. Only 
the results for the risk-oriented items are im- 
portant here. In the R-D condition, people 
tended to choose a level of risk which they 
had previously rated at cautious. In the 
D-R condition, they tended to choose the 
same level of risk but to rate the level chosen 
as risky. Hence, it can be said that they de- 
scribed their behavior as risky but that from 
their own viewpoint as expressed in an ob- 
jective moment outside the context of decision, 
it was actually cautious. 

In the experiment of McCauley, Kogan, 
and Teger, the subjects answered risk-oriented 
choice dilemmas for themselves and for others 
in two orders: self-other (S-O) and other-self 
(O-S). In both conditions, the other was 
seen as more cautious than the self, but this 
difference was much greater in the S-O than 
in the O-S condition. If viewing others as 
more cautious than oneself in the S-O condi- 
tion is a way of describing oneself as risky, 
these findings constitute further evidence for 
the Walter Mitty effect. Reminiscent of Pru- 
it's (1969) results, the subjects’ own deci- 
sions did not differ substantially between the 
two conditions, but others were seen as con- 
siderably more cautious in the S-O than in 
the O-S condition. 

Higbee's article first reports a replication of 
Pruitt's results and then describes an experi- 
ment involving a strategic game with five 
stages. Higbee found that at the end of every 
stage, people pictured themselves as having 
been less risky in the past than they would 
be in the future. Yet, there was no reality 
support for this picture, in that their level of 
actual risk taking and perception of past risk 
taking did not change from stage to stage. 
This suggests a possible refinement of the 
Walter Mitty effect. Not only do people de- 
Scribe their current behavior as riskier than it 
actually is, but they describe their future be- 
havior as riskier than their past behavior. 


in the choice-dilemmas items as usually administered. 
In the typical method, these are 1/10, 3/10, 5/10, 


= 7/10, 9/10 and none. “Option” is not used inter- 


changeably with "alternative," whose meaning was 
defined in Footnote 3. 
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This refinement is compatible with an — 
interpretation of the Walter Mitty effect 
as evidence of conflict over risk taking. Peg- 
ple are attracted toward the risky alternative 
on risk-oriented items, but they are afraid to 
fully endorse it. They resolve the conflict by < 
acting cautiously yet calling their actions 
risky. It is easier to describe future behavior 
as risky than to describe past behavior this” 
way, because there are fewer reality coms 
straints on what is said about the future. 

Assuming that attraction toward risk re-- 
sults from values, there are two possible in- — 
terpretations of the tendency to view one’s 
actions as riskier than they really are. If 
these values are regarded as superficial group 
norms, this tendency can be interpreted as 
self-presentation. But if they are internalized 
ego ideals, we are probably dealing with 
fantasy. For reasons given earlier, the author 
prefers the latter interpretation, but it is not 
possible to be sure at this time. A critical 
study to distinguish between these two in- 
terpretations would compare the Walter Mitty 
effect under public as opposed to private con- 
ditions. 

The McCauley, Kogan, and Teger findings 
are quite parallel to some earlier results re- 
ported by Jellison and Riskind (1970), 
which may therefore help understand the ori- ' 
gins of the Walter Mitty effect. These au- 
thors had subjects indicate the probability | | 
that the risky action would be successful, on - 
the 10 risk-oriented items in the Kogan-Wal- 
lach list. (This is quite a different task from 
choosing a preferred level of risk.) Every sub- 
ject went through the list twice, once with 
himself as the main character in the choice- 
dilemmas situations and once with an “aver- 
age underclassman" as the main character. — 
There were two conditions: In the S-O condi- 
tion, own chances of success were rated first 
and then those of others. In the O-S condi- 
tion, the order of these tasks was reversed. 
Self-ratings were almost identical under the 
two conditions, and in both conditions others 
were seen as less likely to succeed than one- 
self. However, the latter effect was much more 
marked in the S-O than in the O-S condition. 
Another way of describing the latter finding 
is that people portrayed themselves as much 


more likely to succeed than others only if 
they were able to rate others’ chances of suc- 
cess after they rated their own. 

The similarity between these results and 
those of McCauley, Kogan, and Teger is 
striking; hence, it is quite possible that we 
are dealing with the same phenomenon in 
both studies. This phenomenon might be 
viewed as arising from a conflict between the 
desire to see oneself as having a lot of ability 
and the reality-based necessity of viewing 
one’s ability accurately, When asked to indi- 
cate the probability of personal success, in 
the Jellison and Riskind task, people are at 
least partially constrained by the need for 
accuracy—perhaps because they know a good 
deal about themselves or because estimating 
such probabilities is often connected with 
adaptive behavior and therefore habitually 
subject to realistic considerations. But when 
given the task of rating other people’s proba- 
bility of success in the context of their own, 
in the O part of the S-O condition, they have 
an opportunity to view themselves as having 
a lot of ability by portraying others as less 
likely to succeed than themselves. Reality 
constraints have less impact on judgments of 
others, because others are not so well known 
as oneself and because it is not so dangerous 
to be mistaken about the ability of others. 
This analysis can be extended to the Walter 
Mitty effect by making the familiar assump- 
tion that people value risk because it implies 
the possession of ability. When making an 
actual decision, people must act more or less 
. in accordance with their actual ability, be- 
| cause such an approach is necessary for real- 
istic adaptation to the world. But they are 
not so constrained when developing images 
of past, and especially future, decisions. 
Hence they are freer to satisfy their idealized, 
highly capable self-image by picturing them- 
Selves as high risk takers. 

In the analysis just presented, the desire 
lo portray oneself as capable has been viewed 
as an ego ideal and the Walter Mitty effect 
às an effort to boost self-esteem without en- 
langering adaptation. But as mentioned be- 
lore, the Walter Mitty effect can also be seen 
43 a matter of self-presentation, in which case 
the desire to portray oneself as capable would 
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be viewed as arising from a wish to impress 
other people. The critical research with re- 
spect to these alternative positions has yet to 
be done. 


Impact of Values on the Choice Shift 


The findings described in the last two sec- 
tions provide good reason to believe that 
values underlie attraction toward the risky 
alternative on risk-oriented items and some 
reason to believe that values underlie attrac- 
tion toward the cautious alternative on cau- 
tion-oriented items, However, there is only 
indirect support in these findings for the no- 
tion that values actually produce risky and 
cautious shifts. 

We are fortunate to have two articles in 
this special issue (Clark, Crockett, & Archer; 
Lamm et al.) that link values directly to the 
risky shift on risk-oriented items. The basic 
design in both experiments involved compos- 
ing groups out of people with homogeneous 
values. The measure of “value” used was dif- 
ferent in the two studies, providing conver- 
gent validity to this concept. Clark et al. used 
the own-other discrepancy (averaged across 
items) as an operational definition of the 
degree to which a subject valued the risky 
alternatives, while Lamm et al. used the dis- 
crepancy between own decision and the deci- 
sion for which the individual had the “high- 
est regard." ° The Lamm et al. study was 
controlled for possible regression artifacts. 
The Clark et al. study did not employ such 
controls and did not need them. 

The basic finding in both studies was that 
groups of people who initially gave evidence 
of valuing the risky alternatives exhibited 


ë Cartwright, in his article in this special issue, 
has criticized group-composition studies that employ 
averages across a set of items as a basis for deter- 
mining an individual's score, He argues that interitem 
reliability is quite small on measures such as initial 
risk, and hence that a group which seems properly 
composed on the basis of cross-item averages may 
actually be improperly composed on the separate 
items. This is an issue that clearly deserves concern. 
But in the two studies described in this paragraph, 
the differences between the cross-item averages for 
high-value and low-value subjects were so large that 
it seems reasonable to assume that they were mani- 
fest in most of the group compositions on separate 
items. 
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larger risky shifts than did groups of people 
who did not show such values. This implies 
that the values which produce attraction to 
the risky alternative on risk-oriented items 
produce a£ least part of the shift in group 
discussion of these items. 

One wonders about the relationship be- 
tween the value measures in the Clark et al. 
and Lamm et al. studies. Lamm et al. report 
a moderate negative correlation (r = —.34) 
between own-other discrepancy and ideal-own 
discrepancy in the larger sample from which 
they assembled their group. The present au- 
thor performed a reanalysis of data from the 
subjects actually used in the experiment and 
found a smaller but still negative correlation 
(r = —.09) between these variables. These 
correlations suggest the unexpected conclusion 
that quite a few of the subjects who fell into 
the high value-of-risk condition in one study 
would have fallen into the low value condition 
in the other. This may mean that the sup- 
posed measures of “value of risk” employed 
in these studies are actually tapping two dif- 
ferent phenomena. An alternative interpreta- 
tion would hold that a negative self-correla- 
tion between the two measures, due to the 
fact that both include own level of risk, is 
obscuring a fairly sizable positive correlation 
or an identity between the phenomena under- 
lying these measures. 


VALUE THEORIES 


The evidence just cited that values underlie 
the risky shift gives added stature to the five 
value theories described in the Introductory 
Review. At least one of these theories would 
seem to have a very good chance of being the 
correct explanation for at least part of the 
shift. Some of the findings in this special issue 
also pertain to specific value theories. 


Social-Com parison Theory 


Ferguson and Vidmar demonstrated the 
validity of four propositions that can be de- 
rived from all versions of social-comparison 
theory: (a) Perceptions of the decisions made 
by others shift toward risk on risk-oriented 
items and toward caution on caution-oriented 
items. (b) After discussion, others are seen as 
more cautious than oneself on risk-oriented 
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items and more risky than oneself on caution- 
oriented items. (This had previously been 
shown only for prediscussion decisions.) (c) 
Shifts in the perception of where others 
stand are larger and more reliable than shifts 
in personal preference. This suggests that the 
former shifts may be antecedent to the latter, 
(d) Group members who start out at a lower 
level of risk shift further on risk-oriented 
items and less on caution-oriented items. The 
last finding is implied by the fact that groups 
converge as they shift; hence, it provides no 
distinctive support for any theory of the 
choice shift. However, the first three findings 
do seem to support social-comparison theory, 
though it will be argued in the next section 
that they also support pluralistic-ignorance 
theory. 

Another derivation from social-comparison 
theory was tested by Baron, Dion, Baron, 
and Miller. It can be expressed in the form of 
two hypotheses: (a) On risk-oriented prob- 
lems, individuals are more influenced by group 
discussion if they are part of a group whose 
other members argue in favor of risk than if 
they are part of a group whose other members 
argue in favor of caution, and (5) on caution- 
oriented problems, individuals are more in- 
fluenced by group discussion if they are part 


of a group whose other members argue in | 


favor of caution as opposed to risk. These 
hypotheses were not verified; instead, the di- 
rection of the arguments presented by the 
other group members completely determined 
the direction and extent of shift. Hence, some 
doubt is cast on all versions of social-com- 
parison theory.* 


Pluralistic-I gnorance Theory 


Support for pluralistic-ignorance theory 
can be drawn from several of the findings 
already cited. The theory postulates that on 
risk-oriented items, people are in conflict be- 
tween an ideal of risk and a cautious assumed 


* Dion, Baron, and Miller (1970) reported an un- 
published study that apparently produced much the 
same findings as that performed by Baron et al. In 
this study, Steiner presented false norms about 
others’ decisions on choice-dilemmas items. The sub- 
jects shifted toward these norms without regard to 
whether they were dealing with a risky or cautious 
item, 


group standard. As mentioned earlier, evi- 
dence of such conflict is seen in the studies 
that demonstrate the Walter Mitty effect 
(Higbee; McCauley, Kogan, & Teger). 

One feature of the evidence presented by 
McCauley, Kogan, and Teger calls into ques- 
tion an assumption made by the proponents 
of both pluralistic-ignorance theory (Levinger 
& Schneider, 1969) and social-comparison 
theory (e.g., Brown, 1965). This is that post- 
decisional judgments of the level of risk taken 
accurately measure the perceptions of where 
others stand, upon which the subject assert- 
edly bases his decisions. McCauley, Kogan, 
and Teger found large differences between 
predecisional and postdecisional judgments of 
others' positions, which suggests that deci- 
. sions affect judgments about where others 
stand. Hence, it seems more appropriate for 
future researchers who are interested in these 
theories to employ predecisional judgments to 
assess the perception of others at the time of 
decision. 

The first three propositions whose validity 
was demonstrated by Ferguson and Vidmar 
are as easily derived from pluralistic-igno- 
rance theory as from social-comparison the- 
ory. The reason for this is that both theories 
attribute the group-induced shift to a revised 
perception of where others stand on the item 
in question and assume that decisions will 
always, on the average, be somewhat in the 
valued direction of where others are perceived 
to stand.* As mentioned earlier, Ferguson and 
Vidmar's fourth proposition is of little in- 
terest, since it follows automatically from 
the facts of convergence and shift. 

The implications of Baron et al.’s findings 
for pluralistic-ignorance theory are not alto- 
gether clear. But it can be argued that this 
theory predicts rather different results from 
those obtained and hence that the validity of 
the theory must be questioned. It will be re- 
called that these authors employed confed- 
erates for the other group members, who sta- 
tioned themselves at or moved to a point that 


*'The possibility that risky behavior om risk-ori- 
ented items entails conformity to an assumed group 
Norm was discussed earlier (and tentatively re- 
jected). Pluralistic-ignorance theory attributes cau- 
tious behavior, not risky behavior, to an assumed 
Broup norm. 
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was either 40 percentage points riskier (Con- 
dition A) or 40 percentage points more cau- 
tious (Condition B) than the subject's initial 
decision. Others are usually seen initially as 
more cautious then oneself on risk-oriented 
items. (McCauley, Kogan, & Teger show 
that this is even true when the assured posi- 
tion of others is measured before the initial 
decision.) Hence, on these items, the position 
taken by others is more fully in accordance 
with expectations in Condition B than in Con- 
dition A. This means that the perceived group 
standard should move toward risk further in 
Condition A than it moves toward caution in 
Condition B. This in turn implies that there 
should be larger shifts toward risk in Condi- 
tion A than toward caution in Condition B, 
since the theory holds that changes in de- 
cision are a direct function of changes in the 
perceived group standard. By a similar line 
of reasoning, it can be shown that pluralistic- 
ignorance theory implies for caution-oriented 
items that shifts toward caution will be 
larger in Condition B than shifts toward risk 
in Condition A. The data do not support 
either derivation; all four shifts (risk- versus 
caution-oriented items, risky versus cautious 
group consensus) were as close to equal in 
absolute size as one can hope to find in data 
of this kind. 


Release Theory 


The evidence of conflict about initial deci- 
sion, which was cited above in connection 
with pluralistic-ignorance theory, is equally 
supportive of release theory, since these two 
theories do not differ substantially in their 
account of the processes underlying the initial 
decision. Otherwise, there is little evidence 
for or against this version of value theory 
among the articles in this issue. 

Ferguson and Vidmar’s findings with re- 
spect to changes in the perception of where 
others stand cannot be deduced from release 
theory, as they can from social-comparison 
and pluralistic-ignorance theory, but they do 
not contradict this theory. 

It might be argued that Baron et al.’s find- 
ings are evidence against release theory. Such 
an argument would hold that the release 
mechanism should reinforce the pressure of 
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group consensus in those conditions where the 
consensus favors the value orientation of the 
item. This should cause a greater shift than 
would be found in conditions where the 
group consensus opposes the value orientation 
of the item, Yet Baron et al. found equal 
shifts in all conditions. However, in defense 
of release theory, it should be noted that 
Baron et al. employed groups of confederates 
that, after a little discussion, were wnani- 
mously in favor of a single option on the risk 
dimension. It is reasonable for proponents of 
release theory to assume that the individual 
will follow the group consensus in such cir- 
cumstances, because the release mechanism is 
viewed as only coming into play when the 
group lacks unanimity and one or more mem- 
bers openly endorse the alternative that is 
secretly favored by the individual under 
study. 


Relevant-Arguments Theory 


Considerable evidence supporting relevant- 
ts theory is found in this special 

issue. St. Jean’s (1970) finding of a full risky 
shift in discussions involving arguments only, 
where group members are not permitted to 
reveal their initial decisions, was replicated 
by Clark, Crockett, and Archer. In addition 
to supporting relevant-arguments theory, this 
finding might be viewed as evidence against 
social-comparison, pluralistic-ignorance, and 
release theories, since these theories trace the 
shift to the availability of information about 
others’ preferences among the available op- 
tions, and this information is not obviously 


- available in the arguments-only condition. 
_ However, Clark et al. provide indirect evi- 
dence that their subjects were able to make 


relatively accurate guesses about the initial 
preferences of their fellow group members in 
the arguments-only condition. Hence, there 
appears to be an opportunity in this condition 
for the mechanisms postulated in these three 
theories to operate. Furthermore, as was indi- 
cated in the Introductory Review, these three 


. theories can be restated as holding that shifts 
.. result from an exchange of information about 
- attitude toward the basic risky alternative in 


Ps 


contrast to the basic cautious alternative. 
Such information is clearly available in the 


. arguments-only condition; hence, the mech- 


anisms postulated by these three theories have 
an opportunity to operate in this condition. 

The strongest evidence to date for relevant. 
arguments theory can be seen in Silverthorne’s 
article in this special issue. In one part of his 
study, this author counted the number of 
risky and cautious arguments that came out | 
in group discussions and found an overwhelm- 
ing proportion (12.5 to 1) of risky arguments 
in the discussion of items that shifted toward 
risk and an overwhelming proportion (6 to 1) < 
of cautious arguments on items that shifted — 
toward caution. These findings can be di- 
rectly predicted from  relevant-arguments 
theory. Yet this evidence does not conclu- 
sively prove the validity of this theory, be- 
cause there is no way of determining whether 
the comparative weight of the arguments ac- 
tually influenced the direction of shift, A 
critic of this interpretation could argue that 
Silverthorne's subjects were producing risky 
or cautious arguments as rationalizations to 
support their risky or cautious shifts rather 
than shifting because they had heard these 
arguments. 

In an effort to answer this criticism, Silver- 
thorne ran several conditions in which he 
manipulated the preponderance of arguments. 
He asked some subjects to introduce argu- 
ments for caution into their discussions, oth- 
ers to introduce arguments for risk, and still 
others to introduce an equal number of ar- | 
guments on both sides. These instructions | 
were carried out and produced sizable risky 
shifts when the arguments were biased toward 
risk and sizable cautious shifts when they 
were biased toward caution. The case which — 
these findings make for relevant-arguments | 
theory would seem to be overwhelming, if it 
were not for the possibility that the instruc- 
tions directly influenced the shifts rather than ' 
indirectly via the balance of arguments. This 
study should be repeated using a manipula- 
tion that does not carry such obvious demand — 
characteristics. j 

Further support for relevant-arguments - 
theory can be seen in the article by Myers 
and Bishop. Like Silverthorne, these authors 
found a higher proportion (three to one) of 
arguments favoring the direction of shift than 
the opposite direction. In addition, they - 
found a positive correlation (r = .37), across — 


ents favoring the dominant direction and 
the magnitude of shift in that direction, As is 
true of the first part of Silverthorne's study, 
Myers and Bishop's findings are vulnerable to 
An alternative causal interpretation. This 
would trace the relative propertion of risky 
and cautious arguments to the direction of 
shift rather than the direction of shift to the 
relative proportion of arguments. 

Despite this reservation and the problem of 
demand characteristics in the second part of 
Silverthorne’s study, the evidence favoring 
relevant-arguments theory in these two stud- 
jes is impressive 

A word needs to be said about the import 
for relevant-arguments theory of findings 
Cited earlier in connection with other value 
theories. While Ferguson and Vidmar's find- 
ings would not be predicted by relevant-argu- 
ments theory, they do not seem to contradict 
it. The same can be said for the evidence of 
conflict over risk taking in the Higbee and 
McCauley-Kogan-Teger studies. On the other 
hand, Baron et al.’s results are clearly pre- 
dictable from relevant-arguments theory, since 
they found equal movement in the face of a 
standard group advocacy, regardless of the 
value pull of the item. 

If relevant-arguments theory is correct, one 
_ wonders why there is so little evidence for 
- the familiarization effect. One might think 
3 that the value orientation of an item would 


influence the relative proportion of arguments 
- for and against risk taking written down by 
__the individual while working alone, just as it 
Seems to influence the relative proportion of 
arguments produced in group discussion. By a 
process of self-persuasion, this should cause 
a shift in his preferences similar in direction 
if not size to the group-induced shift. But, 
as mentioned earlier, the familiarization effect 
is ephemeral at best. 

_ A partial solution to this puzzle may be 
derived from Stokes’ study, in which familiar- 
ization was found to produce slight (nonsig- 
lificant) shifts in a direction opposite to the 
alue orientation of the items. In a content 
halysis of the arguments produced in this 
Indition, Stokes found a slight prepond- 
ance of arguments favoring caution on the 
k-oriented items and a slight preponderance 


s05 
of arguments favoring rib on the centiem 
oriented items, He also found small positive 
correlations, across subjects, between the pro- 
portion of risky over cautious arguments and 
the extent of shift toward. risk. bis 
data provide some support for 
that arguments written down 
vidual while wor 


known, the value orientations of the choice 
dilemmas items are not reflected in these ar- 
guments. This element of Stokes’ finding ix 
doubly mysterious because Burnstein, Miller, 
Vinokur, Katz, and Crowley report 

value orientations of items have considerable 
influence on what people say when they are 
asked to “think out loud” into a tape re 
corder. There is clearly a need for research to 
unravel this mystery. 


Vinokur proposes a modified version of - 


relevant-arguments theory which holds that 
arguments produce utility changes which pro- 
duce shifts in risk taking. This theory and 
the evidence for it will be discussed in a 
later section on utility theories. 


Commitment Theory 


apparent in the articles in this special issue. 
However, a finding cited as evidence against 
the theory in the /ntroductory Review is 
called into question by one of these articles. 
It was pointed out that McCauley (1970) had 
found that individuals (as opposed to groups) 
do not shift toward the extremes of a Likert 
scale. This finding contradicts a clear impli- 
cation of commitment theory. However, My- 
ers and Bishop, working with non-risk-involv- 
ing items, found a small individual shift 


toward extremity, as predicted by the theory. 


Overview of Value Theories 
In the Introductory Review, it was pointed 


out that commitment theory has no real sup- — 

port and that several findings seem to con- 
tradict it. While, as was just mentioned, My- 
ers and Bishop have failed to replicate one of 
these contradictory findings, the case for this - 


theory still seems weak. 
Of the other four value theories, relevant- 
arguments theory receives the most support 
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from the articles in this special issue. It is 
hard to believe that group members are not 
influenced by the direction of the discussion 
when arguments favoring risk outnumber ar- 
guments favoring caution by a ratio of 12.5 
to 1 on risk-oriented items! 

Of course, arguments cannot explain every- 
thing, since partial shifts are found in the 
situation where people exchange information 
about their initial choices without uttering a 
word. Hence, if we believe the evidence for 
relevant-arguments theory, it seems appropri- 
ate to adopt a fwo-process approach, in which 
this theory is viewed as a major contender for 
explaining that part of the shift added by 
discussion, over and above the impact of ex- 
changing information about initial prefer- 
ences. It is possible instead to adopt a single- 
process approach which would argue that 
relevant-arguments are the whole story and 
that in the information-exchange condition, 
people are using their imaginations to recon- 
struct the arguments which their fellow group 
members would probably employ to justify 
their initial preferences. But this approach 
seems a bit farfetched. 

Earlier in this article, Burnstein's leader- 
confidence theory was also said to be a possi- 
ble explanation for that part of the shift 
added by discussion. The evidence favoring 
this theory seems weaker than the evidence 
favoring relevant-arguments theory, but it 
may not be necessary to choose between these 
two theories. Both could be correct if we as- 
sumed that relevant arguments are mainly 
generated by the high-risk individual or peo- 
ple following his leadership. 

The other three value theories (social- 
comparison, pluralistic-ignorance, and re- 
lease theory) would appear to be contenders 
for explaining that part of the shift due to 
exchange of information about initial pref- 
erences. This means that they are contenders 
for status in a two-process theory along with 
relevant arguments theory, leader-confidence 
theory, or some combination of the two. An 
alternative, single-process approach would 
. assume that one of these three theories is 
- totally adequate for explaining the shift and 

that a full-scale discussion simply enhances 
the process that is causing a partial shift un- 
der information exchange. While this ap- 
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proach does not seem entirely farfetched, the 4 
author of this article has been unable to en- 4 
visage a convincing mechanism whereby dis 
cussion enhances the effect of information exe 
change. It might be argued that discussio 
leads to greater personal involvement with 
other group members and hence greater 
terest in their initial preferences. But 
position cannot apparently accommod 
Lamm's (1967) discovery of a full risky 
on the part of subjects who observed a group 
from behind a one-way mirror and, therefoj 
were not personally involved with its meme 
bers. 

In the Introductory Review, existing 
dence was said to give social-compari 
theory a slight edge over pluralistic-ignora 
theory. This situation remains uncha 
since exactly the same evidence in this spe 
issue supports both theories. It was also 
gued that release theory had a slight adv: 
tage over the other two. There is really no 
ing for or against release theory in the spe 
issue; but since social-comparison and plu 
istic-ignorance theories have difficulty 
commodating the findings by Baron et al; 


amount and type of evidence favoring these 
three theories is too small to allow any firm 
conclusions about their relative merit. 

Two other positions that can be regarded 
as value theories are presented in the nex 


them (Vinokur’s utility-change-due-to-rel 
vant-arguments theory) seems viable thoug 
not fully supported at this time. 


Urry Decision THEORIES 


Two of the studies in this special issue 
(Burnstein et al.; Vinokur) tested the hy- 
pothesis that shifts on the risk dimension 
result from group-induced changes in 1 
utilities assigned by group members to the- 
outcomes ? that are associated with the avail- 
able alternatives. 1 


? The term “outcome” refers to the events that. 
may possibly happen as a result of decision. For ex- 
ample, in the choice-dilemmas item referred to in | 
Footnote 4, there are three possible outcomes: two - 
associated with the risky alternative, the new com- 
pany fails (the “unfavorable” outcome) and the 
new company proves financially successful (the 
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Burnstein et al. present evidence that (a) 
decisions on choice-dilemmas items, both be- 
fore and after group discussion, can be pre- 
dicted from knowledge of the utilities assigned 
to the outcomes, (5) group-induced shifts in 
individual risk taking can be predicted from 
changes in utilities, and (c) group discussions 
about how to rate the utilities of the outcomes 
induce shifts that are comparable in direction 
and size to those produced by conventional 
discussions of what level of risk to take. The 
latter finding resembles somewhat Clark et 
als discovery of a risky shift in the argu- 
ments-only condition. 

Though the Burnstein et al. study is pro- 
vocative, it appears to have a major methodo- 
logical weakness: the method by which the 
utility of outcomes was supposedly measured 
can be construed as a simple rating scale for 
attitude toward the cautious alternative. Con- 
sider the following example given in their 
article. The subject is asked to locate the 
cautious alternative (keeping the present job) 
on the following scale: 


Low High 
value value 
Take new job; Take new job; 
company fails company proves 
financially 
successful 


Tf he attends to the words above the line (as 
he may very well do, since they are quite 
salient), he is simply rating the cautious al- 
ternative somewhere between *low value" and 
“high value." This is indistinguishable from 
attitude measurement, 

This methodological critique is particularly 
important if we assume that attitudes are 
also being measured by the risk-level-pref- 
erence method typically employed with 
choice-dilemmas items? because this would 
mean that the first two findings in the Burn- 
stein et al. paper are due to self-correlation 


“favorable” outcome), and one associated with the 
cautious option, moderate remuneration in the pres- 
ent job. 

10 Cartwright, in this special issue, seems to sup- 
port this assumption in his statement that “shifts 
would best be conceived as instances of the effect of 
group discussion on attitudes.” 
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and that the third is a simple example of the 
choice shift. Utility concepts would not be 
needed under this assumption because, as was 
mentioned earlier, other nonutility theories 
are capable of dealing with shifts in attitudes 
toward the basic alternatives. The assump- 
tion that attitudes are being measured in the 
typical experiment is not idle speculation 
because, as was pointed out in the /ntroduc- 
tory Review, there is evidence (Fraser, Gouge, 
& Billig, 1970) that ordinary risky and cau- 
tious shifts are found when Likert-type atti- 
tude-measurement scales are used with choice- 
dilemmas stems. 

Vinokur's article reports on four experi- 
ments. In the first experiment, he repeated 
part of the Burnstein et al. study and suc- 
cessfully replicated their findings a and b. In 
the fourth experiment, which involved no 
measurements of preferred risk level, he found 
that group discussion produced shifts in 
placement of the cautious alternative on the 
Burnstein et al. scale reproduced above. He 
interprets this result as evidence about the 
causal relationship between shifts in utility 
and shifts in risk preference. Since, as he 
claims to have shown, shifts in utility can 
occur when no risk-level decisions are being 
made, it follows that utility shifts are not 
caused by risk-preference shifts. Hence, he 
argues that the correlation between these two 
kinds of shifts, which he found in his first 
experiment, results from the impact of utility 
shifts on risk-level shifts. 

The trouble with this conclusion is that as 
was observed earlier, there is good reason to 
question the validity of the Burnstein et al. 
method of measuring utility. An alternative 
interpretation of the findings in Vinokur's 
fourth study would be that an ordinary 
choice shift was produced, with attitudes 
toward the cautious alternative (and, by im- 
plication, the risky alternative) being mea- 
sured by a different kind of scale than usually 
employed. Such a shift could be explained 
with almost any theory, not necessarily a util- 
ity theory. 

In his second and third experiments, Vino- 
kur used much better methods of measuring 
utility, involving separate ratings of the three 
possible outcomes associated with each prob- 
lem, He found (a) some correspondence be- 
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tween the actual probabilities chosen and 
those predicted from knowledge of indi- 
vidual utilities and (5) moderately to very 
close correspondence between the extent of 
actual shift in risk taking and the extent of 
shift predicted from changes in utilities. 
While these two experiments demonstrate 
quite convincingly that utility changes ac- 
company shifts in risk preference, the further 
conclusion that utility changes cause shifts in 
risk preference seems unwarranted at this 
time. The data seem equally compatible with 
the alternative position that utility changes 
are caused by shifts in risk preference. 

Vinokur interprets his results as support 
for his theory that arguments heard in a 
group discussion produce utility changes 
which, in turn, produce shifts (Vinokur, 
1971). This is clearly a version of relevant- 
arguments theory, which augments this theory 
by specifying the precise impact of arguments 
on the decision maker. As pointed out earlier, 
relevant-arguments theory has a lot of sup- 
port. Hence, if Vinokur can demonstrate that 
changes in utility really do cause preference 
shifts and not vice versa, his utility-changes- 
due-to-relevant-arguments theory will be in a 
really strong position. However, as is true of 
all relevant-arguments theories, it would 
appear to be a contender for explaining only 
that part of the shift which is added by group 
discussion over and above that produced by 
information exchange. 

It should be noted that Vinokur does not 
regard his theory as a version of value theory. 
But it can easily be adapted in that direction. 
We need only postulate that values, of the 
kind presumably measured in the Clark et al. 
and Lamm et al. studies, determine the bias 
in the arguments encountered in group dis- 
cussion and thereby the direction of utility 
changes. Vinokur's theory can also, appar- 
ently, be reconciled with Burnstein's leader- 
confidence theory if we assume that high-risk 
individuals on risk-oriented items and low- 
risk individuals on caution-oriented items 
dominate the discussions by virtue of their 
greater confidence. 

Some of the findings reported by Vinokur 
in this special issue seem inconsistent with 
another version of utility theory: Miller's 
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(1970) hypothesis that risky and cautious 
shifts emerge from group convergence on the 
utilities associated with the various possible 
outcomes. This hypothesis is called into ques- 
tion by the finding of substantial s/ifts in 
utility that parallel risky and cautious shifts. 


Group Decision MODELS 


In the last article of this special issue, 
Castore, Peterson, and Goodrich test the pre- 
dictive efficiency of three mathematical mod- 
els of group decision making which are clearly 
related to the majority-rule principle. Two of 
these models, “majority rule with considera- 
tion of indifference (MR/in)” and “aspira- 
tion level (AL)” were quite successful in 
predicting the direction and extent of group 
shifts on risk-oriented items (32 right out of 
40 in both cases). However, the errors of pre- 
diction were all in the direction of underesti- 
mating the extent of risky shift, which sug- 
gests that there is more to the shift than cov- 
ered by these models. It will be recalled that 
Castore et al. postulate a two-process theory 
to account for these errors of prediction, 
which combines a majority-rule model with 
Burnstein’s leader-confidence ideas. 

Castore et al. link their MR/in model to 
value theory, arguing that risk-oriented val- 
ues cause many people to prefer for their 
second choice a riskier option than that most 
preferred. The need to compromise in group 
discussion makes some of these people move 
to their second choice—and thus a risky shift 
is produced. This link reconciles these au- 
thors’ approach with the large amount of 
evidence supporting the importance of values, 
which was reviewed earlier in this article. 

In support of the link to value theory, Cas- 
tore et al. cite evidence that second choices 
were usually more risky than first choices. 
(Their method involved an adaptation of one 
devised by Siegel.) On the other hand, in an 
unpublished study, the present author 
achieved just the opposite results: second 
choices were more cautious than first choices 
on risk-oriented items. (The method em- 
ployed to obtain second choices was very 
simple: the subjects were instructed to cover 
up their first choices and decide among the 
remaining alternatives.) The discrepancy be- 
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» tween these two studies is probably due to 
the different methods used. It clearly needs 
unraveling. Castore's second-choice theory 
also faces the seemingly difficult problem of 
accounting for the fact that full shifts are 
found for subjects who merely observe à 
group discussion without participating 
(Lamm, 1967). It is hard to understand why, 
as nonmembers, these subjects would feel that 
they had to compromise with members of the 
group and move to their second choices. 

This theory would also seem to be in diffi- 
culty because of the questions raised in the 
- Introductory Review about — majority-rule 
. models in general. There are two studies that 
"create problems: Vinokur (1969) and Wal- 
» lach and Mabli (1970). Both failed to find a 
relationship between the direction in which 
initial decisions were skewed across the mem- 
bers of a group and the extent of shift on 
risk-oriented items. On the other hand, Cart- 
wright presents evidence in favor of majority- 
rule and similar models in his article in this 
special issue. 

In summary, the evidence concerning all 
majority-rule models, and in particular Cas- 
tore's second-choice version, is mixed. Any of 
the first three value theories would appear to 
be a stronger contender for explaining that 
part of the shift attributable to exchange of 
information about initial preferences. Still 
the strength of the Castore et al. findings is 
such that these models should be taken seri- 
ously and more research done on them. 
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CONCLUSIONS TO THE CONCLUSIONS 


The articles in this special issue have 
clearly moved us toward a fuller understand- 
ing of the nature of the choice shift and the 
mechanisms underlying it. Yet there is room 
| for a great deal more research on this phe- 
nomenon; and this research seems worth do- 
ing given the apparent generality of the phe- 
nomenon and the opportunity it seems to 
offer for revitalizing the field of group dy- 
namics. Several specific research suggestions 
have already been mentioned. A few other, 
more general points will now be made. 

More attention should be paid to the rela- 
ionship between item characteristics and the 
direction and extent of shift. The work of 
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Stoner (1968) and of Johnson and Andrews 
in this special issue suggests that we are close 
to being able to predict shifts in risk taking 
on the basis of item characteristics. But very 
little research of this kind has been done. 
Even more pressing is the need for research 
on shifts in non-risk-involving material. We 
know that some material of this kind exhibits 
shifts and some does not, but we do not know 
how to predict the difference. Efforts to 
understand the relationship between item 
characteristics and shifts should be useful for 
two reasons: (a) clarifying the direction in 
which what we know about group-induced 
shifts can be generalized, and (5) gaining 
further understanding of the dynamics under- 
lying these shifts, An example of a study 
that would fill the latter need is one in 
which groups discuss non-risk-involving items, 
decisions on which reflect perceptions of one's 
own ability. If Jellison and Riskind (1970) 
are right, all such items should shift in the 
direction of attributing greater ability to one- 
self. 

Another need is for greater clarity about 
the nature of values and an application of 
what is known about values to the choice 
shift. Various ideas about values have been 
articulated by choice-shift theorists: for ex- 
ample, values lead to a desire to be ahead of 
the crowd (Brown, 1965), values are ideal 
points on a dimension (Levinger & Schneider, 
1969), values are personal ego ideals (Lamm 
et al.), values are socially induced standards 
(Brown, 1965). But there is little agreement 
about the meaning of “value” and no sys- 
tematic thinking about the structure and func- 
tion of these entities. It seems particularly 
important to link the notion of value as used 
by theorists in this area to concepts of social 
need. For example, the work of Jellison and 
Riskind (1970) suggests that the value of 
risk, if such exists, may be closely related to 
need for achievement)! But nobody has as 
yet explored this possibility. 

This section ends with a plea for more 
studies of the process by which shifts come 
about. Process studies have been a lot less 
common in this field than they should have 


11 This idea was suggested to the author by Joel 
Raynor. 
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been; indeed it seems almost incredible that 
Silverthorne, whose article is included in this 
special issue, was the first researcher to do a 
thorough examination of the ratio of risky 
and cautious arguments in group discussion, 
and that the Vinokur article is the first to 
report the use of measures of change in the 
utility of individual outcomes. Questions such 
as the following need to be answered by 
means of careful content analysis of discus- 
sions; Are group members who take greater 
initial risk more talkative on risk-oriented 
items and less talkative on caution-oriented 
items? Do other group members differentially 
reinforce the expression of risky and cautious 
sentiments? We also need more studies of the 
kind done by Baron et al. and before them 
Pilkonis and Zanna (1969), Vidmar (1970), 
and Wallach and Mabli (1970), in which pre- 
dictions about the effect of conditions on the 
direction and extent of shift are derived from 
process notions. 

In designing studies of the latter kind, it 
should be kept in mind that the final theory 
may well specify two or more processes. Hence 
our designs need to tell us how much of the 
variance in the full choice shift is accounted 
for by the process under study, not simply 


whether this process is capable of producing 
a shift. 
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